- Subscribe to RSS Feed
- Mark as New
- Mark as Read
- Bookmark
- Subscribe
- Printer Friendly Page
- Report Inappropriate Content
Created on 09-28-2018 06:36 PM - edited 08-17-2019 06:22 AM
Related: ApacheCon 2018 in Montreal
With my talk in a dual language city, I thought perhaps I should do my talk in French. My college French is very rusty and my accent is very New Jersey. The two don't mix well. So let's have Apache NiFi do it for me. After publically debating this on Twitter, I decided to see if I could implement a solution.
Secret:
Most of the heavy lifting is done by Python which calls Google Translate API under the covers, automagically.
My Presentation is here: https://www.slideshare.net/bunkertor/apache-deep-learning-101-apachecon-montreal-2018-v031
Flow to Extract French
Apache Tika extracts the text from PDF or PPTX and converts to text or HTML. ( I chose text).
Run the Translate Python with a sentence extracted from the PDF or PPTX.
Let's send that translated french line to it's own slack channel.
Let's send the english to another.
And there it is:
runtranslate.sh
python3.6 -W ignore /Volumes/TSPANN/2018/talks/IOT/translate.py "$1" 2>/dev/null
translate.py
from textblob import TextBlob import sys text = "" for x in sys.argv[1:]: text += str(x) #text = sys.stdin.read() #print(text) blob = TextBlob(text) #for sentence in blob.sentences: # print(sentence.sentiment.polarity) # 0.060 # -0.341 print(blob.translate(to="fr") )
NiFi Flow
Created on 09-28-2018 09:30 PM
- Mark as Read
- Mark as New
- Bookmark
- Permalink
- Report Inappropriate Content
This is an extension of this article: