1973
Posts
1225
Kudos Received
124
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 1915 | 04-03-2024 06:39 AM | |
| 3011 | 01-12-2024 08:19 AM | |
| 1644 | 12-07-2023 01:49 PM | |
| 2421 | 08-02-2023 07:30 AM | |
| 3364 | 03-29-2023 01:22 PM |
11-08-2017
02:10 PM
1 Kudo
Someone has an InfluxDB Writer https://github.com/fsauer65/NiFi-Extensions/tree/master/nifi-influxdb-bundle/nifi-influxdb-processors/src/main/java/org/apache/nifi/influxdb
... View more
10-27-2017
06:15 PM
3 Kudos
If you have not attended a DataWorksSummit, I highly recommend it. It is an amazing event held at three locations a year and is a great community experience. The content is deep and highly technical and you will learn about the current state of the art and what is coming next. It's not just Big Data, but AI, Streaming, Microservices, Containers, Cloud and many other topics that startups and enterprises alike need to know. My topic was a simple talk on using Apache NiFi to ingest and transform various data types. There is a small group forming around my quickly released Inception V3 TensorFlow Apache NiFi Processor, I encourage you to try it and provide feedback, pull requests, bug reports, documentation, unit tests, examples and more. The Java API for TensorFlow is new so this is really basic. Thanks to @Simon Elliston Ball for a major cleanup on it. https://github.com/tspannhw/nifi-tensorflow-processor What do we want to do? MiniFi ingests camera images and sensor data Run TensorFlow Inception v3 to recognize
objects in image NiFi stores images, metadata and enriched data in Hadoop NiFi ingests social data and feeds NiFi analyzes sentiment of
textual data •TensorFlow (C++, Python, Java)
via ExecuteStreamCommand
•
•TensorFlow NiFi Java Custom Processor
•
•TensorFlow Running on Edge Nodes (MiniFi)
•
•
• •TensorFlow Mobile (iOS, Android, RPi)
•
•TensorFlow on Spark (Yahoo) via Livy, S2S,
Kafka
•
•TensorFlow Running in Containers in YARN 3.0
on Hadoop
• (NiFI 1.4) gRPC Call to TensorFlow Serving python classify_image.py
--image_file/dir/solarroofpanel.jpg<br>solar dish, solar collector, solar furnace (score
= 0.98316)<br>window screen
(score = 0.00196)<br>manhole cover
(score = 0.00070)<br>radiator (score
= 0.00041)<br>doormat,
welcome mat (score = 0.00041) Python Uses pip install -U textblob python -m textblob.download_corpora pip install -U spacy python -m spacy.en.download all
pip install -U nltk pip install -U numpy run.sh python sentiment.py "$@” sentiment.py
sentiment.pyfrom nltk.sentiment.vader
import SentimentIntensityAnalyzer
import sys
sid = SentimentIntensityAnalyzer()
ss = sid.polarity_scores(sys.argv[1])
print('Compound {0} Negative {1} Neutral {2} Positive {3} '.format( ss['compound'],ss['neg'],ss['neu'],ss['pos']))
These are some good Python libraries to be using. I recommend using Python 3.X unless you are stuck with 2.6/2.7. I have also created two processors for working with text/NLP, these are listed below for Apache OpenNLP and Stanford CoreNLP. Please comment in HCC (here), check out github and do pull requests (https://github.com/tspannhw) and come to a meetup (https://www.meetup.com/futureofdata-princeton/). References:
https://github.com/tspannhw/dws2017sydney https://dataworkssummit.com/sydney-2017/sessions/real-time-ingesting-and-transforming-sensor-data-and-social-data-with-nifi-and-tensorflow/ https://www.slideshare.net/Hadoop_Summit/realtime-ingesting-and-transforming-sensor-data-and-social-data-with-nifi-and-tensorflow https://hortonworks.com/blog/7-sessions-dataworks-summit-sydney-see/ https://community.hortonworks.com/articles/58265/analyzing-images-in-hdf-20-using-tensorflow.html https://community.hortonworks.com/articles/76935/using-sentiment-analysis-and-nlp-tools-with-hdp-25.html http://www.nltk.org/install.html https://github.com/tspannhw/nifi-nlp- processor https://community.hortonworks.com/articles/80418/open-nlp-example-apache-nifi-processor.html https://community.hortonworks.com/articles/81270/adding-stanford-corenlp-to-big-data-pipelines-apac-1.html
... View more
Labels:
10-20-2017
08:49 PM
ExecuteStreamCommand ExecuteStreamCommand[id=e029b4cc-cdf4-3726-0000-000000000000] Failed to write flow file to stdin due to java.io.IOException: Broken pipe: java.io.IOException: Broken pipe
java.io.IOException: Broken pipe
... View more
Labels:
- Labels:
-
Apache MiNiFi
-
Apache NiFi
10-19-2017
04:57 PM
You will have to run your algorithms on your cluster with your data to get a reasonable performance analysis. What language are you looking at? The Python Spark interface is pretty clean. http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science.html H2O has a few more algorithms than Spark MLib. https://spark.apache.org/docs/latest/ml-classification-regression.html
... View more
10-19-2017
03:20 AM
what version of windows? what JDK? use a shorter directory structure on windows like c:\nifi? how much RAM? disk space? firewall settings. You are not running JDK 9 or a weird old or new JVM? https://issues.apache.org/jira/browse/NIFI-4419
... View more
10-16-2017
01:50 PM
https://community.hortonworks.com/questions/64463/selecthiveql-processor-hive-connection-pool-config.html
... View more
10-16-2017
01:27 PM
Check out: https://community.hortonworks.com/articles/93771/connecting-nifi-to-cdh-hive.html This is due to Apache Hive 1.1 and Apache NiFi using Apache Hive 1.2.
... View more
10-16-2017
01:24 PM
See https://community.hortonworks.com/articles/93771/connecting-nifi-to-cdh-hive.html
... View more
10-09-2017
06:15 PM
There has been a major upgrade to cache in Apache NiFi 1.4 and now you can use Redis!
... View more
10-08-2017
10:34 PM
easy to integrate NiFi -> Kafka -> Spark or Storm or Flink or APEX Also NiFi -> S2s -> Spark / Flink / ...
... View more