Sentiment CoreNLP Processor

[pool-1-thread-1] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator tokenize
[pool-1-thread-1] INFO edu.stanford.nlp.pipeline.TokenizerAnnotator - No tokenizer type provided. Defaulting to PTBTokenizer.
[pool-1-thread-1] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator ssplit
[pool-1-thread-1] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator parse
[pool-1-thread-1] INFO edu.stanford.nlp.parser.common.ParserGrammar - Loading parser from serialized file edu/stanford/nlp/models/lexparser/englishPCFG.ser.gz ... done [0.4 sec].
[pool-1-thread-1] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator sentiment

FILE: Header,Header2,Header3
Value,Value2,Value3
Value4,Value5,Value6

Attribute: {"names":"NEGATIVE"}

Service Source Code

[Image: 12062-corenlpsourcecode.png]
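The full source is in the GitHub repository linked below; the screenshot above shows the processor class. As a rough sketch only (not the exact repository code), here is what the core of such a processor can look like: it reads the FlowFile text, runs the tokenize/ssplit/parse/sentiment annotators shown in the log above, and writes the result to a sentiment attribute. The package name, relationship name, and attribute handling are illustrative assumptions.

// Minimal sketch of a CoreNLP sentiment processor for NiFi.
// Illustrative only; the actual processor in the repository may differ.
package com.example.nifi.corenlp;

import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Collections;
import java.util.Properties;
import java.util.Set;

import org.apache.nifi.annotation.lifecycle.OnScheduled;
import org.apache.nifi.flowfile.FlowFile;
import org.apache.nifi.processor.AbstractProcessor;
import org.apache.nifi.processor.ProcessContext;
import org.apache.nifi.processor.ProcessSession;
import org.apache.nifi.processor.Relationship;
import org.apache.nifi.processor.exception.ProcessException;

import edu.stanford.nlp.ling.CoreAnnotations;
import edu.stanford.nlp.pipeline.Annotation;
import edu.stanford.nlp.pipeline.StanfordCoreNLP;
import edu.stanford.nlp.sentiment.SentimentCoreAnnotations;
import edu.stanford.nlp.util.CoreMap;

public class CoreNLPProcessor extends AbstractProcessor {

    public static final Relationship REL_SUCCESS = new Relationship.Builder()
            .name("success")
            .description("FlowFiles tagged with a sentiment attribute")
            .build();

    private volatile StanfordCoreNLP pipeline;

    @Override
    public Set<Relationship> getRelationships() {
        return Collections.singleton(REL_SUCCESS);
    }

    @OnScheduled
    public void buildPipeline(final ProcessContext context) {
        // Same annotators that appear in the log output above
        final Properties props = new Properties();
        props.setProperty("annotators", "tokenize, ssplit, parse, sentiment");
        pipeline = new StanfordCoreNLP(props);
    }

    @Override
    public void onTrigger(final ProcessContext context, final ProcessSession session) throws ProcessException {
        FlowFile flowFile = session.get();
        if (flowFile == null) {
            return;
        }

        // Read the FlowFile content as UTF-8 text
        final ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        session.read(flowFile, in -> {
            final byte[] buffer = new byte[8192];
            int len;
            while ((len = in.read(buffer)) != -1) {
                bytes.write(buffer, 0, len);
            }
        });
        final String text = new String(bytes.toByteArray(), StandardCharsets.UTF_8);

        // Run the sentiment annotator and keep the label of the first sentence
        final Annotation annotation = pipeline.process(text);
        String sentiment = "NEUTRAL";
        for (CoreMap sentence : annotation.get(CoreAnnotations.SentencesAnnotation.class)) {
            // Uppercased here only to match the NEGATIVE label shown in the log above
            sentiment = sentence.get(SentimentCoreAnnotations.SentimentClass.class).toUpperCase();
            break;
        }

        // Write the result to the "sentiment" attribute, as described below
        flowFile = session.putAttribute(flowFile, "sentiment", sentiment);
        session.transfer(flowFile, REL_SUCCESS);
    }
}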

JUnit Test for Processor

[Image: 12063-corenlpjunit.png]
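The screenshot above shows the project's JUnit test. As a rough sketch only, here is how such a test can be written against the illustrative processor above using NiFi's mock framework (nifi-mock); the sample sentence and the expected NEGATIVE label are assumptions, and the exact label depends on the CoreNLP model version.

// Illustrative JUnit test for the sketch above, using NiFi's TestRunner
import org.apache.nifi.util.MockFlowFile;
import org.apache.nifi.util.TestRunner;
import org.apache.nifi.util.TestRunners;
import org.junit.Test;

public class CoreNLPProcessorTest {

    @Test
    public void testNegativeSentence() {
        // Run the processor in NiFi's mock framework
        final TestRunner runner = TestRunners.newTestRunner(new CoreNLPProcessor());

        // Enqueue a clearly negative sentence as the FlowFile content
        runner.enqueue("This product is terrible and I hate it.".getBytes());
        runner.run();

        // The FlowFile should be routed to success with a sentiment attribute
        runner.assertAllFlowFilesTransferred(CoreNLPProcessor.REL_SUCCESS, 1);
        final MockFlowFile out = runner.getFlowFilesForRelationship(CoreNLPProcessor.REL_SUCCESS).get(0);
        out.assertAttributeExists("sentiment");
        // Expected label for this sentence; the exact value depends on the CoreNLP model
        out.assertAttributeEquals("sentiment", "NEGATIVE");
    }
}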

To add sentiment analysis to your NiFi data flow, just add the custom processor, CoreNLPProcessor. You can download a pre-built NAR from the GitHub repository listed below. Add it to your NiFi lib directory and restart each node.

[Image: 12064-corenlpaddprocessor.png]

The result of a run is a FlowFile attribute named sentiment:

[Image: 12067-corenlp-results.png]

You can see how easy it is to add the processor to your data flows.

[Image: 12069-corenlpoverview.png]

If you would like to add more features to this processor, please fork the GitHub repository linked below.

This is not an official NiFi processor, just one I wrote in a couple of hours for my own use and for testing.

There are five easy ways to add sentiment analysis to your Big Data pipelines:

1. Use ExecuteScript to run Python NLP scripts.
2. Call my custom processor, CoreNLPProcessor.
3. Make a REST call to a Stanford CoreNLP sentiment server (see the sketch after this list).
4. Make a REST call to a public sentiment-as-a-service API.
5. Send a message via Kafka (or JMS) to Spark or Storm to run other JVM sentiment analysis tools.
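As an illustration of the third option, here is a minimal sketch that posts text to a locally running Stanford CoreNLP server and prints the JSON response. The localhost:9000 address assumes the server was started on its default port, and the sample sentence is made up.

// Illustrative client for a locally running Stanford CoreNLP server
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.Scanner;

public class CoreNLPServerClient {
    public static void main(String[] args) throws Exception {
        // Assumes a CoreNLP server started locally, e.g.:
        //   java -mx4g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer -port 9000
        final String props = URLEncoder.encode(
                "{\"annotators\":\"tokenize,ssplit,parse,sentiment\",\"outputFormat\":\"json\"}",
                "UTF-8");
        final URL url = new URL("http://localhost:9000/?properties=" + props);

        final HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        conn.setRequestMethod("POST");
        conn.setDoOutput(true);
        try (OutputStream out = conn.getOutputStream()) {
            // The text to analyze goes in the POST body
            out.write("This product is terrible and I hate it.".getBytes(StandardCharsets.UTF_8));
        }

        // The JSON response contains a sentiment label per sentence
        try (Scanner scanner = new Scanner(conn.getInputStream(), "UTF-8")) {
            scanner.useDelimiter("\\A");
            System.out.println(scanner.hasNext() ? scanner.next() : "");
        }
    }
}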

Download a release
