Connecting AWS ECS container with Stanford NLP to NiFi

New Contributor

This question is really a follow-up to @Timothy Spann's guide series on Stanford NLP and its use in NiFi.

Problem:

I have NiFi up in AWS, and I also have the Stanford Core NLP jar file running in an ECS task. I can't get them connected. My current flow is this:

1) GenerateFlowFile - with custom text: "Testing because I have no idea how this works?" (just under 50 bytes)

2) InvokeHTTP - POST, with URL = http://xx.xxx.xx.xxx:port (IP and port; it throws no errors)

3) ???? - I currently have the Original and Response relationships routed to a LogAttribute processor to see what comes out. For Response, when I list the queue, the flowfile looks empty in the content viewer, and when I download it, I just get the Apache Tika license agreement. The Original flowfile just has that message as an attribute.

How do I call *entity* analysis? I know the NLP server is running in that ECS task, but I have no idea how to form a correct URL for the call, or what type of processor must come after InvokeHTTP.

If I am asking the wrong question/a dumb question, please let me know. Thanks

1 REPLY

Super Guru

You can run my Stanford CoreNLP processor:

https://github.com/tspannhw/nifi-corenlp-processor/releases

If you are calling the server instead, you will have to open its ports (check which ports are open on the Amazon side):

https://stanfordnlp.github.io/CoreNLP/corenlp-server.html
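One quick way to confirm that the port is actually open from the machine running NiFi is a plain TCP connection attempt. This is only a minimal sketch: the helper name port_is_reachable is made up for illustration, and the host and port in the example call are placeholders (9000 is the CoreNLP server's documented default, but your ECS task may be mapped to something else).

```python
import socket


def port_is_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # Placeholder values: replace with the ECS task's public IP and mapped port.
    print(port_is_reachable("127.0.0.1", 9000))
```

If this returns False when run from the NiFi host, the problem is most likely networking (security group rules or port mapping) rather than the NiFi flow itself.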

Test from the command line first with curl.

https://wiki.apache.org/tika/TikaAndNER
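For the "how do I call entity analysis" part, the CoreNLP server docs linked above describe sending the text as the POST body and passing the annotators in a URL-encoded properties parameter. Here is a rough sketch of that call, outside NiFi, assuming a stock CoreNLP server with JSON output. The function names are made up for illustration, and the response layout used in extract_entities (sentences, then tokens with word and ner fields) is an assumption about that JSON output, so check it against what your server really returns.

```python
import json
import urllib.parse
import urllib.request


def build_corenlp_url(host, port, annotators="tokenize,ssplit,pos,lemma,ner"):
    """Build the server URL with the annotator list in the `properties` query parameter."""
    props = {"annotators": annotators, "outputFormat": "json"}
    query = urllib.parse.urlencode({"properties": json.dumps(props)})
    return f"http://{host}:{port}/?{query}"


def extract_entities(result):
    """Collect (word, tag) pairs for tokens whose NER tag is not "O" (outside any entity).

    Assumes the JSON layout sentences -> tokens -> {"word", "ner"}.
    """
    entities = []
    for sentence in result.get("sentences", []):
        for token in sentence.get("tokens", []):
            tag = token.get("ner", "O")
            if tag != "O":
                entities.append((token.get("word"), tag))
    return entities


def annotate(text, host, port, timeout=30):
    """POST raw text to the server and return the parsed JSON response."""
    request = urllib.request.Request(
        build_corenlp_url(host, port),
        data=text.encode("utf-8"),
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Placeholder host and port: replace with the ECS task's address.
    print(build_corenlp_url("localhost", 9000))
```

In NiFi terms, the string returned by build_corenlp_url is roughly what would go into InvokeHTTP's URL property with the method set to POST, since InvokeHTTP sends the flowfile content as the request body. The Response relationship should then carry JSON that a downstream processor can parse.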
