Created 10-02-2016 02:09 PM
I want to move data from hdfs to hive using puthivestreaming of nifi. does anyone have example?
Created on 10-03-2016 06:51 AM - edited 08-19-2019 04:39 AM
Hi
Hive streaming is supported against tables having the following :
Also, Hive must have the following properties set
Please follow the documentation here https://cwiki.apache.org/confluence/display/Hive/Streaming+Data+Ingest
PutHiveStreaming requires your input data to be in Avro format, as is provided in the documentation here:
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.hive.PutHiveStreaming/
Considering your data is JSON, we can construct a Nifi workflow as follows:
ListHDFS--> FetchHDFS--> ConvertJsonToAvro-->PutHiveStreaming
Let me know if this helps.
Created on 10-03-2016 06:51 AM - edited 08-19-2019 04:39 AM
Hi
Hive streaming is supported against tables having the following :
Also, Hive must have the following properties set
Please follow the documentation here https://cwiki.apache.org/confluence/display/Hive/Streaming+Data+Ingest
PutHiveStreaming requires your input data to be in Avro format, as is provided in the documentation here:
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.hive.PutHiveStreaming/
Considering your data is JSON, we can construct a Nifi workflow as follows:
ListHDFS--> FetchHDFS--> ConvertJsonToAvro-->PutHiveStreaming
Let me know if this helps.
Created 10-04-2016 01:08 PM
Great answer! Just to add a caveat, if you are using HDF 2.0 and HDP 2.5, please see the following: https://community.hortonworks.com/questions/59681/puthivestreaming-nifi-processor-various-errors.htm...