I have hdp cluster 2.6.5 with one master and two data nodes. And also I install stand alone a nifi server. I try to streaming csv file according to this scenario https://community.hortonworks.com/articles/52856/stream-data-into-hive-like-a-king-using-nifi.html
Nifi 1.9.2 I already installed. What should I do to stream csv file. I tried a lot of things during previous 10 days but I cannot find a solution. Everytime there is a different error when I solved previous one. I hope maybe there is a solution about this link. Please support me for this topic. Thank you very much.
One thing i observed in the article is author created hive external table but PutHiveStreaming expects internal table.
Create hive internal table
CREATE TABLE OLYMPICS(CITY STRING,EDITION INT,SPORT STRING,SUB_SPORT STRING,ATHLETE STRING,COUNTRY STRING,GENDER STRING,EVENT STRING,EVENT_GENDER STRING,MEDAL STRING) CLUSTERED BY (EDITION)INTO 3 BUCKETS ROW FORMAT DELIMITED STORED AS ORC LOCATION '/user/hive/olympics' TBLPROPERTIES('transactional'='true');
-> PutHiveStreaming processor expects input data in Avro format.
Use ConvertRecord processor to convert data from other formats(csv..etc) to AVRO format.
Then feed the avro format data to PutHiveStreaming processor.