Support Questions

Find answers, ask questions, and share your expertise

PutHiveStreaming has issue

New Contributor

Dear All,

I have hdp cluster 2.6.5 with one master and two data nodes. And also I install stand alone a nifi server. I try to streaming csv file according to this scenario https://community.hortonworks.com/articles/52856/stream-data-into-hive-like-a-king-using-nifi.html

Nifi 1.9.2 I already installed. What should I do to stream csv file. I tried a lot of things during previous 10 days but I cannot find a solution. Everytime there is a different error when I solved previous one. I hope maybe there is a solution about this link. Please support me for this topic. Thank you very much.

2 REPLIES 2

Super Guru

@Bora Özkan

One thing i observed in the article is author created hive external table but PutHiveStreaming expects internal table.

Create hive internal table

CREATE TABLE
OLYMPICS(CITY STRING,EDITION INT,SPORT STRING,SUB_SPORT STRING,ATHLETE STRING,COUNTRY STRING,GENDER STRING,EVENT STRING,EVENT_GENDER STRING,MEDAL STRING)
CLUSTERED BY (EDITION)INTO 3 BUCKETS
ROW FORMAT DELIMITED
STORED AS ORC 
LOCATION  '/user/hive/olympics'
TBLPROPERTIES('transactional'='true');

-> PutHiveStreaming processor expects input data in Avro format.

Use ConvertRecord processor to convert data from other formats(csv..etc) to AVRO format.

Then feed the avro format data to PutHiveStreaming processor.

New Contributor

hi @Shu thanks for answering, I already create an internal table and I got an error like that:

109476-whatsapp-image-2019-06-23-at-181843.jpeg