Created on 07-20-2016 03:11 PM - edited 08-19-2019 01:30 AM
Is there anything special to get this to work?
Hive Table
create table twitter( id int, handle string, hashtags string, msg string, time string, user_name string, tweet_id string, unixtime string, uuid string ) stored as orc tblproperties ("orc.compress"="ZLIB");
Data is paired down tweet:
{ "user_name" : "Tweet Person", "time" : "Wed Jul 20 15:09:42 +0000 2016", "unixtime" : "1469027382664", "handle" : "SomeTweeter", "tweet_id" : "755781737674932224", "hashtags" : "", "msg" : "RT some stuff" }
Created 07-30-2016 08:12 PM
Not optimal, but this is a nice workaround:
Use ReplaceText processor
insert into twitter values (${tweet_id}, '${handle:urlEncode()}','${hashtag:urlEncode()}', '${msg:urlEncode()}','${time}', '${user_name:urlEncode()}','${tweet_id}', '${unixtime}','${uuid}')
So that's attributes in there.
I do url encode because of quotes and such. Would like a prepared statement or custom processor or call a groovy script. But this works.
Created 07-20-2016 03:25 PM
Is Translate Field Names set to true? That should enable the matching of the column (which appears capitalized) against the field (which is lowercase)
Created 07-20-2016 03:26 PM
Also if you don't care about that column you can set the Unmatched Column Behavior to warn/ignore
Created on 07-20-2016 03:29 PM - edited 08-19-2019 01:29 AM
i set unmatched columns to ignore
i tried true and false on field names
Created 07-20-2016 03:44 PM
I had catalog and schema name and then left them off. I tried a few options. twitter is a table in default hive database
SelectHiveQL is working fine
Created 07-30-2016 08:12 PM
That did not work.
Created 08-01-2016 03:45 AM
I don't have a column called IS_AUTOINCREMENT. that's the something should be standard in JDBC. wonder if HIVE driver missing something
Created 07-30-2016 12:28 AM
@Timothy Spann did you find a solution to this? I'm hitting the same thing with a sample 3 column hive database
Created 07-30-2016 08:12 PM
Not optimal, but this is a nice workaround:
Use ReplaceText processor
insert into twitter values (${tweet_id}, '${handle:urlEncode()}','${hashtag:urlEncode()}', '${msg:urlEncode()}','${time}', '${user_name:urlEncode()}','${tweet_id}', '${unixtime}','${uuid}')
So that's attributes in there.
I do url encode because of quotes and such. Would like a prepared statement or custom processor or call a groovy script. But this works.
Created on 07-31-2016 02:15 PM - edited 08-19-2019 01:29 AM
I ended up with the same workaround to get it flowing, agreed not optimal but its working!