Member since
11-13-2013
2
Posts
0
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
32344 | 11-16-2013 11:16 AM |
11-16-2013
11:16 AM
I solved my problem. My problem was two fold... I had a lingering old install of hive which was what my shell was defaulting to. I removed the second bad install of hive. The second problem is that the latest version of cdh seems to be broken for parquet. There's a bunch of missing jars. I followed the following guide and I am now able to use parquet files. http://analog99.wordpress.com/2013/01/07/setting-up-stats-db-in-hive/
... View more
11-13-2013
10:56 AM
I’m facing the same problem. I installed Impala 1.1.1 and CDH4.1.2. I created a table in the impala-shell using the Parquet file format but I am not able to insert data into into. impala> create table data_parquet like data stored as parquetfile;
impala> insert into data_parquet select * from data;
Query: insert into data_parquet select * from data
ERROR: AnalysisException: Target table 'default.data_parquet' is incompatible with SELECT / PARTITION expressions.
Expression 'data.payload_user_uid' (type: BIGINT) is not compatible with column 'payload_user_type' (type: STRING) I tried the suggestion on this topic. hive> alter table data_parquet set serde 'parquet.hive.serde.ParquetHiveSerDe';
FAILED: RuntimeException java.lang.ClassNotFoundException: parquet.hive.DeprecatedParquetInputFormat I have the parquet jar in the hive lib directory. $ find /opt/cloudera/parcels/CDH/lib/hive/lib -name "parquet*.jar"
/opt/cloudera/parcels/CDH/lib/hive/lib/parquet-hive-1.0.jar
... View more