Reply
Explorer
Posts: 23
Registered: ‎09-25-2016
Accepted Solution

Hive select when column name different in parquet and impala

# Create parquet Impala table temp with a column a

# write parquet file using streaming applicaiton/ map reduce job call parquet schema for that 

 

#Impala

select a from default.temp

works and returns data

 

#hive

select a from default.temp

returns null because it tries to reference column name from parquet schema I think and it doesn't match.

 

Is there a way to force hive to read column name from metastore instead of parquet schema ?

 

 

 

 

Highlighted
Cloudera Employee
Posts: 211
Registered: ‎03-23-2015

Re: Hive select when column name different in parquet and impala

You might want to try map parquet tables by index, rather than column name:

SET parquet.column.index.access=true;
Announcements