Hive select when column name different in parquet and impala


# Create a Parquet table temp in Impala with a column a

# Write the Parquet file using a streaming application / MapReduce job that supplies its own Parquet schema, in which the column name differs from a

 

# Impala

select a from default.temp

works and returns data

 

# Hive

select a from default.temp

returns NULL, I think because Hive looks up the column by the name stored in the Parquet file schema, and that name doesn't match the column name in the table definition.

 

Is there a way to force Hive to read the column name from the metastore instead of from the Parquet schema?

1 ACCEPTED SOLUTION

You might want to try mapping Parquet columns by index rather than by column name:

SET parquet.column.index.access=true;
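
A minimal sketch of how this could look in a Hive session, reusing the table and column from the question. It assumes the Hive version in use honors this property, and that the columns in the Parquet file are in the same order as in the table definition, since index-based access matches columns by position rather than by name.

```sql
-- Resolve Parquet columns by position instead of by name (session-level setting).
SET parquet.column.index.access=true;

-- Same query as in the question. Column a is now matched to the Parquet
-- column at the same position, whatever it is named in the file schema.
SELECT a FROM default.temp;
```

Because the mapping is positional, files whose columns are added or reordered relative to the table definition may return the wrong data, so it is worth comparing a few rows against the Impala result.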
