Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Hive select when column name different in parquet and impala

avatar
Contributor

# Create parquet Impala table temp with a column a

# write parquet file using streaming applicaiton/ map reduce job call parquet schema for that 

 

#Impala

select a from default.temp

works and returns data

 

#hive

select a from default.temp

returns null because it tries to reference column name from parquet schema I think and it doesn't match.

 

Is there a way to force hive to read column name from metastore instead of parquet schema ?

 

 

 

 

1 ACCEPTED SOLUTION

avatar
Super Guru
You might want to try map parquet tables by index, rather than column name:

SET parquet.column.index.access=true;

View solution in original post

1 REPLY 1

avatar
Super Guru
You might want to try map parquet tables by index, rather than column name:

SET parquet.column.index.access=true;