Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

Hive select when column name different in parquet and impala


# Create a Parquet-backed Impala table temp with a column a

# Write the Parquet file with a streaming application / MapReduce job that defines its own Parquet schema, in which the column name differs from a

 

# Impala

select a from default.temp

works and returns data

 

# Hive

select a from default.temp

returns NULL, I think because Hive looks the column up by name in the Parquet schema and the name doesn't match.

 

Is there a way to force Hive to read the column names from the metastore instead of the Parquet schema?

1 ACCEPTED SOLUTION

You might want to try mapping Parquet columns by index rather than by column name:

SET parquet.column.index.access=true;
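
To spell that out, here is a minimal sketch of a Hive session that reuses the table and column names from the question. As far as I know, Hive matches Parquet columns by name by default, while Impala matches them by position by default (its PARQUET_FALLBACK_SCHEMA_RESOLUTION query option), which would explain the difference described above. The LIMIT clause is only added for illustration.

```sql
-- Run in the Hive session before querying.
-- Resolve Parquet columns by their position in the table definition
-- instead of by the names stored in the Parquet file schema.
SET parquet.column.index.access=true;

-- With positional mapping, table column a reads the Parquet column at the
-- same position, whatever that column is called inside the file.
-- (LIMIT 10 is an illustrative addition, not part of the original query.)
SELECT a FROM default.temp LIMIT 10;
```

Positional mapping assumes the table declares its columns in the same order as they appear in the Parquet file, so it is worth checking that order before relying on this setting.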
