- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Hive select when column name different in parquet and impala
- Labels:
-
Apache Hive
Created on ‎10-25-2017 12:04 PM - edited ‎09-16-2022 05:26 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
# Create parquet Impala table temp with a column a
# write parquet file using streaming applicaiton/ map reduce job call parquet schema for that
#Impala
select a from default.temp
works and returns data
#hive
select a from default.temp
returns null because it tries to reference column name from parquet schema I think and it doesn't match.
Is there a way to force hive to read column name from metastore instead of parquet schema ?
Created ‎10-26-2017 03:55 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
SET parquet.column.index.access=true;
Created ‎10-26-2017 03:55 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
SET parquet.column.index.access=true;
