Support Questions

cjervis · ‎02-07-2020

Hello,

I am trying to custom parquet-avro schema creation for a table which is taken from Kafka Topic using Java Avro API. Output parquet file is worn with Hive table table. Decimal fields are created as fixed_len_byte_array on the schema. The Hive table can be queried using Hive and Spark. But, when impala query is used, it gives like below error.

I try to define byte array size according to cloudera impala documentation

1 <= precision <=9, then 4 bytes

10<= precision <=18, then 8 bytes

precision>18, then 16 bytes.

In the impala query screen on Cloudera Hue. It shows Hive table description.

I am using array size detection for fixed_len_byte_array like below (accrording to https://issues.apache.org/jira/browse/IMPALA-2515)

But it gives above error. What is the formula for catching expected array size for Impala?

lwang · ‎02-10-2020

Hi @Onur ,

I did a google search and found this existing Impala bug:

https://issues.apache.org/jira/browse/IMPALA-7087

Do you think maybe you are hitting this one?

Thanks!

Li Wang, Technical Solution Manager

Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

Learn more about the Cloudera Community:

Terms of Service

Community Guidelines

How to use the forum

Onur · ‎02-13-2020

Hi Iwang,

Thank you for your response. I tried with

TBLPROPERTIES ('OBJCAPABILITIES'='EXTREAD,EXTWRITE

hive table properties for creating table.

But, as Yongzhi stated that it is not solution for the fixed-length type.

My hive table creation script is;

CREATE TABLE ods.kds_limit_log(ONERILENLIMITADI STRING,PROPOSEDCOLLATERALTYPE STRING,ONERILENMAXVADE STRING,ONERILENDOVIZ STRING,KDSREFORDER DECIMAL(3,0),LASTUPDATED DECIMAL(15,0),ONERILENMORTGAGELIMITTUTARI DECIMAL(25,2),OID DECIMAL(16,0),ONERILENKGFLIMITTUTARI DECIMAL(25,2),MEVCUTMAXVADE STRING,STATUS DECIMAL(1,0),MEVCUTLIMITTUTARI DECIMAL(25,2),ENTERANCEDATE DECIMAL(8,0),ONERILENLIMITTUTARI DECIMAL(25,2),MEVCUTDOVIZ STRING,MEVCUTRISKTUTARI DECIMAL(25,2),ONERILENLIMITREFERANS STRING,ENTERANCETIME DECIMAL(6,0),MEVCUTLIMITADI STRING,EXISTINGCOLLATERALTYPE STRING,ONERILENEASILIMITTUTARI DECIMAL(25,2),KDSREFNO STRING,MEVCUTLIMITREFERANS STRING) STORED AS PARQUET LOCATION '/data/ods.db/kds_limit_log/' TBLPROPERTIES ('OBJCAPABILITIES'='EXTREAD,EXTWRITE')

Parquet file schema;

It can be run with Hive, but not with Impala

Best regards

Onur

Cloudera Community

Support Questions

invalid type length for Decimal fields when Impala query on Parquet stored Hive Table

Import RDBMS into Hive table stored as ORC with SQ...

How to optimize IMPALA/KUDU queries

Incompatible parquet Schema on Impala but queries ...

Build and use Parquet-tools to read parquet files

Hive Query against ORC table failing with serious ...

Impala Query on Hive transactional table returns 0...

Impala was not able to query hive table with colum...

Impala: Parquet error "Invalid file footer" on pip...

HBase – Query Performance when storing and retriev...

Not able to connect to IMPALA/ HIVE table