Backend 1:Decompressor: block size is too big. hbase impala

I have an HCatalog-registered table mapped to an HBase table, and I am trying to use Impala to query it.

I get the following error when I try to query it:

Backend 1:Decompressor: block size is too big. Data is likely corrupt. Size: 2564977884

 

The query is very simple, but it sorts on the suffix portion of the key:

 

select *
from stage.acct_txn_hbasetest
order by substr(key, locate('|',key,39)+1,14) desc
limit 25
;
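
To make the sort expression concrete, here is a minimal Python sketch of what `substr(key, locate('|',key,39)+1,14)` extracts from each row key. The helper functions only mimic the 1-based behavior of Impala's `locate` and `substr` for positive positions, and the example key (a 38-character prefix, a `|`, then a 14-character suffix) is a made-up layout for illustration, not the real data in `stage.acct_txn_hbasetest`.

```python
def locate(needle: str, haystack: str, pos: int = 1) -> int:
    """1-based position of the first `needle` at or after 1-based `pos`; 0 if absent."""
    if pos < 1:
        return 0
    # str.find is 0-based and returns -1 when nothing is found, so +1 maps to 1-based / 0.
    return haystack.find(needle, pos - 1) + 1


def substr(s: str, start: int, length: int) -> str:
    """Up to `length` characters beginning at 1-based `start` (positive starts only)."""
    return s[start - 1 : start - 1 + length]


def sort_key(key: str) -> str:
    """Mirror of the ORDER BY expression: 14 characters after the first '|' at or past position 39."""
    return substr(key, locate("|", key, 39) + 1, 14)


# Hypothetical row key: 38-character prefix, '|', then a 14-character suffix.
example_key = "A" * 38 + "|" + "20150131235959"
print(sort_key(example_key))
```

Since the sort key is computed from the tail of the row key rather than its leading bytes, I would expect every row to be read and the expression evaluated before the top 25 can be picked, which may be part of why the query is heavy.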

 

 

Hive resolves the same query in 300 seconds.

I figured Impala would be faster.

 

I guess I will have to try out Spark SQL.

 

 
