Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Error loading TwitterData from Flume into Hive.

Highlighted

Error loading TwitterData from Flume into Hive.

New Contributor

Hi All,

 

I am trying to analyse Twitter Data using Cloudera. Currently, I am able to stream Twitter Data into HDFS via Flume but I am experiencing issues when trying to load said data into Hive metastore with the following exception:

 

java.io.IOException: org.apache.avro.AvroRuntimeException: java.io.IOException: Block size invalid or too large for this implementation: -40

 

Does this mean that the data was loaded into Hive but cannot be queried or was it not loaded into Hive at all? Any assistance on this issue is appreciated. Thanks.