Reply
Highlighted
New Contributor
Posts: 1
Registered: ‎07-19-2015

Error loading TwitterData from Flume into Hive.

[ Edited ]

Hi All,

 

I am trying to analyse Twitter Data using Cloudera. Currently, I am able to stream Twitter Data into HDFS via Flume but I am experiencing issues when trying to load said data into Hive metastore with the following exception:

 

java.io.IOException: org.apache.avro.AvroRuntimeException: java.io.IOException: Block size invalid or too large for this implementation: -40

 

Does this mean that the data was loaded into Hive but cannot be queried or was it not loaded into Hive at all? Any assistance on this issue is appreciated. Thanks.

Announcements
New solutions