Hi,
I have a parquet based table and can successfully select it within Hive and Impala,
but if I want to select from that table in shark, I receive the error:
14/04/17 11:33:49 INFO parse.ParseDriver: Parse Completed
14/04/17 11:33:49 INFO parse.SharkSemanticAnalyzer: Get metadata for source tables
FAILED: Hive Internal Error: java.lang.RuntimeException(java.lang.ClassNotFoundException: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat)
14/04/17 11:33:50 ERROR shark.SharkDriver: FAILED: Hive Internal Error: java.lang.RuntimeException(java.lang.ClassNotFoundException: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat)
java.lang.RuntimeException: java.lang.ClassNotFoundException: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat
at org.apache.hadoop.hive.ql.metadata.Table.getInputFormatClass(Table.java:306)
at org.apache.hadoop.hive.ql.metadata.Table.<init>(Table.java:99)
at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:988)
at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:891)
at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getMetaData(SemanticAnalyzer.java:1083)
Where is this class included? what to do/link/install/configure to get rid of the error?
I am using CDH5, parquet libs are in /opt/cloudera/parcels/CDH/lib/parquet
thanks in advance, Gerd