04-01-2016 01:33 AM
I am not able to read Parquet files which were generated using Java API of Spark SQL in HUE. I am using CDH 5.5.1. All I am getting is "Failed to read Parquet file." It is the same when it is uncompressed or zipped.
If I am using MapReduce Parquet Java libraries and not Spark SQL, I am able to read it. Will the Parquet format from Spark SQL be also supported in HUE?
04-01-2016 01:38 AM
That error is probably created in function below in this file: ./parcels/CDH/lib/hue/apps/filebrowser/src/filebrowser/views.py
def _read_parquet(fhandle, path, offset, length, stats):
dumped_data = StringIO()
parquet._dump(fhandle, ParquetOptions(), out=dumped_data)
logging.exception("Could not read parquet file at %s" % path)
raise PopupException(_("Failed to read Parquet file."))