Member since
09-05-2019
4
Posts
0
Kudos Received
0
Solutions
09-08-2019
10:53 PM
Hi @EricL, Our IT Architecture team has closed off all access to our data storage other than the Hue web interface. So we can submit Hive and Impala queries. But my job often requires pulling data to local storage, usually more than 100000 rows. When looking at the results of a Hive query, on the left side of the table is the option to export the data. You can save the table on the cluster (not helpful in this case) or download the first 100000 rows as a CSV file or an Excel file. My question refers to the possibility of downloading the table in another format, different from CSV or Excel. But I suppose the more pressing question is how can we download more than 100000 rows? I know this is a horrible way to handle data transfer, but it is the only option left open to us by our architecture team. Cheers
... View more
09-06-2019
07:52 AM
Is there a way to download the table resulting from a Hive or Impala query from Hue to a local directory in parquet format? Currently Hue only allows CSV and Excel table format.
... View more
Labels:
- Labels:
-
Apache Hive
-
Apache Impala
-
Cloudera Hue
09-05-2019
07:19 AM
Found this fix: https://github.com/cloudera/thrift_sasl/commit/d105152438ff78c2b1c041c79d1bcce25a760bb2 Unfortunately it throws other errors down the line. There seem to be a series of bugs in thrift_sasl.
... View more
09-05-2019
07:03 AM
Having what appears to be exactly the same problem, on Windows 10 Enterprise. impyla==0.15.0 thrift==0.11.0 thrift-sasl==0.3.0 But in thrift/transport/TSocket.py there absolutely is an isOpen() method. Not an attribute, but the way it is used in the code as pointed out by the error looks correct.
... View more