Our IT Architecture team has closed off all access to our data storage except the Hue web interface, so all we can do is submit Hive and Impala queries. However, my job often requires pulling data to local storage, usually more than 100000 rows.
When viewing the results of a Hive query, there is an export option on the left side of the results table. You can save the table on the cluster (not helpful in this case) or download the first 100000 rows as a CSV or Excel file. My first question: is it possible to download the table in a format other than CSV or Excel?
The more pressing question, though, is: how can we download more than 100000 rows? I know this is a horrible way to handle data transfer, but it is the only option our architecture team has left open to us.
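For context, the only workaround I can think of is to split the query into pieces of at most 100000 rows each (for example by filtering on a key range), download each piece as CSV, and stitch the files together locally. Here is a minimal sketch of the stitching step. The file names `chunk_*.csv` and `merged.csv` and the function `merge_csv_chunks` are my own hypothetical names, and I am assuming every chunk has the same header row:

```python
import csv
from pathlib import Path


def merge_csv_chunks(chunk_paths, output_path):
    """Concatenate CSV chunks that share one header row.

    Writes the header once, then all data rows in the order the
    chunks are given. Returns the number of data rows written.
    """
    rows_written = 0
    header = None
    with open(output_path, "w", newline="", encoding="utf-8") as out_file:
        writer = csv.writer(out_file)
        for path in chunk_paths:
            with open(path, newline="", encoding="utf-8") as in_file:
                reader = csv.reader(in_file)
                chunk_header = next(reader, None)
                if chunk_header is None:
                    continue  # skip completely empty files
                if header is None:
                    header = chunk_header
                    writer.writerow(header)
                elif chunk_header != header:
                    raise ValueError(f"header mismatch in {path}")
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written


if __name__ == "__main__":
    # Hypothetical file names: chunks saved as chunk_000.csv, chunk_001.csv, ...
    # Zero-padded numbers keep the lexicographic sort in download order.
    chunks = sorted(Path(".").glob("chunk_*.csv"))
    total = merge_csv_chunks(chunks, "merged.csv")
    print(f"wrote {total} data rows to merged.csv")
```

This works but is tedious and error-prone for large tables, which is why I am hoping there is a better way.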