Member since
03-23-2015
1288
Posts
114
Kudos Received
98
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
3350 | 06-11-2020 02:45 PM | |
5064 | 05-01-2020 12:23 AM | |
2860 | 04-21-2020 03:38 PM | |
3563 | 04-14-2020 12:26 AM | |
2360 | 02-27-2020 05:51 PM |
10-26-2017
03:59 AM
What's the issue in impala? Impala don't see the new columns? Did you update table to include the new columns through Beeline? Have you validated metadata in Impala?
... View more
10-26-2017
03:57 AM
You will need to check both HS2 log and Spark application log to get the real error message. "Failed to create spark client" is too generic and it can be anything.
... View more
10-26-2017
03:55 AM
You might want to try map parquet tables by index, rather than column name: SET parquet.column.index.access=true;
... View more
10-25-2017
06:41 PM
Hmm, For JDBC and ODBC drivers, there is a setting called RowsFetchedPerBlock, the default value is 10,000. For 9.2M rows, this back and forth might cause the delay. Can you try to increase this value to see if it can help? See Doc below: http://www.cloudera.com/documentation/other/connectors/impala-jdbc/latest/Cloudera-JDBC-Driver-for-Impala-Install-Guide.pdf and search for RowsFetchedPerBlock.
... View more
10-25-2017
03:55 AM
How big is the file? And if you scp the result file within the cluster, how long does it take? This is to confirm if there are speed differences between within the cluster and across to the outside of cluster. If there is, how much different.
... View more
10-25-2017
03:49 AM
If you check the hue.ini file, which is the configuration for Hue, you can find below impala section: [impala] server_host={impala-daemon-host-url} server_port=21050 impala_principal=impala/{impala-daemon-host-url} impersonation_enabled=True server_conn_timeout=120 You can see that to update Hue to connect to a different impala, you can update value for server_host.
... View more
10-25-2017
03:45 AM
Do your users also use Hue to run Impala queries? Hue will hold the query handler open so that it can do paginations, and it will only kill the handler after user navigates away from the impala page. If user stays on the page, the handler will be kept open and the query is considered as in flight. Please confirm Hue usage first. Thanks
... View more
10-19-2017
08:14 PM
Glad that we identified the issue!
... View more
10-18-2017
07:44 PM
1 Kudo
If you are using CDH, you should already have parquet-tools command line available. If not, you can get it from here: https://github.com/apache/parquet-mr/tree/master/parquet-tools You just need to run: parquet-tools cat /path/to/parquet-file.parq Please note that the path is on local file system, not in HDFS. And then you can grep the output to search for string "error_message".
... View more