Member since: 03-23-2015
Posts: 1288
Kudos Received: 114
Solutions: 98
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 5131 | 06-11-2020 02:45 PM |
| | 4595 | 04-21-2020 03:38 PM |
| | 3591 | 02-27-2020 05:51 PM |
| | 3570 | 01-23-2020 03:41 AM |
| | 23293 | 01-14-2020 07:14 PM |
08-09-2019
01:32 AM
Hi Vishal, Do you mean that you have just finished your certification exam, or that you have already received your certification? If you have your certification, the result should already have been sent to you. If you have just finished the exam, expect the result within a couple of days at most. Cheers Eric
07-28-2019
08:52 PM
Hi Tim, The doc says: "This optimization does not apply if the queries contain any WHERE, GROUP BY, or HAVING clause." Do you know whether the optimization still applies when the WHERE clause references only partition columns? Cheers Eric
07-25-2019
04:21 PM
1 Kudo
Please use the commands described here: https://oozie.apache.org/docs/4.3.0/DG_CommandLineTool.html#Getting_list_of_available_sharelib to check whether your sharelib was installed correctly. Cheers Eric
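For anyone scripting this check, here is a minimal sketch that builds the sharelib listing command from the linked page. The helper name `sharelib_list_command` and the URL `http://localhost:11000/oozie` are assumptions for illustration; substitute your own Oozie server URL.

```python
import shutil
import subprocess


def sharelib_list_command(oozie_url, lib_name=None):
    """Build the `oozie admin ... -shareliblist` command as an argument list."""
    cmd = ["oozie", "admin", "-oozie", oozie_url, "-shareliblist"]
    if lib_name:
        # Optionally restrict the listing to one sharelib, e.g. "pig".
        cmd.append(lib_name)
    return cmd


if __name__ == "__main__":
    # Assumed URL; replace with the address of your Oozie server.
    cmd = sharelib_list_command("http://localhost:11000/oozie")
    if shutil.which("oozie") is None:
        print("oozie client not found on PATH; would run:", " ".join(cmd))
    else:
        subprocess.run(cmd, check=False)
```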
07-24-2019
04:48 PM
1 Kudo
Thanks, Lars, for pointing it out. So the solution is to disable HDFS short-circuit reads for the coordinator-only Impala daemons:
a) Create a new role group and add all coordinator-only Impala daemon hosts to it.
b) Go to "CM -> Cluster -> Impala services -> Configuration".
c) Add the following property to "Impala Daemon HDFS Advanced Configuration Snippet (Safety Valve)" for the role group that you just created:
<property>
<name>dfs.client.read.shortcircuit</name>
<value>false</value>
</property>
d) Save the changes and restart the affected Impala Daemon instances.
Hope this helps. Cheers Eric
07-24-2019
04:29 AM
Hi, A COORDINATOR_ONLY Impala daemon does not perform data reads; it only coordinates query execution by distributing work to the executors. Why do you need to enable this for COORDINATOR_ONLY Impala daemons? Or is it that all your COORDINATOR_ONLY Impala daemons fail with this error? Also, have you checked whether the socket /var/run/hdfs-sockets/dn exists on that Impala daemon host? Cheers Eric
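A quick way to run that last check on the host is a small sketch like the one below. The helper name `is_unix_socket` is hypothetical; the path is the one quoted above.

```python
import os
import stat


def is_unix_socket(path):
    """Return True if `path` exists and is a UNIX domain socket."""
    try:
        mode = os.stat(path).st_mode
    except OSError:
        # Missing path, or no permission to stat it.
        return False
    return stat.S_ISSOCK(mode)


if __name__ == "__main__":
    path = "/var/run/hdfs-sockets/dn"  # socket path from the post
    state = "is a socket" if is_unix_socket(path) else "is missing or not a socket"
    print(path, state)
```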
07-15-2019
03:35 AM
Hi, The purpose of compression is to save space, not to speed up queries. Compression actually adds overhead, because the data has to be decompressed before it can be read, so I would expect queries against compressed data to be slightly slower than queries against uncompressed data. What you are seeing looks completely normal to me. Cheers Eric
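The trade-off can be seen in a small, self-contained sketch. It uses Python's gzip module on made-up sample data (not Impala or its codecs), so the sizes and timings are only illustrative.

```python
import gzip
import time

# Hypothetical sample data: repetitive rows compress well.
raw = b"2019-07-15,click,user_42,OK\n" * 200_000

compressed = gzip.compress(raw)

# Reading compressed data means paying for this extra step first.
start = time.perf_counter()
restored = gzip.decompress(compressed)
decompress_seconds = time.perf_counter() - start

print(f"raw size:        {len(raw)} bytes")
print(f"compressed size: {len(compressed)} bytes")
print(f"decompress time: {decompress_seconds:.4f} s")
```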
07-15-2019
03:12 AM
1 Kudo
Hi, I do not think there is any difference. Spark executes statements lazily, so your second, two-job version will behave the same way as the first, single-job version, in my opinion. Cheers Eric
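To illustrate the idea of lazy execution, here is a toy sketch in plain Python. It is an analogy only, not the Spark API: `LazyDataset` is a hypothetical class, and it assumes the two versions differ only in how the transformations are split across statements.

```python
class LazyDataset:
    """Toy stand-in for a lazily evaluated dataset (not the Spark API)."""

    def __init__(self, source, steps=()):
        self._source = source
        self._steps = tuple(steps)

    def map(self, fn):
        # Only record the transformation; nothing is computed yet.
        return LazyDataset(self._source, self._steps + (("map", fn),))

    def filter(self, fn):
        return LazyDataset(self._source, self._steps + (("filter", fn),))

    def plan(self):
        return [name for name, _ in self._steps]

    def collect(self):
        # The "action": run every recorded step now.
        data = iter(self._source)
        for name, fn in self._steps:
            data = map(fn, data) if name == "map" else filter(fn, data)
        return list(data)


calls = []


def double(x):
    calls.append(x)  # record that real work happened
    return x * 2


def is_big(x):
    return x > 4


source = [1, 2, 3, 4]

# Version 1: one chained statement.
one_step = LazyDataset(source).map(double).filter(is_big)

# Version 2: the same work split across two statements.
doubled = LazyDataset(source).map(double)
two_step = doubled.filter(is_big)

assert calls == []  # nothing has executed yet in either version
print(one_step.plan() == two_step.plan())  # True: identical recorded plans
print(one_step.collect(), two_step.collect())  # [6, 8] [6, 8]
```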
07-11-2019
05:36 PM
Hi Michael, Thanks for the info. I am curious whether version 2.6.1 will work. 2.6.4 is marked as C6 compatible, but I am not sure whether it is actually meant for C6. Cheers Eric
07-10-2019
04:48 PM
Hi, It looks like you have set the configuration hive.query.redaction.rules somewhere, though I can't see it in the command you ran. A couple of things to check:
1. Run "which beeline" to confirm that you are running the original beeline command, not a custom one.
2. Check the ~/.beeline directory for any user-level files that might override the default settings.
3. Run beeline as different users to see whether there are any differences.
Thanks Eric
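Checks 1 and 2 can be gathered in one go with a small sketch like this. The helper name `beeline_checks` is hypothetical; it only reports what it finds and does not interpret any beeline settings.

```python
import os
import shutil


def beeline_checks(home=None):
    """Collect the facts from checks 1 and 2: resolved binary and user-level files."""
    home = home or os.path.expanduser("~")
    user_dir = os.path.join(home, ".beeline")
    return {
        # Check 1: same lookup as `which beeline`; None if not on PATH.
        "beeline_path": shutil.which("beeline"),
        # Check 2: files under ~/.beeline that could override defaults.
        "user_files": sorted(os.listdir(user_dir)) if os.path.isdir(user_dir) else [],
    }


if __name__ == "__main__":
    for key, value in beeline_checks().items():
        print(f"{key}: {value}")
```

Running it as each user in turn covers check 3 as well.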