Member since
08-16-2015
97
Posts
16
Kudos Received
12
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
893 | 07-11-2021 08:05 PM | |
1680 | 07-11-2021 06:37 PM | |
39656 | 06-04-2021 12:01 AM | |
1059 | 06-03-2021 11:43 PM | |
3479 | 04-26-2021 06:58 PM |
04-21-2021
09:59 PM
Hi, This is Cloudera Express 5.10.0
... View more
04-21-2021
06:43 PM
Hello An example: https://stackoverflow.com/questions/38086684/hadoop-fs-du-h-sorting-by-size-for-m-g-t-p-e-z-y
... View more
04-21-2021
05:44 PM
Hello Check your client library files are in the CLASSPATH Caused by: java.lang.NoClassDefFoundError: Could not initialize class org.janusgraph.diskstorage.es.rest.RestElasticSearchClient
... View more
04-21-2021
05:38 PM
Hello I am not sure the data in your Hive table got duplicates? Maybe try SELECT DISTINCT?
... View more
04-21-2021
05:27 PM
1 Kudo
Hello A good starting point: https://www.cloudera.com/tutorials.html
... View more
04-21-2021
12:17 AM
Hello Impala doesn't support parameterized view Some walk arounds been discussed here: https://stackoverflow.com/questions/52063217/create-parameterized-view-in-impala
... View more
04-21-2021
12:12 AM
Hello One example: https://stackoverflow.com/questions/44235019/delete-files-older-than-10days-on-hdfs
... View more
04-21-2021
12:07 AM
Hello Here is a good post FYR https://medium.com/@goyalsaurabh66/project-tungsten-and-catalyst-sql-optimizer-9d3c83806b63
... View more
04-20-2021
11:26 AM
Thank you, I appreciate the comment. This issue occurs after a hive sql query that joins around 15 tables(some of them big) so I think broadcast join do not applies, salting would imply breaking down the query and running the joins on spark functions instead of hive sql, because of the number of tables it can be time consuming, so my question is, is there any other way to force spark do distribute the partitions evenly to executors?
... View more
04-19-2021
08:59 AM
After I have checked the permission in Ranger, I noticed that there isn't any policy setup for hive user. I did add a policy to allow access to the folder /user/admin in HDFS to hive user. The problem is I cannot save the new policy due to another security error as show below. I wonder if the Hortonworks Sandbox HDP 3.0 that I downloaded is working version 😞
... View more