Member since
07-01-2015
460
Posts
78
Kudos Received
43
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1386 | 11-26-2019 11:47 PM | |
1325 | 11-25-2019 11:44 AM | |
9590 | 08-07-2019 12:48 AM | |
2214 | 04-17-2019 03:09 AM | |
3555 | 02-18-2019 12:23 AM |
12-11-2018
09:11 AM
It looks like a connection issue to your mysql db, is it? Also, are you sure that the hostname resolution works in your case correctly? You try mysql from command line without specifying the host (so it connects locally) but in your config there is a FQDN defined.
... View more
12-05-2018
04:14 AM
1 Kudo
Actually I got NULL value of columns in created new table which impala could return right results. So if your problem solved may give some suggestion for this. Thanks.
... View more
11-21-2018
10:52 PM
@Tomas79, Thank you for the inputs
... View more
11-21-2018
08:31 AM
Thanks a lot for your help and your time @Tomas79, I will try this solution.
... View more
11-21-2018
01:54 AM
Thank you! I have change it. Also I have changed my paths. Because the path is for directory and not for a file. I have also added a / to my path. Now I get results which I have expected. I changed "setMaster to "local" because it is just a small Cloudera VM without cluster. This is a simple Spark script which can be executed in hue per Spark editor: from pyspark import SparkContext, SparkConf appNameTEST ="my first working application" conf = SparkConf().setAppName(appNameTEST).setMaster("local") sc = SparkContext(conf=conf) text_file = sc.textFile("hdfs:///user/hive/warehouse/TEST/FilePath") counts = text_file.flatMap(lambda line: line.split(" ")) \ .map(lambda word: (word, 1)) \ .reduceByKey(lambda a, b: a + b) counts.saveAsTextFile("hdfs:///user/hive/warehouse/TEST/RESULT")
... View more
11-20-2018
08:26 AM
hdfs fsck will give you answers for your questions, there are multiple command line parameters, you can show the block's location as well.
... View more
11-16-2018
09:04 AM
@bgooley @Tomas79 I ended up loging in a user when the EMR is launching in the bootstrap action.I did this via curl commands.This will avoid any user being given super user status. For anyone needing guidance on the workaround you could follow the below steps. 1) Curl command to get the cookie.txt(it has the session id and csrf token) file. 2) Curl command to login(You have to grep from the cookie.txt file the session id and csrftoken) If anyone has a better idea please let me know
... View more
11-09-2018
02:26 AM
It worth to check if the use case is actually suited for using HDFS's NFS Gateway role[1] which is designed for such remote cluster access.
[1] - Adding and Configuring an NFS Gateway - https://www.cloudera.com/documentation/enterprise/5-12-x/topics/admin_hdfs_nfsgateway.html
... View more
11-03-2018
04:54 AM
Hi Tomas, Thanks for providing the solution. Can you please accept your findings as the solution so that it would be obvious to other community users? Cheers
... View more