Member since
02-07-2018
5
Posts
1
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
8350 | 02-12-2018 08:24 AM |
02-12-2018
08:24 AM
1 Kudo
I should have used the following code. This specifies the correct directory that Hadoop understands. data = LOAD '/user/cloudera/midsummer.txt' as (text:CHARARRAY); upper_case = FOREACH data GENERATE UPPER(text); STORE upper_case INTO '/user/cloudera/midsummerOutput2';
... View more
02-12-2018
07:51 AM
Looks like these cloudera community help requests go into a black hole!!. I was able to find a solution myself. The following script works and was good enough for this trail. It appears that specifiying the directory paths in the pig script was the problem. I put all the scripts in my local directory and also told pig/hadoop to create to create the output directory under the current working directory. data = LOAD 'midsummer.txt' as (text:CHARARRAY); upper_case = FOREACH data GENERATE UPPER(text); STORE upper_case INTO 'midsummerOutput';
... View more
02-07-2018
04:11 PM
Update:- After some testing with a small data set. The following works (without the STORE clause):- data = LOAD '/home/cloudera/smallfile.txt' as (text:CHARARRAY); upper_case = FOREACH data GENERATE UPPER(text); But once we add the STORE or the DUMP clause at the end of the script is when those errors are thrown. It is probably pointing to some permission issues between HUE and HDFS in the "reduce" process I guess. I was hoping the cloudera quick start VM will work out of the box. Calling on the cloudera experts to help!! Please point me to any knowledge base or workarounds to avoid this problem.
... View more
02-07-2018
09:09 AM
We would like to consider cloudera for development and ultimate commercial use if I can get over these initial hurdles. Just to add - I just want to run a simple pig script using the HUE editor on a newly installed cloudera quickstart VM. - Should I login into HUE as cloudera? - Any other additonal configuration or software needs to be upgraded?
... View more
02-07-2018
08:03 AM
Hi,
I am new to hdfs/pig and need quick help. I just installed the cloudera quickstart VM using VMWare.
On running this script(the upper text sample) -
data = LOAD '/home/cloudera/midsummer.txt' as (text:CHARARRAY); upper_case = FOREACH data GENERATE org.apache.pig.piggybank.evaluation.string.UPPER(text); STORE upper_case INTO '/home/cloudera/midsummer2.txt';
It takes 3 to 4 minutes and runs the map part and does not do the reduce. I see the following error on the job workflow page:-
Cannot access: /user/hue/oozie/workspaces/hue-oozie-1452553957.19/${wf:appPath()}/pig-b118.pig/. Note: you are a Hue admin but not a HDFS superuser, "hdfs" or part of HDFS supergroup, "supergroup".
InvalidPathException: Invalid path name Invalid file name: /user/hue/oozie/workspaces/hue-oozie-1452553957.19/${wf:appPath()}/pig-b118.pig (error 400)
I logged on to Hue using the cloudera user and on seeing this error also gave hadoop group to cloudera as well. From Oozie I see the error - JA0189 - Main class [org.apache.oozie.action.hadoop.PigMain], exit code [2]
Your help will be appreciated.
Thanks
Regards
Santhosh
... View more
Labels:
- Labels:
-
Cloudera Hue
-
HDFS
-
Quickstart VM