Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Failed to read data from "/user/guest/Batting.csv"

avatar
New Member

running "pig 1.pig" from the tutorial http://hortonworks.com/hadoop-tutorial/faster-pig-... yields the error

Input(s): Failed to read data from "/user/guest/Batting.csv"

Output(s): Counters: Total records written : 0 Total bytes written : 0 Spillable Memory Manager spill count : 0 Total bags proactively spilled: 0 Total records proactively spilled: 0

Job DAG: null->null, null

1 ACCEPTED SOLUTION

avatar

@Matthew bird You need a home directory for the user in HDFS so here is what is needed -

#Login as root to the sandbox
su - hdfs 
hdfs dfs -mkdir /user/root
hdfs dfs -chown root:hadoop /user/root
hdfs dfs -chmod 755 /user/root

Try to run the pig script after you've done the above steps.

View solution in original post

14 REPLIES 14

avatar
New Member

definitely progress. Now it is looping: http://pastebin.com/bqgSDYdb I am not sure if this is still about permissions or not.

avatar

Check if the services are up, looks like your Job History Server may be down.

avatar
New Member

I got it working with a fresh instance of HDP_2.3.2_virtualbox.

I set root:hadoop as owner of /user/root and did the same for /user/guest and that did the trick. Thanks everyone.

avatar

I see 'Connection Refused' which means either a service is down or connection to wrong port. Like Deepesh said, appears to be former and that History server is down.

avatar
New Member

I am still facing this issue. Matthew Bird - can you help me? How did you get fresh instance?