distcp on cdh3u5 failing with timeout error

Hi,
 
  I am trying to back up some data from our ancient Hadoop cluster, which runs CDH3u5, to an S3 bucket with distcp. The job is failing because some of the tasks are killed multiple times with the following message:
 
   
Task attempt_201404090636_336528_m_000112_0 failed to report status for 600 seconds. Killing!
Task attempt_201404090636_336528_m_000112_1 failed to report status for 600 seconds. Killing!
Task attempt_201404090636_336528_m_000112_2 failed to report status for 600 seconds. Killing!
 
I was trying to distcp a directory that holds about 1,200 files, each 3 MB to 5 MB in size.
 
We have 80 datanodes in the cluster.
 
 
Any help with this would be appreciated.
 
Thanks
roy
 
 
 
 
 
1 ACCEPTED SOLUTION


Re: distcp on cdh3u5 failing with timeout error


It looks like it is working after adding -D mapred.task.timeout=60000000 to the distcp command.
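
For anyone hitting the same error, here is a rough sketch of how the option might be passed. mapred.task.timeout is given in milliseconds, so the 600 seconds in the error message corresponds to 600000, and 60000000 raises the limit to 60000 seconds. The source and destination paths, the namenode address, the bucket name, and the s3n:// scheme below are placeholders and assumptions, not details from the original job. The script only prints the command as a dry run.

```shell
#!/bin/sh
# mapred.task.timeout is expressed in milliseconds.
TIMEOUT_MS=60000000
TIMEOUT_SECONDS=$((TIMEOUT_MS / 1000))
echo "task timeout: ${TIMEOUT_SECONDS} seconds"

# Hypothetical paths (assumptions): replace with the real source and bucket.
SRC="hdfs://namenode:8020/path/to/source"
DST="s3n://example-bucket/path/to/backup"

# The -D generic option goes before the source and destination arguments.
# Dry run: this prints the command; remove the leading "echo" to run it.
echo hadoop distcp -D mapred.task.timeout="${TIMEOUT_MS}" "${SRC}" "${DST}"
```

Note that a very large timeout also means a task that is truly hung will not be killed for a long time, so a smaller value may be worth trying first.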

