distcp on cdh3u5 failing with timeout error

Hi,
 
  I am trying to back up some data from our ancient Hadoop cluster, which is on cdh3u5, to an S3 bucket with distcp. But my job is failing because some of the tasks are getting killed multiple times with the following message:
 
   
Task attempt_201404090636_336528_m_000112_0 failed to report status for 600 seconds. Killing!
Task attempt_201404090636_336528_m_000112_1 failed to report status for 600 seconds. Killing!
Task attempt_201404090636_336528_m_000112_2 failed to report status for 600 seconds. Killing!
 
I was trying to distcp a directory that has about 1200 files, each 3 MB to 5 MB in size.
 
We have 80 datanodes in the cluster.
 
 
Any help with this would be appreciated.
 
Thanks
roy
 
 
 
 
 
1 ACCEPTED SOLUTION


Looks like it's working after adding -D mapred.task.timeout=60000000 to the distcp command.
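
For anyone hitting the same error, here is a rough sketch of where the option goes on the command line. mapred.task.timeout is given in milliseconds, and the 600 seconds in the error message matches its default of 600000 ms. The source path, the bucket name, and the s3n:// scheme below are placeholders (assumptions, not values from the original job), and the script only prints the command rather than running it.

```shell
#!/bin/sh
# Hypothetical source and destination; replace with your own paths.
SRC="hdfs:///path/to/source"            # assumption: placeholder HDFS directory
DEST="s3n://example-bucket/backup"      # assumption: placeholder bucket and scheme

# mapred.task.timeout is expressed in milliseconds.
# 60000 seconds * 1000 = 60000000 ms, the value used in the accepted solution.
TIMEOUT_SECONDS=60000
TIMEOUT_MS=$((TIMEOUT_SECONDS * 1000))

# Build the command as a string and print it (dry run) so it can be
# reviewed before being executed on the cluster.
CMD="hadoop distcp -D mapred.task.timeout=${TIMEOUT_MS} ${SRC} ${DEST}"
echo "$CMD"
```

Since -D is a generic Hadoop option, it should come right after distcp and before the source and destination paths. 60000000 ms is 60000 seconds (roughly 16.7 hours), so a smaller value may well be enough if the individual copies finish sooner.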
