Created 09-07-2018 08:12 AM
I am running a mapreduce job, calculating the split locations and I see job.split and job.splitmetainfo files contain the locations, but mapper side it is prining the locations null
CDH enterprise 5.14.0
Sep 7, 9:16:49.300 AM INFO org.apache.hadoop.mapred.MapTask
Processing split: AccInputSplit [splitId=174, locations=[null, null, null, null, null, null, null, null, null, null], splitLength=516537674]
anybody seen like this?
Created on 09-10-2018 07:12 PM - edited 09-10-2018 07:12 PM
Want to get a detailed solution you have to login/registered on the community
Register/LoginCreated 09-07-2018 09:14 AM
Created 09-07-2018 09:20 AM
>>> your custom split class' readFields method
I will investigate this, update this thread, thanks
Created 09-10-2018 08:58 AM
@Harsh J you are right readFields is just a dummy,
but getLocations() is fine, we are using fair scheduler, which should take getLocations() and schedule the job to according to data localituy right?
Created on 09-10-2018 07:12 PM - edited 09-10-2018 07:12 PM
Want to get a detailed solution you have to login/registered on the community
Register/LoginCreated 09-11-2018 07:45 AM
Thanks for the help,
proper implmentation of readfields solved the problem.