Reducer tasks take a long time
Labels: Apache Hadoop
Created 02-17-2016 11:56 AM
Hi, what can I do to improve the reducer time? I have 107 mappers and just 1 reducer, so which parameters could I change? Maybe this one?
mapreduce.job.counters.max
Thanks
Created 02-17-2016 12:41 PM
I would look at setting intermediate compression on the map output and output compression on the reduce output. You can also look at using a combiner class. A driver sketch wiring these up is below.
For map output compression:
mapreduce.map.output.compress
mapreduce.map.output.compress.codec
and for output compression:
mapreduce.output.fileoutputformat.compress
mapreduce.output.fileoutputformat.compress.type
mapreduce.output.fileoutputformat.compress.codec
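Here is a minimal driver sketch showing where those properties and the combiner go (MyMapper and MyReducer are hypothetical placeholder classes for a word-count-style job, and Snappy is just one codec choice):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.io.compress.CompressionCodec;
import org.apache.hadoop.io.compress.SnappyCodec;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class CompressedJob {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Compress intermediate map output to cut shuffle traffic to the single reducer.
        conf.setBoolean("mapreduce.map.output.compress", true);
        conf.setClass("mapreduce.map.output.compress.codec",
                SnappyCodec.class, CompressionCodec.class);
        // Compress the final reduce output as well.
        conf.setBoolean("mapreduce.output.fileoutputformat.compress", true);
        conf.set("mapreduce.output.fileoutputformat.compress.type", "BLOCK");
        conf.setClass("mapreduce.output.fileoutputformat.compress.codec",
                SnappyCodec.class, CompressionCodec.class);

        Job job = Job.getInstance(conf, "compressed job");
        job.setJarByClass(CompressedJob.class);
        job.setMapperClass(MyMapper.class);     // hypothetical mapper class
        job.setCombinerClass(MyReducer.class);  // combiner pre-aggregates map output before the shuffle
        job.setReducerClass(MyReducer.class);   // hypothetical reducer class
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}

Note that mapreduce.output.fileoutputformat.compress.type only takes effect for SequenceFile output, and a combiner is only valid when the reduce logic is commutative and associative (sums, counts, max, etc.).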
Created 02-17-2016 12:01 PM
Do you have a support contract? Please install SmartSense for better utilization of your cluster.
Created 02-17-2016 12:09 PM
Hi, we don't have one yet. Is SmartSense free?
Thanks
Created 02-17-2016 12:20 PM
@Roberto Sancho It's part of HDP and available to supported customers. http://hortonworks.com/blog/introducing-hortonworks-smartsense/
Created 02-17-2016 12:00 PM
It's mapred.reduce.tasks. If you run a MapReduce program from the Hadoop client, you would set it like this:
-Dmapred.reduce.tasks=x
Pig and Hive have their own ways of estimating the number of reducers. See the sketch below.
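Keep in mind that -D generic options are only picked up when the driver runs through ToolRunner. A minimal sketch (the class and job names are placeholders):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.util.ToolRunner;

public class MyDriver extends Configured implements Tool {
    public int run(String[] args) throws Exception {
        // getConf() already contains anything passed as -Dkey=value on the command line.
        Job job = Job.getInstance(getConf(), "my job");
        job.setJarByClass(MyDriver.class);
        job.setNumReduceTasks(4); // programmatic equivalent of -Dmapred.reduce.tasks=4
        // ... set mapper, reducer, input and output paths as usual ...
        return job.waitForCompletion(true) ? 0 : 1;
    }

    public static void main(String[] args) throws Exception {
        System.exit(ToolRunner.run(new Configuration(), new MyDriver(), args));
    }
}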
Created 02-17-2016 12:49 PM
This -Dmapred.reduce.tasks=x is for MapReduce 1. I am using MapReduce 2 with YARN and I don't know how to change this parameter.
Any suggestions?
Thanks
Created 02-17-2016 12:59 PM
It still works on YARN. The official new property is mapreduce.job.reduces, but I have always used the one above and Hadoop still accepts it.
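A quick sanity-check sketch (assuming Hadoop 2, where deprecated keys are translated to their new names once the MapReduce config classes load):

import org.apache.hadoop.mapred.JobConf;

public class DeprecatedKeyCheck {
    public static void main(String[] args) {
        JobConf conf = new JobConf();
        conf.set("mapred.reduce.tasks", "4");                  // deprecated MR1 name
        System.out.println(conf.get("mapreduce.job.reduces")); // should print 4 under YARN
    }
}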
Created 02-17-2016 12:59 PM
@Roberto Sancho here's a list of all the deprecated mapred properties and their replacements:
https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-common/DeprecatedProperties.html
The property you're looking for is mapreduce.job.reduces.
