These are the YARN parameters that control the minimum and
maximum container sizes YARN can allocate:
YARN PARAMETERS:
---->
yarn.scheduler.minimum-allocation-mb - The minimum allocation for every
container request at the RM, in MB. Memory requests lower than this won't take
effect; the request is raised to this minimum.
---->
yarn.scheduler.maximum-allocation-mb - The maximum allocation for every
container request at the RM, in MB. Memory requests higher than this are
rejected: YARN throws an InvalidResourceRequestException.
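These limits are set on the ResourceManager in yarn-site.xml. A minimal
sketch, assuming a 1 GB minimum and 8 GB maximum (the same values used in the
examples below):

<!-- yarn-site.xml on the ResourceManager; the values are just examples -->
<property>
  <name>yarn.scheduler.minimum-allocation-mb</name>
  <value>1024</value>   <!-- smallest container the RM will hand out -->
</property>
<property>
  <name>yarn.scheduler.maximum-allocation-mb</name>
  <value>8192</value>   <!-- largest container the RM will hand out -->
</property>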
MAPREDUCE PARAMETERS:
Client-side parameters that the job requests; we can override
these per job (see the command-line sketch after this list).
mapreduce.map.memory.mb - Map container size
mapreduce.reduce.memory.mb - Reducer container size
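These can be set in mapred-site.xml or overridden per job with generic -D
options. A sketch using the stock wordcount example (the jar name and the HDFS
paths are placeholders):

hadoop jar hadoop-mapreduce-examples.jar wordcount \
    -Dmapreduce.map.memory.mb=2048 \
    -Dmapreduce.reduce.memory.mb=4096 \
    /input /output

In practice the JVM heap (mapreduce.map.java.opts / mapreduce.reduce.java.opts,
e.g. -Xmx1638m) is usually kept somewhat below the container size so the JVM
fits inside the container.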
Note: If we request more memory than the YARN maximum allocation limit, the
job will fail because YARN will report that it cannot allocate that much memory.
Below are a few examples:
----------------------------------------------------------------------------------------------------------------------------------------------------------------
Example (the following will fail):
+=============================+
Server side:
yarn.scheduler.minimum-allocation-mb=1024
yarn.scheduler.maximum-allocation-mb=8192
Client side:
mapreduce.map.memory.mb=10240
The 10240 MB request exceeds the 8192 MB maximum, so YARN rejects it and the job fails.
----------------------------------------------------------------------------------------------------------------------------------------------------------------
Another example (the following will work):
+=============================+
Server side:
yarn.scheduler.minimum-allocation-mb=1024
yarn.scheduler.maximum-allocation-mb=8192
Client side:
mapreduce.map.memory.mb=800
In this case each mapper will get 1024 MB (the minimum container size), since requests below the minimum are raised to it.
----------------------------------------------------------------------------------------------------------------------------------------------------------------
Another example (the following will work):
+=============================+
Server side:
yarn.scheduler.minimum-allocation-mb=1024
yarn.scheduler.maximum-allocation-mb=8192
Client side:
mapreduce.map.memory.mb=1800
In this case each mapper will get 2048 MB: the 1800 MB request is rounded up to the next multiple of the 1024 MB minimum allocation.
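The behaviour across all three examples can be sketched in a few lines of
Python. This is an illustration of the rounding shown above, not the actual
YARN code, and it assumes the common scheduler behaviour of rounding requests
up to a multiple of the minimum allocation:

import math

def normalize(request_mb, min_mb=1024, max_mb=8192):
    # Requests above the maximum are rejected outright,
    # mirroring YARN's InvalidResourceRequestException.
    if request_mb > max_mb:
        raise ValueError("requested %d MB > maximum allocation %d MB"
                         % (request_mb, max_mb))
    # Otherwise the request is rounded up to a multiple of the minimum.
    rounded = math.ceil(request_mb / min_mb) * min_mb
    return min(rounded, max_mb)

print(normalize(800))    # 1024 - raised to the minimum (second example)
print(normalize(1800))   # 2048 - rounded up to the next multiple (third example)
# normalize(10240) raises an error, matching the failing first example.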
Note: A single job can use one or many containers, depending on the size of the input data, the split size, and the nature of the data.
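As a rough illustration of how the container count falls out of the input and
split sizes, the sketch below mirrors FileInputFormat's split computation,
max(minSplit, min(maxSplit, blockSize)); the small slop factor Hadoop applies
to the last split is omitted, and all values are examples:

import math

def split_size_mb(block_mb=128, min_split_mb=1, max_split_mb=None):
    # Mirrors FileInputFormat.computeSplitSize(): max(min, min(max, block)).
    if max_split_mb is None:
        max_split_mb = block_mb
    return max(min_split_mb, min(max_split_mb, block_mb))

def num_map_containers(input_mb, block_mb=128):
    # One map task, and hence one container, per input split.
    return math.ceil(input_mb / split_size_mb(block_mb))

print(num_map_containers(1000))  # 8 map containers for 1000 MB at 128 MB splits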