Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

Confused by CDH5 YARN/MR2 container memory settings

Expert Contributor

Hi, I'm trying to switch to CDH5 🙂

I have several nodes, each with 32 GB of RAM. Here is what confuses me:

1. How do I control task concurrency? I don't need any pools/schedulers right now; I need one huge pool. I don't need any complexity, since there is no reason for it. How can I force all my MR jobs to run in one huge pool?

2. How do I control memory allocation? I used mapred.child.java.opts in MR1, but it doesn't work in MR2.

3. I see a huge number of settings related to *.memory.mb and *.memory.max. I've read the descriptions and don't see how they influence my MR jobs.

4. The ResourceManager shows 9 NodeManagers (NM). Each NM has 8 vcores (that's fine, there are 8 HT cores on each node) and 8 GB of RAM.

Why 8 GB of RAM? I have 32 GB per node. How can I change it?

1 ACCEPTED SOLUTION

Expert Contributor
3 REPLIES

Expert Contributor

Mentor
In addition to Roy's link, we have a guide in our documentation aimed at both developers and operators/administrators that covers what needs to be changed or mapped between MR1 and MR2. You can find it at http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/cdh_ig_mapreduce_to_yar...
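
To make the memory questions above a bit more concrete, here is a rough Python sketch of the arithmetic, not an official sizing tool. It assumes the stock defaults of yarn.nodemanager.resource.memory-mb = 8192 (which would explain the 8 GB shown per NodeManager) and mapreduce.map.memory.mb = 1024. The function names, the 24576 MB node setting, the 4096 MB task size and the 0.8 heap fraction are illustrative assumptions, not recommendations. It also assumes the scheduler accounts for both memory and vcores and rounds requests up to a multiple of the minimum allocation. Your scheduler configuration may behave differently.

```python
import math


def round_up(value_mb, step_mb):
    """Round a memory request up to the next multiple of step_mb."""
    return int(math.ceil(value_mb / step_mb)) * step_mb


def containers_per_node(node_memory_mb, node_vcores, task_memory_mb,
                        task_vcores=1, min_allocation_mb=1024):
    """Estimate how many task containers fit on one NodeManager.

    node_memory_mb    ~ yarn.nodemanager.resource.memory-mb
    node_vcores       ~ yarn.nodemanager.resource.cpu-vcores
    task_memory_mb    ~ mapreduce.map.memory.mb (or mapreduce.reduce.memory.mb)
    min_allocation_mb ~ yarn.scheduler.minimum-allocation-mb
    Assumption: both memory and vcores limit the container count.
    """
    granted_mb = round_up(task_memory_mb, min_allocation_mb)
    by_memory = node_memory_mb // granted_mb
    by_cpu = node_vcores // task_vcores
    return min(by_memory, by_cpu)


def heap_opts(task_memory_mb, heap_fraction=0.8):
    """Build a JVM heap flag for mapreduce.map.java.opts.

    heap_fraction is an assumed rule of thumb that leaves headroom
    for non-heap JVM memory inside the container.
    """
    return "-Xmx{}m".format(int(task_memory_mb * heap_fraction))


# Defaults: 8192 MB per NodeManager, 1024 MB per map task.
default_node = containers_per_node(8192, 8, 1024)

# Hypothetical tuning for a 32 GB node: 24576 MB for YARN, 4096 MB tasks.
tuned_node = containers_per_node(24576, 8, 4096)

print(default_node, tuned_node, heap_opts(4096))
```

In MR2 the container size (mapreduce.map.memory.mb, mapreduce.reduce.memory.mb) and the JVM heap (mapreduce.map.java.opts, mapreduce.reduce.java.opts) are set separately, so the heap needs to stay below the container size or the NodeManager may kill the task.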

Expert Contributor
Hi, sorry, no luck. I'm still struggling with MR2/YARN and have no idea how it works.
Right now I'm getting a deadlock several times a day. I have a single user who submits jobs. That user has a huge pool (32*8 memory and 4*CPU) with a limit of 8 applications at once. Suddenly everything stops.
What does this mean? How can I find out what went wrong?