Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Heterogeneous worker nodes with HDP2.3

avatar

What precautions & extra configurations, if any, are needed when adding worker nodes with different capacity to a cluster? My understanding is that YARN will be able to just manage the nodes without anything special.

For e.g - Any issues with adding 3 nodes with following config to an existing POC cluster that has similar nodes 8 cores, 32 Gigs, 3 TB DAS for data-

node1 - 8 cores, 64GB RAM, NO STORAGE 

node2 - 8 cores, 64GB RAM, 2 TB

node3 - 8 cores, 64GB RAM, 2TB

Also, how do you configure YARN to utilize different amount of memory on these heterogeneous boxes?

1 ACCEPTED SOLUTION

avatar
Master Mentor

@bsaini@hortonworks.com

Node 1 - no storage - I guess you wont be using this for data node but it can play role of worker node (Nodemanager if you like and other components)

You will be creating a config group for node1

Please see this screenshot. In my case, node4 is datanode and I want to customize the data directories

Created new config for node4

329-screen-shot-2015-10-25-at-100127-am.png

select the config and change parameters that you want to change.

There is

328-screen-shot-2015-10-25-at-100205-am.png

Doc example

View solution in original post

2 REPLIES 2

avatar
Master Mentor

you need to run the configuration utility against these nodes to account for memory, disk and cpu sizing. If you don't put node1 with no storage in its own group and specify not to look for whatever the hdfs or yarn directory you will run out of disk pretty quick because hdfs will try to write data to nonexisting directories in filesystem. So for node 1, I'd put it in it's own Ambari configuration group.

avatar
Master Mentor

@bsaini@hortonworks.com

Node 1 - no storage - I guess you wont be using this for data node but it can play role of worker node (Nodemanager if you like and other components)

You will be creating a config group for node1

Please see this screenshot. In my case, node4 is datanode and I want to customize the data directories

Created new config for node4

329-screen-shot-2015-10-25-at-100127-am.png

select the config and change parameters that you want to change.

There is

328-screen-shot-2015-10-25-at-100205-am.png

Doc example