Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

downtime for addition of nodes

avatar

want to add a few nodes to a running cluster. does it require any down time and will the running jobs get affected?

1 ACCEPTED SOLUTION

avatar
Explorer

Basically, adding nodes (hosts) requires no downtime and will not affect running jobs at all.

Btw, you may try to do this by Ambari Add Hosts function. Scenario:

1. Go to Ambari Hosts page.

2. Click on Actions button located on the top left of this page.

3. Select "Add New Hosts " option.

You will see a wizard which will guide you through the whole process, you can add hosts with any slaves (DataNode, NodeManager, clients .etc) installed on them.

Hope this helps 🙂

View solution in original post

4 REPLIES 4

avatar

@ARUN :

No downtime is required for simple adding nodes for just DN, NM and other components. Even for adding client node(Edge node) doesn't require any downtime.

avatar
Explorer

Basically, adding nodes (hosts) requires no downtime and will not affect running jobs at all.

Btw, you may try to do this by Ambari Add Hosts function. Scenario:

1. Go to Ambari Hosts page.

2. Click on Actions button located on the top left of this page.

3. Select "Add New Hosts " option.

You will see a wizard which will guide you through the whole process, you can add hosts with any slaves (DataNode, NodeManager, clients .etc) installed on them.

Hope this helps 🙂

avatar
Super Guru

@ARUN

As @Xi Wang responded and suggested. I assess her response as appropriate. Please see also doc link (assuming you use Ambari 2.1.1.0, change to the proper link if you use an earlier version):

http://docs.hortonworks.com/HDPDocuments/Ambari-2.1.1.0/bk_Ambari_Users_Guide/content/_adding_hosts_...

Also, keep in mind that only new data will use the new data nodes, unless you execute rebalance hdfs command, then you have a distribution of all the data across all data nodes. Default threshold is 10, but you could change it to your desired threshold. You may want to execute it during off-peak hours.