Support Questions

Find answers, ask questions, and share your expertise

Hadoop balancer performing slow.

New Contributor

Hadoop balancer performing very slow in our production cluster.

 

CDH version 5.5.1 express edition

Cluster capacity 850 nodes.

 

Highest % occupies nodes - 85%

Lowest % occupied nodes - 5 %

 

Recently we have added 40 nodes in our production cluster and initiated the hadoop balancer with following parameters but there is no significant improvement while transfering the data Can you please suggest what are parameters which I need to configure to improve the performance.

 

Is it possible to transfer TB of data in each iteation.

 

sudo -u apps hdfs balancer
-Ddfs.balancer.movedWinWidth=5400000 \
-Ddfs.balancer.moverThreads=2000 \
-Ddfs.balancer.dispatcherThreads=2000 \
-Ddfs.balance.bandwidthPerSec=200000000 \
-Ddfs.balancer.getBlocks.size=1000000000000 \
-Ddfs.datanode.balance.max.concurrent.moves=24 \
-Ddfs.balancer.getBlocks.min-block-size=536870912 -threshold 15 -include <Datanodes>

 

 

Time Stamp               Iteration#  Bytes Already Moved  Bytes Left To Move  Bytes Being Moved

Jan 18, 2019 7:44:55 AM       61       568.44 GB           136.22 TB        280 GB
Jan 18, 2019 7:52:39 AM       62       576.80 GB           136.22 TB        280 GB
Jan 18, 2019 7:58:23 AM       63       583.06 GB           136.21 TB        280 GB
Jan 18, 2019 8:04:36 AM       64       590.99 GB           136.21 TB        280 GB
Jan 18, 2019 8:10:48 AM       65       602.05 GB           136.24 TB        280 GB

 

 

 

 

1 REPLY 1

Contributor
Although I don't have this problem, I would like to know the answer to this. It would definitely be good to know. Unfortunately, I have posted questions and received nothing but crickets as it appears you have.
Hopefully, someone will get back to you.