Created 06-12-2017 10:38 AM
Hi All,
what is the impact on existing cluster if we remove 50 of 200 datanodes from existing cluster . space is not an issue as the system is 30% of the hdfs filesystem usage . how long does it takes to rebalance the complete cluster ?
Regards
Srinivas S
Created 06-12-2017 05:06 PM
Don't just remove the DataNodes. Even with rack awareness, removing >2 nodes from different racks will lead to data loss. Instead, you should decommission them first as described here:
You may know this already, but I want to make it clear for others who read this discussion in the future.
Created 06-12-2017 11:37 AM
Hi @srinivas s if you're using rack awareness you should probably get rid of 50 datanodes by decomissioning them without loosing some blocks, otherwise you'll probably will.
rebalance time depends on your network and cluster utilization, you can adjust some parameters to make it fastest if necessary, basically
hdfs dfsadmin -setBalancerBandwidth <bandwidth (kb/s)>
or within your HDFS params (example) :
dfs.balance.bandwidthPerSec=100000000 dfs.datanode.max.transfer.threads=16384 dfs.datanode.balance.max.concurrent.moves=500
please check Accept if you're satisfied with my answer
Created 06-12-2017 05:12 PM
@Laurent Edel this answer is incorrect. Please consider editing it to mention decommissioning. Else someone may assume it's OK to just remove nodes if they have rack awareness.
Created 06-12-2017 05:06 PM
Don't just remove the DataNodes. Even with rack awareness, removing >2 nodes from different racks will lead to data loss. Instead, you should decommission them first as described here:
You may know this already, but I want to make it clear for others who read this discussion in the future.