Support Questions

Find answers, ask questions, and share your expertise

disk used percentage is not symmetric on datanodes

avatar
Contributor

hi,

 

 we have a hdfs cluster that have 7 datanodes at first,  every datanode have 2TB disk.

lately we add datanode to 20, those new datanodes have 500GB disk. and then the hdfs

cluster had already been balanced.

 

now the problem is disk used percentage is always not symmetric when the cluster works a period of time,

those 2TB datanodes disk used percentage  are about 50%, but the 500GB datanodes disk used percentage

are about 90% , which made disk used percentage warning occurs. we have to balance the cluster manually.

 

how do I configure the cluster so as to it keep balance automatic?

any help would be appreciated.

 

1 ACCEPTED SOLUTION

avatar
Community Manager

At the present time there is no functionality in HDFS to do a per-disk balancing on a DataNode.

 

A long-standing Jira has been opened to add this functionality in a future release:

 

https://issues.apache.org/jira/browse/HDFS-1312



David Wilder, Community Manager


Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

Learn more about the Cloudera Community:

Terms of Service

Community Guidelines

How to use the forum

View solution in original post

1 REPLY 1

avatar
Community Manager

At the present time there is no functionality in HDFS to do a per-disk balancing on a DataNode.

 

A long-standing Jira has been opened to add this functionality in a future release:

 

https://issues.apache.org/jira/browse/HDFS-1312



David Wilder, Community Manager


Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

Learn more about the Cloudera Community:

Terms of Service

Community Guidelines

How to use the forum