Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Balancing Blocks Between Disks on Datanode

avatar
Explorer

Hi. Some of my datanodes have different disk size. For example:

 

/dev/sdc1 918G 384G 534G 42% /data/disk1
/dev/sdd1 459G 381G 78G 84% /data/disk2
/dev/sde1 459G 391G 69G 86% /data/disk3
/dev/sdf1 459G 389G 70G 85% /data/disk4

 

 

My understanding is that there is currently no functionality for balancing within a datanode, so I'd have to move data around manually. I've found this article on performing the procedure: http://www-01.ibm.com/support/docview.wss?uid=swg21702775 (Procedure 1). Has anyone actually done this (or something similar)? Can you share any issues/caveats you ran across? Is this the best way to do it? If the other 3 disks fill up, will that datanode continue to write to disk1? 

 

Thank you. 

1 ACCEPTED SOLUTION

avatar
Expert Contributor
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login
4 REPLIES 4

avatar
Expert Contributor
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login

avatar
Explorer
Thank you. This is a great feature, and I appreciate the link. Unfortunately, our cluster is running 5.7.1, and given my lack of experience with CDH (I inherited this cluster) I'm loathe to upgrade it at the moment.

avatar
Expert Contributor

Same issue, do we have any such thing for  version older than 5.8 . disk balancer

avatar
Expert Contributor
diskbalanacer is a new feature in CDH5.8, and by definition, a new feature will not be backported to an older minor version.