Currently, our cluster's hosts has 12 data disks(900G) installed besides the OS disk. and I created files systems /data01~data12 for every data disk. and set dfs.datanode.data.dir
Now, I'd like add several hosts with only 5 data disks(1.8T) installed to the cluster as datanodes. can I do it by creating 5 files systems /data01~data05 and adding new config groups with the dfs.datanode.data.dir in Ambari?
For such exending capablity scenario, What is the best practice?
I guess you should use the Config Group concept via Ambari to achieve the same. You can add those new hosts to your cluster using ambari and then create a new HDFS config group and choose those hosts where you want different value for the mentioned property.
You can add new hosts to ambari cluster via Ambari itself as mentioned in the following link.
- Once those hosts are added to your cluster then you can deploy your desired services/components to them like DataNode via ambari. The create the config groups.