I am managing CDH 5.14 cluster with 4 datanodes. Each datanode has 10*3TB of total disk space, split in 10 mount points (dfs.datanode.data.dir). The 4 datanodes have similar configuration so far.
Now, I want to add an extra datanode, which has 8*4TB of space. I have the following questions:
1. Do II have to create 10 similar mount points as in the rest datanodes?
2. Is it a balancing problem that the extra datanode will have a bit greater capacity? The main services used are Impala, Hive and Spark