Support Questions
Find answers, ask questions, and share your expertise

Datanodes volumes similarity

Rising Star

Hello,

I am managing  CDH 5.14 cluster with 4 datanodes. Each datanode has 10*3TB of total disk space, split in 10 mount points (dfs.datanode.data.dir). The 4 datanodes have similar configuration so far.

 

Now, I want to add an extra datanode, which has 8*4TB of space. I have the following questions:

1. Do II have to create 10 similar mount points as in the rest datanodes?

2. Is it a balancing problem that the extra datanode will have a bit greater capacity? The main services used are Impala, Hive and Spark

 

Thank you!