HDFS disk allocation

Do the locations in these two parameters determine where HDFS takes its disk space from?

NameNode directories=/u01/hadoop/hdfs/namenode 
DataNode directories=/u01/hadoop1/hdfs/data

Also, if I want to add more space to the cluster by adding another disk, how do I add it to the cluster?

1 ACCEPTED SOLUTION

@Sami Ahmad The DataNode directories are where HDFS data is actually stored, while the NameNode directory stores the NameNode metadata, i.e. the file and block information for the HDFS data stored on the DataNodes. To add space to HDFS with additional disks, add the new disk's path to the DataNode directories. For example, if you add a disk mounted at /u02 on the server(s), append the corresponding path to the DataNode directories:

DataNode directories=/u01/hadoop1/hdfs/data,/u02/hadoop1/hdfs/data
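As a rough illustration of the change above, here is a small Python sketch that appends a new directory to the comma-separated DataNode directories value without creating duplicates. The helper name `add_data_dir` is hypothetical and not part of Hadoop or Ambari; the script only builds the string, it does not touch any cluster configuration.

```python
def add_data_dir(current: str, new_dir: str) -> str:
    """Return the comma-separated directory list with new_dir appended.

    Hypothetical helper for illustration only: it manipulates the string
    value and leaves the list unchanged if new_dir is already present.
    """
    # Split the existing value and drop empty or whitespace-only entries.
    dirs = [d.strip() for d in current.split(",") if d.strip()]
    new_dir = new_dir.strip()
    if new_dir and new_dir not in dirs:
        dirs.append(new_dir)
    return ",".join(dirs)


if __name__ == "__main__":
    # Paths taken from the thread; /u02 is the newly added disk.
    print(add_data_dir("/u01/hadoop1/hdfs/data", "/u02/hadoop1/hdfs/data"))
```

The resulting string is what you would paste into the DataNode directories field. The new directory is assumed to already exist on the mounted disk.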

Thanks, Pardeep.

So should we keep the HDFS disks separate from the OS disk, i.e. the one where the Linux operating system is installed?

Yes, that is the recommended approach.
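One way to sanity-check this on a node is to compare the device ID of each data directory with that of the root filesystem. The Python sketch below does that with `os.stat`; the function names are hypothetical, and it assumes a Linux host. Note that it only detects whether a path lives on the same mounted filesystem as `/`. Two partitions on the same physical disk would still show up as different devices, so treat the result as a hint rather than proof.

```python
import os


def nearest_existing(path: str) -> str:
    """Walk up from path until an existing directory or file is found."""
    path = os.path.abspath(path)
    while not os.path.exists(path):
        parent = os.path.dirname(path)
        if parent == path:  # reached the filesystem root
            break
        path = parent
    return path


def shares_device_with_root(path: str) -> bool:
    """True if path (or its nearest existing ancestor) is on the same
    mounted filesystem as "/"."""
    return os.stat(nearest_existing(path)).st_dev == os.stat("/").st_dev


if __name__ == "__main__":
    # Paths taken from the thread; adjust to your own DataNode directories.
    for data_dir in ["/u01/hadoop1/hdfs/data", "/u02/hadoop1/hdfs/data"]:
        if shares_device_with_root(data_dir):
            print(f"{data_dir}: same filesystem as the OS root")
        else:
            print(f"{data_dir}: separate filesystem")
```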