Created 06-27-2017 06:26 PM
Do the locations of these two parameters determine where HDFS takes its space from?
NameNode directories=/u01/hadoop/hdfs/namenode
DataNode directories=/u01/hadoop1/hdfs/data
Also, if I want to add more space to the cluster by adding another disk, how do I add it to the cluster?
Created 06-27-2017 06:31 PM
@Sami Ahmad DataNode directories are where HDFS data is actually stored, while the NameNode directory stores NameNode metadata, i.e. the file and block information for the HDFS data stored on the DataNodes. To add space to HDFS with additional disks, add each new disk to DataNode directories; for example, if you add u02 to the server(s), add it to DataNode directories as well:
DataNode directories=/u01/hadoop1/hdfs/data,/u02/hadoop1/hdfs/data
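As a rough sketch of the same edit, the DataNode directories value is just a comma-separated list of paths (in hdfs-site.xml it corresponds to the dfs.datanode.data.dir property). The helper name add_datanode_dir below is hypothetical, not part of Hadoop or Ambari; it only shows how the new value is formed, and the DataNode would still need a restart to pick it up.

```python
def add_datanode_dir(current_value: str, new_dir: str) -> str:
    """Return the comma-separated directory list with new_dir appended.

    Hypothetical helper for illustration only; it edits a string and
    does not touch any Hadoop configuration file.
    """
    # Split the existing value and drop empty entries or stray spaces.
    dirs = [d.strip() for d in current_value.split(",") if d.strip()]
    # Avoid listing the same directory twice.
    if new_dir not in dirs:
        dirs.append(new_dir)
    return ",".join(dirs)


updated = add_datanode_dir("/u01/hadoop1/hdfs/data", "/u02/hadoop1/hdfs/data")
print(updated)
```

The resulting string is what goes into the DataNode directories field.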
Created 06-27-2017 06:44 PM
Thanks, Pardeep.
So should we keep the HDFS disk separate from the OS disk, i.e. the disk where the Linux operating system is installed?
Created 06-29-2017 06:40 PM
Yes, that is the recommended approach.
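One way to check this on a node, as a minimal sketch: compare the device ID of the root filesystem with that of a data directory. The function name on_same_device is hypothetical, and the check assumes the path already exists on the node. It compares mounted filesystems, so two partitions of the same physical disk will still show up as different devices.

```python
import os


def on_same_device(path_a: str, path_b: str) -> bool:
    """Return True if both paths live on the same mounted filesystem."""
    # st_dev identifies the device that holds each path.
    return os.stat(path_a).st_dev == os.stat(path_b).st_dev


# Example (assumes the directory exists on this node):
# on_same_device("/", "/u01/hadoop1/hdfs/data")
# A result of False means the data directory is on a filesystem other than the OS root.
```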