Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

File URI required for dfs.datanode.data.dir ?

avatar

Greeings, all.

Can anybody tell me if a File URI is mandatory for each component of the value of the dfs.datanode.data.dir element in the hdfs-site.xml file?

For example, am I required to specify "file:///grid/hadoop/hdfs/dn, file:///grid1/hadoop/hdfs/dn"? Or, could I use "/grid/hadoop/hdfs/dn, /grid1/hadoop/hdfs/dn", instead, and have effectively the same metadata?

I'm using HDP 2.4.3.

Thanks very much!

~ Joseph

1 ACCEPTED SOLUTION

avatar

@Joseph Hawkins - You can just have the value as "/grid/hadoop/hdfs/dn,/grid1/hadoop/hdfs/dn" if they are simple directories on the file system. This is the list of directories where the data will be stored. However, if you have Heterogeneous storage (like DISK, SSD, ARCHIVE or RAM_DISK) you would need to append the File URI.

Below is a very good article explaining HDFS Metadata Directories:

https://hortonworks.com/blog/hdfs-metadata-directories-explained/

View solution in original post

2 REPLIES 2

avatar
Super Collaborator

@Joseph Hawkins Can you please tell me the action/operation which you are trying ?

avatar

@Joseph Hawkins - You can just have the value as "/grid/hadoop/hdfs/dn,/grid1/hadoop/hdfs/dn" if they are simple directories on the file system. This is the list of directories where the data will be stored. However, if you have Heterogeneous storage (like DISK, SSD, ARCHIVE or RAM_DISK) you would need to append the File URI.

Below is a very good article explaining HDFS Metadata Directories:

https://hortonworks.com/blog/hdfs-metadata-directories-explained/