How to specify more than one path for storage in hadoop?

Can you explain how to specify more than one path for storage in Hadoop?

Storage paths for HDFS are set with a property in hdfs-site.xml on each data node.

Property: dfs.datanode.data.dir (Hadoop 1.x used the older name dfs.data.dir)
The value of dfs.datanode.data.dir can be any directory that is available on the data node. It determines where on the local filesystem the data node stores its blocks.
To use more than one path, give a comma-separated list of directories, typically one per mounted disk or partition, for example '/user1/hadoop/data,/user2/hadoop/data'. When the property has multiple values, the data node writes new blocks to the directories in round-robin fashion. If the disk behind one of the directories is full, round-robin writes continue across the remaining directories.
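
As a minimal sketch of the setting described above, the hdfs-site.xml entry might look like the following. The two paths are the example directories from this reply, not defaults; substitute the mount points that exist on your own data nodes.

```xml
<!-- hdfs-site.xml on each data node -->
<configuration>
  <property>
    <name>dfs.datanode.data.dir</name>
    <!-- Comma-separated list; the paths below are illustrative only -->
    <value>/user1/hadoop/data,/user2/hadoop/data</value>
  </property>
</configuration>
```

A change to this property generally takes effect only after the data node is restarted, and each listed directory needs to be writable by the user that runs the data node process.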
