Created 09-20-2017 11:17 AM
Hello,
I need to configure YARN and MapReduce as mentioned in this link
I'm blocked at the step of specifying disks number as shown in "step.png".
How can i extract disks number on each node of my cluster ?
How can i resolve this issue ?
Created 09-20-2017 02:09 PM
That is wrongly printed parameter. It should be 'dfs.datanode.data.dir' instead of 'dfs.data.dir'. This has been rectified in higher version of documentation.
dfs.datanode.data.dir determines where on the local filesystem an DFS data node should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. Directories that do not exist are ignored. The number of comma-delimited list equals to number of disks. |
Hope this helps you.
Created 09-20-2017 02:09 PM
That is wrongly printed parameter. It should be 'dfs.datanode.data.dir' instead of 'dfs.data.dir'. This has been rectified in higher version of documentation.
dfs.datanode.data.dir determines where on the local filesystem an DFS data node should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. Directories that do not exist are ignored. The number of comma-delimited list equals to number of disks. |
Hope this helps you.
Created 09-25-2017 01:44 PM
Thank you for your pertinent answer.
Created 09-20-2017 02:59 PM
@raouia Check the number of data disks you are using in Ambari (under HDFS configs look for dfs.datanode.data.dir) or if you havent installed yet, do a "df -h" on the node to count number of data disks - usually looks like data01, data02 ..
Created 09-25-2017 01:45 PM