<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Name Node and Data Node Directories in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140878#M32047</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11008/chandramoulimuthukumaran.html" nodeid="11008"&gt;@chandramouli muthukumaran&lt;/A&gt;&lt;/P&gt;&lt;P&gt;1. Namenode : 2 directories are enough for backing up namenode metadata in case of any crash of namenode. Usually 1st disk should be local disk and prefer 2nd disk as network storage[san/nas](Just incase local machine goes down you can have backup of namenode metadata on network storage). If you do not have network storage then 2 local disk are fine.&lt;/P&gt;&lt;P&gt;Disadvantage of multiple disk for storage - I/O performance will hamper as namenode will copy metadata to all the disk.&lt;/P&gt;&lt;P&gt;2. Datanodes : If you have multiple HDD attached to the machine then we can usually use them for HDFS data storage.&lt;/P&gt;&lt;P&gt;Multiple disk on datanode is not a problem. For datanode it will not store multiple copies of same data on all disk as of Namenode.&lt;/P&gt;&lt;P&gt;Please check link below with basic concept explained -http://hortonworks.com/blog/hdfs-metadata-directories-explained/&lt;/P&gt;</description>
    <pubDate>Thu, 16 Jun 2016 01:08:52 GMT</pubDate>
    <dc:creator>sshimpi</dc:creator>
    <dc:date>2016-06-16T01:08:52Z</dc:date>
    <item>
      <title>Name Node and Data Node Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140874#M32043</link>
      <description>&lt;P&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/5037-directories.png"&gt;directories.png&lt;/A&gt;Hi,&lt;/P&gt;&lt;P&gt;I see that by default after the automated install using Ambari, there are bunch of directories under name node and data node settings in the configuration. Can you tell me if that is the best practice or if we should remove some directories in that and keep only one in each. Also please let me know if removing those do not affect the hdfs services.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Chandra&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2016 00:45:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140874#M32043</guid>
      <dc:creator>Chandra</dc:creator>
      <dc:date>2016-06-16T00:45:32Z</dc:date>
    </item>
    <item>
      <title>Re: Name Node and Data Node Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140875#M32044</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11008/chandramoulimuthukumaran.html" nodeid="11008"&gt;@chandramouli muthukumaran&lt;/A&gt; if you intend to use the /opt/Symantec/hadoop/hdfs/namenode for namenode and /opt/Symantec/hadoop/hdfs/data only then you can remove the other entries. Save and then  you might need to restart the HDFS and corresponding services as indicated by ambari. &lt;/P&gt;&lt;P&gt;Incase if you have multiple directories for Data then add those entries like this.&lt;/P&gt;&lt;P&gt;/opt/Symantec/hadoop/hdfs/data1,/opt/Symantec/hadoop/hdfs/data2,/opt/Symantec/hadoop/hdfs/data3&lt;/P&gt;&lt;P&gt;Hope this helps.&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2016 01:01:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140875#M32044</guid>
      <dc:creator>yjagadeesan</dc:creator>
      <dc:date>2016-06-16T01:01:22Z</dc:date>
    </item>
    <item>
      <title>Re: Name Node and Data Node Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140876#M32045</link>
      <description>&lt;P&gt;Thanks for your answer. If we have multiple directories, will the hdfs files be stored multiple times in those directories? sorry I am a newbie hence need to get this clarified. &lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2016 01:01:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140876#M32045</guid>
      <dc:creator>Chandra</dc:creator>
      <dc:date>2016-06-16T01:01:28Z</dc:date>
    </item>
    <item>
      <title>Re: Name Node and Data Node Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140877#M32046</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/11008/chandramoulimuthukumaran.html"&gt;@chandramouli muthukumaran&lt;/A&gt; No, as for HDFS files, their storage will depend only on replication factor. Think about it this way. You start with a fresh linux install. You have different mount points in your system with different capacities. Which mount points would you like to use to store your HDFS data (datanode) as well as your metadata (namenode).&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2016 01:07:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140877#M32046</guid>
      <dc:creator>mqureshi</dc:creator>
      <dc:date>2016-06-16T01:07:54Z</dc:date>
    </item>
    <item>
      <title>Re: Name Node and Data Node Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140878#M32047</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11008/chandramoulimuthukumaran.html" nodeid="11008"&gt;@chandramouli muthukumaran&lt;/A&gt;&lt;/P&gt;&lt;P&gt;1. Namenode : 2 directories are enough for backing up namenode metadata in case of any crash of namenode. Usually 1st disk should be local disk and prefer 2nd disk as network storage[san/nas](Just incase local machine goes down you can have backup of namenode metadata on network storage). If you do not have network storage then 2 local disk are fine.&lt;/P&gt;&lt;P&gt;Disadvantage of multiple disk for storage - I/O performance will hamper as namenode will copy metadata to all the disk.&lt;/P&gt;&lt;P&gt;2. Datanodes : If you have multiple HDD attached to the machine then we can usually use them for HDFS data storage.&lt;/P&gt;&lt;P&gt;Multiple disk on datanode is not a problem. For datanode it will not store multiple copies of same data on all disk as of Namenode.&lt;/P&gt;&lt;P&gt;Please check link below with basic concept explained -http://hortonworks.com/blog/hdfs-metadata-directories-explained/&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2016 01:08:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140878#M32047</guid>
      <dc:creator>sshimpi</dc:creator>
      <dc:date>2016-06-16T01:08:52Z</dc:date>
    </item>
    <item>
      <title>Re: Name Node and Data Node Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140879#M32048</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11008/chandramoulimuthukumaran.html" nodeid="11008"&gt;@chandramouli muthukumaran&lt;/A&gt; No it does not store multiple times.  &lt;/P&gt;&lt;P&gt;Good Luck with your hadooping.&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2016 01:11:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140879#M32048</guid>
      <dc:creator>yjagadeesan</dc:creator>
      <dc:date>2016-06-16T01:11:39Z</dc:date>
    </item>
    <item>
      <title>Re: Name Node and Data Node Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140880#M32049</link>
      <description>&lt;P&gt;thanks much&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2016 01:25:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Name-Node-and-Data-Node-Directories/m-p/140880#M32049</guid>
      <dc:creator>Chandra</dc:creator>
      <dc:date>2016-06-16T01:25:36Z</dc:date>
    </item>
  </channel>
</rss>

