<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Difference between dfs.data.dir &amp;amp; dfs.datanode.data.dir in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Difference-between-dfs-data-dir-amp-dfs-datanode-data-dir/m-p/34496#M35818</link>
    <description>&lt;P&gt;Hi Team,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone please let me know, what is the difference between these 2 parameters ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cloudera manager sets&amp;nbsp;dfs.datanode.data.dir inside&amp;nbsp;/swap/ folder by default. In EC2 instance, /swap is a temporary directory, which gets deleted and recreated at bootup.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does that mean some blocks will be deleted&amp;nbsp;at start up and cluster will be curropted ?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have set up a single node cluster on a EC2 machine with CDH5.5.0 and facing the cluster curruption just after shut down and restart.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can it be one of the reason ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Vikas&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 09:50:38 GMT</pubDate>
    <dc:creator>VikasSharma</dc:creator>
    <dc:date>2022-09-16T09:50:38Z</dc:date>
    <item>
      <title>Difference between dfs.data.dir &amp; dfs.datanode.data.dir</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Difference-between-dfs-data-dir-amp-dfs-datanode-data-dir/m-p/34496#M35818</link>
      <description>&lt;P&gt;Hi Team,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone please let me know, what is the difference between these 2 parameters ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cloudera manager sets&amp;nbsp;dfs.datanode.data.dir inside&amp;nbsp;/swap/ folder by default. In EC2 instance, /swap is a temporary directory, which gets deleted and recreated at bootup.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does that mean some blocks will be deleted&amp;nbsp;at start up and cluster will be curropted ?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have set up a single node cluster on a EC2 machine with CDH5.5.0 and facing the cluster curruption just after shut down and restart.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can it be one of the reason ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Vikas&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:50:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Difference-between-dfs-data-dir-amp-dfs-datanode-data-dir/m-p/34496#M35818</guid>
      <dc:creator>VikasSharma</dc:creator>
      <dc:date>2022-09-16T09:50:38Z</dc:date>
    </item>
    <item>
      <title>Re: Difference between dfs.data.dir &amp; dfs.datanode.data.dir</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Difference-between-dfs-data-dir-amp-dfs-datanode-data-dir/m-p/34584#M35819</link>
      <description>Hi Vikas,&lt;BR /&gt;&lt;BR /&gt;In general, we recommend storing data on instance storage drives for EC2 since EBS volumes are slow and charge you per access. Instance storage is ephemeral, which means that whether the dir is named "/swap" or something else, it'll disappear if you restart the machine. You should back up your data to a safe location before powering down your EC2 machine, as discussed here:&lt;BR /&gt;&lt;A href="http://www.cloudera.com/content/www/en-us/documentation/other/reference-architecture/PDF/cloudera_ref_arch_aws.pdf" target="_blank"&gt;http://www.cloudera.com/content/www/en-us/documentation/other/reference-architecture/PDF/cloudera_ref_arch_aws.pdf&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;The difference between dfs.data.dir and dfs.datanode.data.dir is that the first is a very old name used in CDH3 and perhaps earlier, while the second the preferred current config name as of CDH4. They are both logically the same thing.&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Darren</description>
      <pubDate>Mon, 30 Nov 2015 21:50:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Difference-between-dfs-data-dir-amp-dfs-datanode-data-dir/m-p/34584#M35819</guid>
      <dc:creator>Darren</dc:creator>
      <dc:date>2015-11-30T21:50:20Z</dc:date>
    </item>
  </channel>
</rss>

