<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question RedHat and HDFS report different values for "du" in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RedHat-and-HDFS-report-different-values-for-quot-du-quot/m-p/39333#M24379</link>
    <description>&lt;P&gt;I copied a large folder structure from HDFS to RedHat using copyToLocal. While it looked successful, I want to validate that it was copied correctly by comparing the size of the data in HDFS and in RedHat. I'm using "du", but my numbers are still off.&lt;/P&gt;&lt;P&gt;I run the following on RH: "du -s -b &amp;lt;PATH&amp;gt;"&lt;/P&gt;&lt;P&gt;I run the following on HDFS: "hadoop fs -du -s &amp;lt;PATH&amp;gt;"&lt;/P&gt;&lt;P&gt;I noticed that RH reports 101 bytes for empty folders, while HDFS (CDH5.5.2) reports 0 bytes for empty folders. So my question is: how do I validate that the entire directory of data was fully transferred?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;</description>
    <pubDate>Mon, 04 Apr 2016 21:33:47 GMT</pubDate>
    <dc:creator>asherma5</dc:creator>
    <dc:date>2016-04-04T21:33:47Z</dc:date>
    <item>
      <title>RedHat and HDFS report different values for "du"</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RedHat-and-HDFS-report-different-values-for-quot-du-quot/m-p/39333#M24379</link>
      <description>&lt;P&gt;I copied a large folder structure from HDFS to RedHat using copyToLocal. While it looked successful, I want to validate that it was copied correctly by comparing the size of the data in HDFS and in RedHat. I'm using "du", but my numbers are still off.&lt;/P&gt;&lt;P&gt;I run the following on RH: "du -s -b &amp;lt;PATH&amp;gt;"&lt;/P&gt;&lt;P&gt;I run the following on HDFS: "hadoop fs -du -s &amp;lt;PATH&amp;gt;"&lt;/P&gt;&lt;P&gt;I noticed that RH reports 101 bytes for empty folders, while HDFS (CDH5.5.2) reports 0 bytes for empty folders. So my question is: how do I validate that the entire directory of data was fully transferred?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Mon, 04 Apr 2016 21:33:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RedHat-and-HDFS-report-different-values-for-quot-du-quot/m-p/39333#M24379</guid>
      <dc:creator>asherma5</dc:creator>
      <dc:date>2016-04-04T21:33:47Z</dc:date>
    </item>
    <item>
      <title>Re: RedHat and HDFS report different values for "du"</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RedHat-and-HDFS-report-different-values-for-quot-du-quot/m-p/39463#M24380</link>
      <description>As an alternative solution, I am checking that the two file structures contain the same number of folders and files.</description>
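The count comparison described in this reply could be sketched roughly as follows. This is only an illustration, not the poster's actual method: local_summary is a hypothetical helper, and the idea is to compare its result by hand with the DIR_COUNT, FILE_COUNT and CONTENT_SIZE columns printed by "hadoop fs -count" for the HDFS path. It assumes the HDFS directory count includes the top-level directory itself, which is worth confirming on your cluster. Only file bytes are summed, because GNU "du -b" also adds the size of each directory entry, which is a likely source of the mismatch with HDFS.

```python
import os
import tempfile


def local_summary(root):
    """Return (dir_count, file_count, total_file_bytes) for a local tree.

    Directory entries contribute to dir_count only; their own on-disk
    size is deliberately not added to total_file_bytes.
    """
    dir_count = 1  # assumption: count the root itself, like `hadoop fs -count`
    file_count = 0
    total_bytes = 0
    for current, dirnames, filenames in os.walk(root):
        dir_count += len(dirnames)
        for name in filenames:
            file_count += 1
            total_bytes += os.path.getsize(os.path.join(current, name))
    return dir_count, file_count, total_bytes


if __name__ == "__main__":
    # Build a tiny throwaway tree: one empty folder, one folder with a file.
    with tempfile.TemporaryDirectory() as root:
        os.makedirs(os.path.join(root, "empty"))
        os.makedirs(os.path.join(root, "data"))
        with open(os.path.join(root, "data", "a.txt"), "wb") as handle:
            handle.write(b"hello")
        print(local_summary(root))  # prints (3, 1, 5)
```

If all three numbers match the HDFS side, the folder count, file count and total file bytes agree; this does not prove that file contents are identical, so a checksum comparison would still be a stronger check.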
      <pubDate>Thu, 07 Apr 2016 16:02:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RedHat-and-HDFS-report-different-values-for-quot-du-quot/m-p/39463#M24380</guid>
      <dc:creator>asherma5</dc:creator>
      <dc:date>2016-04-07T16:02:40Z</dc:date>
    </item>
  </channel>
</rss>

