<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Does the 6.3.3 version Cloudera still experiencing small files issue? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Does-the-6-3-3-version-Cloudera-still-experiencing-small/m-p/291693#M215666</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/75200"&gt;@Mondi&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, small files can still cause an impact in CDH 6.3.3. This has nothing to do with the version of Cloudera but the way that the Namenode and HDFS interact when a lot of small files are stored in HDFS. Lots of small files create a lot of metadata that the Namenode must store and manage in memory.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To understand more about the impact of small files in HDFS and how to manage this, please refer to this article:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://blog.cloudera.com/small-files-big-foils-addressing-the-associated-metadata-and-application-challenges/" target="_blank"&gt;https://blog.cloudera.com/small-files-big-foils-addressing-the-associated-metadata-and-application-challenges/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Steve&lt;/P&gt;</description>
    <pubDate>Fri, 13 Mar 2020 11:24:57 GMT</pubDate>
    <dc:creator>StevenOD</dc:creator>
    <dc:date>2020-03-13T11:24:57Z</dc:date>
    <item>
      <title>Does the 6.3.3 version Cloudera still experiencing small files issue?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Does-the-6-3-3-version-Cloudera-still-experiencing-small/m-p/291688#M215664</link>
      <description>&lt;P&gt;Small and empty files are recurring on our current version of CDH Cluster. Does is still exist on 6.3.3 version?&lt;/P&gt;</description>
      <pubDate>Fri, 13 Mar 2020 11:00:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Does-the-6-3-3-version-Cloudera-still-experiencing-small/m-p/291688#M215664</guid>
      <dc:creator>Mondi</dc:creator>
      <dc:date>2020-03-13T11:00:39Z</dc:date>
    </item>
    <item>
      <title>Re: Does the 6.3.3 version Cloudera still experiencing small files issue?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Does-the-6-3-3-version-Cloudera-still-experiencing-small/m-p/291693#M215666</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/75200"&gt;@Mondi&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, small files can still cause an impact in CDH 6.3.3. This has nothing to do with the version of Cloudera but the way that the Namenode and HDFS interact when a lot of small files are stored in HDFS. Lots of small files create a lot of metadata that the Namenode must store and manage in memory.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To understand more about the impact of small files in HDFS and how to manage this, please refer to this article:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://blog.cloudera.com/small-files-big-foils-addressing-the-associated-metadata-and-application-challenges/" target="_blank"&gt;https://blog.cloudera.com/small-files-big-foils-addressing-the-associated-metadata-and-application-challenges/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Steve&lt;/P&gt;</description>
      <pubDate>Fri, 13 Mar 2020 11:24:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Does-the-6-3-3-version-Cloudera-still-experiencing-small/m-p/291693#M215666</guid>
      <dc:creator>StevenOD</dc:creator>
      <dc:date>2020-03-13T11:24:57Z</dc:date>
    </item>
    <item>
      <title>Re: Does the 6.3.3 version Cloudera still experiencing small files issue?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Does-the-6-3-3-version-Cloudera-still-experiencing-small/m-p/291800#M215719</link>
      <description>&lt;P&gt;Thanks &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/26024"&gt;@StevenOD&lt;/a&gt; i'll check on this&lt;/P&gt;</description>
      <pubDate>Mon, 16 Mar 2020 02:50:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Does-the-6-3-3-version-Cloudera-still-experiencing-small/m-p/291800#M215719</guid>
      <dc:creator>Mondi</dc:creator>
      <dc:date>2020-03-16T02:50:37Z</dc:date>
    </item>
  </channel>
</rss>

