Does Cloudera 6.3.3 still experience the small files issue?
- Labels: Cloudera Manager
Created on 03-13-2020 02:51 AM - last edited on 03-13-2020 04:00 AM by VidyaSargur
Small and empty files keep recurring on our current version of the CDH cluster. Does this issue still exist in version 6.3.3?
Created 03-13-2020 04:24 AM
Hi @Mondi ,
Yes, small files can still have an impact in CDH 6.3.3. This is not tied to the Cloudera version; it comes from how the NameNode tracks HDFS metadata. Every file, directory, and block is an object that the NameNode must store and manage in memory, so a large number of small files produces a large amount of metadata.
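As a rough illustration of why file count matters more than data volume, here is a minimal Python sketch. It assumes the commonly quoted rule of thumb of about 150 bytes of NameNode heap per file or block object and the default 128 MiB HDFS block size; the function names are hypothetical, directories are ignored, and the real footprint on your cluster will differ.

```python
# Rough estimate of NameNode heap used by file and block objects.
# ASSUMPTION: ~150 bytes per object is a rule of thumb, not a measured value.
BYTES_PER_OBJECT = 150
MIB = 1024 * 1024
GIB = 1024 * MIB
DEFAULT_BLOCK_SIZE = 128 * MIB  # default HDFS block size (dfs.blocksize)


def ceil_div(a: int, b: int) -> int:
    """Integer ceiling division."""
    return -(-a // b)


def namenode_objects(total_bytes: int, file_size: int,
                     block_size: int = DEFAULT_BLOCK_SIZE) -> int:
    """Count file + block objects when total_bytes is stored as
    equally sized files of file_size bytes (file_size must be > 0)."""
    num_files = ceil_div(total_bytes, file_size)
    blocks_per_file = ceil_div(file_size, block_size)
    return num_files * (1 + blocks_per_file)


def estimated_heap_bytes(total_bytes: int, file_size: int,
                         block_size: int = DEFAULT_BLOCK_SIZE) -> int:
    """Estimated NameNode heap in bytes under the rule of thumb above."""
    return namenode_objects(total_bytes, file_size, block_size) * BYTES_PER_OBJECT


if __name__ == "__main__":
    # Same 1 GiB of data, stored as one large file vs. many 1 MiB files.
    print("one 1 GiB file  :", estimated_heap_bytes(GIB, GIB), "bytes")
    print("1024 1 MiB files:", estimated_heap_bytes(GIB, MIB), "bytes")
```

Under these assumptions, the same 1 GiB of data needs 9 objects as a single file (1 file plus 8 blocks) but 2048 objects as 1 MiB files, so the metadata cost grows with the number of files rather than with the amount of data.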
To understand more about the impact of small files in HDFS and how to manage this, please refer to this article:
Regards,
Steve
Created 03-15-2020 07:50 PM
Thanks @StevenOD, I'll check on this.
