- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Small file in hadoop
- Labels:
-
Apache Atlas
Created 07-20-2023 03:16 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Team ,
As we have more then 14 million small file as per Cloudera Navigator as below :
14.8M Small files created in
the last 30 days
14.8M / 21.4M
69.3% small files
We are doing partition of data per day wise can we increase it or any other suggestion is there to overcome the small file problem.
Thank You in Advance
Created 07-21-2023 04:11 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi @cdl-support . You can refer to the below article and check if those help.
https://my.cloudera.com/knowledge/Issue-with-Small-Files-in-HDFS?id=308948
Using Hive : https://docs.cloudera.com/best-practices/latest/impala-performance/topics/bp-impala-avoiding-small-f...
Created 07-20-2023 01:24 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@cdl-support Welcome to the Cloudera Community!
To help you get the best possible solution, I have tagged our Atlas experts @BennyZ and @mayank_gupta who may be able to assist you further.
Please keep us updated on your post, and we hope you find a satisfactory solution to your query.
Regards,
Diana Torres,Community Moderator
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.
Learn more about the Cloudera Community:
Created 07-21-2023 04:11 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi @cdl-support . You can refer to the below article and check if those help.
https://my.cloudera.com/knowledge/Issue-with-Small-Files-in-HDFS?id=308948
Using Hive : https://docs.cloudera.com/best-practices/latest/impala-performance/topics/bp-impala-avoiding-small-f...
Created 07-25-2023 01:02 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@cdl-support Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future. Thanks.
Regards,
Diana Torres,Community Moderator
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.
Learn more about the Cloudera Community:
