Hi, can anyone help me understand what the /user/username/.staging directory is? It has reached 4 TB. Can I delete it?
Created ‎04-10-2017 12:05 PM
I have noticed that the /user/username/.staging directory in HDFS has reached 4 TB, and it contains old files:
15160634 2016-02-09 09:30 /user/userprod/.staging/job_1443521267046_99999/job.jar
/job_1443521267046_99999/job.split
.staging/job_1443521267046_99999/job.splitme
/.staging/job_1443521267046_99999/job.xml
/.staging/job_1443521267046_99999/libjars
/.staging/job_1443521267046_99999/tez-conf.pb
/.staging/job_1443521267046_99999/tez-dag.pb.
/.staging/job_1443521267046_99999/tez.session
Can I remove this data?
Created ‎04-10-2017 12:19 PM
Is either of the following properties set to true?
keep.failed.task.files (MRv1) or mapreduce.task.files.preserve.failedtasks (MRv2).
If yes, that could be the reason the staging files are not being deleted. Set it to false and delete the files manually. Do not delete files that belong to a currently running job.
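One way to check the setting is to read it straight out of your mapred-site.xml. Here is a minimal sketch, assuming the usual Hadoop `<configuration>/<property>/<name>/<value>` layout. The helper name `read_property` and the sample XML are made up for illustration. If the property is not listed in the file at all, Hadoop falls back to its built-in default, which as far as I know is false for this setting.

```python
import xml.etree.ElementTree as ET


def read_property(xml_text, name, default=None):
    """Return the value of a Hadoop property from *-site.xml text, or default if absent."""
    root = ET.fromstring(xml_text)
    for prop in root.iter("property"):
        if prop.findtext("name", "").strip() == name:
            return prop.findtext("value", "").strip()
    return default


# Illustrative sample only; paste the contents of your own mapred-site.xml instead.
SAMPLE = """<configuration>
  <property>
    <name>mapreduce.task.files.preserve.failedtasks</name>
    <value>true</value>
  </property>
</configuration>"""

print(read_property(SAMPLE, "mapreduce.task.files.preserve.failedtasks", "false"))
```

A result of "true" would suggest the cluster is deliberately keeping files from failed tasks.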
In rare instances, a job failure can also leave staging files behind, and you might see their remnants here. These are just temporary MapReduce files. If no job is currently running, you can safely delete them and reclaim the space. Make sure the deleted files don't end up in the trash folder (use the -skipTrash option, or delete them from the trash folder afterwards).
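If you have many job directories, a small script can help you pick the stale ones before deleting anything. The sketch below is only an illustration: it assumes the eight-column output of `hdfs dfs -ls`, and the function names, the second job ID, the owner and group, and the cutoff date are all hypothetical. You would have to collect the running job IDs yourself (for example from `mapred job -list`). It only prints the commands, so you can review them first.

```python
from datetime import datetime


def stale_staging_dirs(ls_lines, running_job_ids, cutoff):
    """Pick staging dirs older than cutoff that do not belong to a running job."""
    stale = []
    for line in ls_lines:
        parts = line.split(None, 7)
        if len(parts) < 8:  # skips the "Found N items" header and blank lines
            continue
        modified = datetime.strptime(parts[5] + " " + parts[6], "%Y-%m-%d %H:%M")
        path = parts[7]
        job_id = path.rsplit("/", 1)[-1]
        if job_id not in running_job_ids and modified < cutoff:
            stale.append(path)
    return stale


def rm_command(path):
    """Build a recursive delete that bypasses the trash folder."""
    return ["hdfs", "dfs", "-rm", "-r", "-skipTrash", path]


# Hypothetical listing of /user/userprod/.staging for illustration.
LISTING = [
    "Found 2 items",
    "drwx------   - userprod hdfs          0 2016-02-09 09:30 /user/userprod/.staging/job_1443521267046_99999",
    "drwx------   - userprod hdfs          0 2017-04-10 11:58 /user/userprod/.staging/job_1443521267046_00001",
]
RUNNING = {"job_1443521267046_00001"}  # assumed to be still running
CUTOFF = datetime(2017, 4, 1)          # assumed age threshold

for path in stale_staging_dirs(LISTING, RUNNING, CUTOFF):
    print(" ".join(rm_command(path)))
```

Printing instead of executing is deliberate here: with -skipTrash there is no way to recover a directory that was deleted by mistake.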
Created ‎04-10-2017 12:27 PM
Thank you so much @mqureshi
I could not find mapreduce.task.files.preserve.failedtasks. I am using MRv2 on HDP 2.1.3, and I currently don't have any running jobs.
