Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

AWS EMR format datanode metadata after completion of spark job

Highlighted

AWS EMR format datanode metadata after completion of spark job

New Contributor

I am using EMR & running a spark streaming job with yarn as resource manager and Hadoop 2.7.3-amzn-0, I want clean datanode files after completion of spark job : /mnt/hdfs/current/BP-2030300665-192.168.0.1-1495611838265/current/finalized/subdir0/subdir230/blk_1073800835 & blk_1073800835_60012.meta

Its increase my storage and facing disk storage full issue. Is there any way to achieve the same or any impact on my cluster if we delete the same?

Don't have an account?
Coming from Hortonworks? Activate your account here