How to recover data in HDFS; we suspect the deletion was caused by a script running on spark-shell

Hi, I am still investigating the deletion of our data in HDFS under /user, including /user/hive/warehouse.

Example:
Previously, our data was in /user/hive/warehouse/dbname/tablename/date_pr=20181201/partition

After the deletion, only this remains:
/user/hive/warehouse/dbname/tablename/

So we assume the partition folders and their contents were deleted directly.
Up to 91 TiB of data is missing, and it was not moved to .Trash.
We also checked Cloudera Navigator and did not find any clues. Our only suspicion is that the deletion came from our daily ingestion, which uses a script running on spark-shell.
Please advise how I can recover the data and how I can trace the exact cause from the logs or Cloudera Navigator.
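
To make the tracing part of the question concrete, here is a minimal sketch of how the NameNode audit log could be scanned for deletes under the warehouse path. It assumes the usual tab-separated key=value audit line layout (allowed, ugi, ip, cmd, src, dst, perm); the sample lines, the user name etl, and the IP address are made up for illustration, and the helper names parse_audit_line and find_deletes are hypothetical. The real log location and format depend on the cluster configuration.

```python
from typing import Dict, Iterable, List

AUDIT_MARKER = "FSNamesystem.audit:"

# Made-up sample lines in the assumed audit layout (not real log output).
SAMPLE_LINES = [
    "2018-12-02 03:15:07,412 INFO FSNamesystem.audit: allowed=true\tugi=etl (auth:SIMPLE)"
    "\tip=/10.0.0.5\tcmd=delete\tsrc=/user/hive/warehouse/dbname/tablename/date_pr=20181201"
    "\tdst=null\tperm=null\tproto=rpc",
    "2018-12-02 03:15:08,001 INFO FSNamesystem.audit: allowed=true\tugi=etl (auth:SIMPLE)"
    "\tip=/10.0.0.5\tcmd=getfileinfo\tsrc=/user/hive/warehouse/dbname/tablename"
    "\tdst=null\tperm=null\tproto=rpc",
]


def parse_audit_line(line: str) -> Dict[str, str]:
    """Split one audit line into a dict of its key=value fields."""
    head, sep, tail = line.partition(AUDIT_MARKER)
    if not sep:
        return {}  # not an audit line
    fields = {"timestamp": head.split(" INFO")[0].strip()}
    for token in tail.strip().split("\t"):
        # Partition on the first '=' only, so paths like date_pr=20181201 survive.
        key, eq, value = token.partition("=")
        if eq:
            fields[key.strip()] = value.strip()
    return fields


def find_deletes(lines: Iterable[str], prefix: str) -> List[Dict[str, str]]:
    """Return successful delete operations at or below the given path prefix."""
    base = prefix.rstrip("/")
    hits = []
    for line in lines:
        fields = parse_audit_line(line)
        src = fields.get("src", "")
        if (
            fields.get("cmd") == "delete"
            and fields.get("allowed") == "true"
            and (src == base or src.startswith(base + "/"))
        ):
            hits.append(fields)
    return hits


for hit in find_deletes(SAMPLE_LINES, "/user/hive/warehouse"):
    print(hit["timestamp"], hit["ugi"], hit["ip"], hit["src"])
```

On a real cluster the same filter would read the audit log file line by line instead of SAMPLE_LINES. The ugi and ip fields of any hit should show which user and host issued the delete, which can then be matched against the ingestion schedule.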

Regards,