New Contributor
Posts: 1
Registered: ‎12-12-2018

How to recover data in HDFS, suspect that the deletion was caused by a script running on spark-shel

[ Edited ]

Hi, i am still investigating the deletion of our data in HDFS under /user included /user/hive/warehouse.

previously we have our data in /user/hive/warehouse/dbname/tablename/date_pr=20181201/partition

after deleted, it shows:

So, we assume it deleted directly the folder and it's partition.
It missed until 91TiB data and it hasn't moved to .Trash.
We also checked from Cloudera Navigator and didnt find any clues but we only suspect that the possibilty is coming from our daily ingestion using script running on spark-shell.
Please help and advise me how can i recover the data and how can i exactly trace the issue from logs or cloudera navigator.


New solutions