
SnapshotDiff not showing correct report


We are taking incremental backups of Hive tables using distcp -diff between two snapshots.

But sometimes this distcp fails with an error saying that a particular file is not present in the snapshot's source directory.


When we run snapshotDiff between the two snapshots, it reports some files as added,

but when we run 'hdfs dfs -ls' on those files in the snapshot folders, it says the files are not present.
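
To make the mismatch easier to reproduce, here is a minimal sketch of how the check could be scripted. It assumes the report has the usual snapshotDiff layout (a header line, then one entry per line starting with +, -, M or R followed by a relative path). The table path, snapshot names and delta directory names in the sample are hypothetical, not taken from our cluster, and the helper name is just for illustration. It only builds the paths that should exist under the later snapshot, so that each one can be checked with 'hdfs dfs -ls'.

```python
from typing import List


def added_paths_in_snapshot(report: str, snapshot_root: str, to_snapshot: str) -> List[str]:
    """Return full .snapshot paths for every entry marked '+' in a snapshotDiff report."""
    base = f"{snapshot_root.rstrip('/')}/.snapshot/{to_snapshot}"
    paths = []
    for line in report.splitlines():
        # Each entry is "<marker><whitespace><relative path>"; the header line
        # and any other text do not start with a lone '+' and are skipped.
        parts = line.split(None, 1)
        if len(parts) != 2 or parts[0] != "+":
            continue
        rel = parts[1].strip()
        if rel == ".":
            rel = ""
        elif rel.startswith("./"):
            rel = rel[2:]
        paths.append(f"{base}/{rel}" if rel else base)
    return paths


# Hypothetical sample report, for illustration only.
SAMPLE_REPORT = """Difference between snapshot s1 and snapshot s2 under directory /warehouse/db/tbl:
M\t.
+\t./delta_0000005_0000005_0000
-\t./delta_0000001_0000004
"""

if __name__ == "__main__":
    for path in added_paths_in_snapshot(SAMPLE_REPORT, "/warehouse/db/tbl", "s2"):
        # Print the command to run by hand against the later snapshot.
        print(f"hdfs dfs -ls {path}")
```

Any path printed here that 'hdfs dfs -ls' cannot find is one of the entries where the report and the snapshot contents disagree.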


We suspect

  • it is an issue with snapshotDiff itself,
  • or it is related to Hive compaction, as we get the error only for Hive delta/base files.