Support Questions
Find answers, ask questions, and share your expertise

SnapshotDiff not showing correct report

SnapshotDiff not showing correct report

New Contributor

We are doing incremental backup of hive tables using distcp -diff on the basis of two snapshots.

But sometimes this distcp is failing with error that particular file is not present in snapshots source directory.

 

When we do Snapshodiff between two snapshots, it shows some of the files are added

but when we do 'hdfs dfs -ls' for these files on snapshot folders. it shows file not present.

 

We suspect

  • its issue with Snapshotdiff.
  • or related hive compaction, as we get error only for hive delta/base files.image2021-4-26_17-9-14.pngimage2021-4-26_17-7-53.png