Support Questions
Find answers, ask questions, and share your expertise

CRC 32 Check for Same file in Hadoop

Highlighted

CRC 32 Check for Same file in Hadoop

I have copied a file from local to hdfs. Then I have copied this file into another folder in hdfs. Just to verify whether the file is copied properly I have used 'hadoop fs -checksum hdfs_filepath' for the same file in different location. But it results two different check sum though the data in the file in still same. I know that we can use -checksum even during the file copy. But just wanted to understand If wanted to see the md5 check sum of a file placed in different location, will it have different check sum values?