
HDFS data migration from one data center to another

New Contributor

What is the procedure for migrating HDFS data from one data center to another?

3 REPLIES

Master Collaborator

Hi @Visvanath_JP,

 

The question could be more specific: which Hadoop versions do the two clusters run, are both clusters secured, and are they CDH/CDP or HDP? Are you migrating only the data in the HDFS layer, or other layers as well, for example Hive, HBase, or Kudu?

 

The most common way is to use DistCp to migrate data between HDFS clusters.

https://docs.cloudera.com/cdp-private-cloud-base/7.1.7/scaling-namespaces/topics/hdfs-distcp-to-copy...

If you are using CDH/CDP, a BDR job is another option (it has DistCp integrated):

https://docs.cloudera.com/cdp-private-cloud-base/7.1.7/replication-manager/topics/rm-dc-hdfs-replica...

DistCp guide:

https://hadoop.apache.org/docs/current/hadoop-distcp/DistCp.html#:~:text=DistCp%20(distributed%20cop....
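
For illustration, here is a minimal sketch of how a DistCp copy between two clusters might be assembled and launched from a gateway host. The NameNode hostnames, port 8020, the /data path, and the helper names build_distcp_command and run_distcp are placeholders of mine, not values from this thread. The flags used (-update, -p, -m, -bandwidth) are standard DistCp options, but check the DistCp guide for your version before relying on them, and note that secured clusters need extra Kerberos setup that is not shown here.

```python
import shutil
import subprocess


def build_distcp_command(src_uri, dst_uri, num_maps=20, bandwidth_mb=None):
    """Build the argv list for an incremental DistCp copy.

    src_uri and dst_uri are full HDFS URIs; the values used below are
    hypothetical placeholders.
    """
    cmd = [
        "hadoop", "distcp",
        "-update",          # copy only files missing or changed on the target
        "-pugp",            # preserve user, group, and permissions
        "-m", str(num_maps),  # upper bound on simultaneous map tasks
    ]
    if bandwidth_mb is not None:
        # Per-map bandwidth cap in MB/s, useful on a shared inter-DC link.
        cmd += ["-bandwidth", str(bandwidth_mb)]
    cmd += [src_uri, dst_uri]
    return cmd


def run_distcp(cmd):
    """Run the command; requires the hadoop client on PATH."""
    if shutil.which("hadoop") is None:
        raise RuntimeError("hadoop client not found on PATH")
    return subprocess.run(cmd, check=True).returncode


if __name__ == "__main__":
    # Hypothetical source and destination NameNodes.
    command = build_distcp_command(
        "hdfs://source-nn.example.com:8020/data",
        "hdfs://dest-nn.example.com:8020/data",
        num_maps=50,
        bandwidth_mb=100,
    )
    # Print rather than execute, so the command can be reviewed first.
    print(" ".join(command))
```

Because -update skips files that already match on the target, the same command can be re-run to pick up changes made at the source during a long first pass.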

 

Regards,

Will

If the answer helps, please accept as solution and click thumbs up.

New Contributor

CDH Version 5.15.1

 

We only want to migrate HDFS data.

Master Collaborator

Then the solutions above meet your needs.