Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Transferring data between CDH

Transferring data between CDH

Rising Star

Hi,

 

How do I transfer data between servers in 2 different data centers both running CDH4?

 

Thanks!

5 REPLIES 5

Re: Transferring data between CDH 4 and CDH 4

Master Guru
Could you clarify - what kind of data? HDFS files? (DistCp) Log/Event
files? (Flume) Or something else?

Re: Transferring data between CDH 4 and CDH 4

Rising Star

Both clusters are running cdh4.3.1.

 

I want to transfer data from one HDFS to another HDFS.

 

What are all possible ways?

 

Thank you!

Re: Transferring data between CDH 4 and CDH 4

Master Guru
There are a few ways:

Simple copies, run on dest.: hadoop fs -cp hdfs://source-nn/path
hdfs://dest-nn/path

Distributed (large) copies, run on dest.: hadoop
distcp hdfs://source-nn/path hdfs://dest-nn/path

Or if you have a Cloudera Enterprise license, use BDR:
http://www.cloudera.com/content/cloudera/en/resources/library/training/cloudera-enterprise-bdr-overv...

Re: Transferring data between CDH 4 and CDH 4

Rising Star

How do I transfer data between below clusters:

 

1. CDH4 to CDH5 

 

2. CDH5 to CDH4

 

3. CDH5 to CDH5

 

 

Highlighted

Re: Transferring data between CDH 4 and CDH 4

Master Guru
Could you clarify your goal here? The answer is still DistCp. Have you
looked at our documentation yet, and specifically have you looked at our
DistCp matrix page yet?
http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/cdh_admin_distcp_data_c...