Data replication in different cluster



We are planning to create a new cluster intended for a different set of users. Currently, data comes into our existing cluster in multiple ways, mainly through Kafka and through batch jobs that push data at regular intervals.

Also, data can be modified after it has been replicated to the new cluster. In that case, we want to copy only the delta instead of copying everything.

What would be an ideal tool for this job? Thanks in advance!


Use DistCp (Distributed Copy), which is run with the distcp command.
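
For the delta requirement in the question, DistCp's -update option is probably the relevant one: it copies files that are missing on the target or that differ from the source, and skips the rest. Below is a minimal sketch, not a tested production script. The NameNode hosts, port, and path are placeholders I made up, and sync_delta is a hypothetical helper name. It only prints the command unless RUN=1 is set, so you can check it before launching a real copy.

```shell
#!/bin/sh
# Hypothetical helper: copy only new or changed files from src to dst.
# By default it prints the command; set RUN=1 to actually run DistCp.
sync_delta() {
  src="$1"
  dst="$2"
  if [ "${RUN:-0}" = "1" ]; then
    # -update skips files that already match on the target
    hadoop distcp -update "$src" "$dst"
  else
    echo "hadoop distcp -update $src $dst"
  fi
}

# Placeholder cluster addresses and path (assumptions, replace with your own).
sync_delta "hdfs://source-nn:8020/data/events" "hdfs://target-nn:8020/data/events"
```

Running it from a scheduler at the same cadence as your batch jobs would keep the second cluster roughly in step. Note that -update on its own does not remove files that were deleted on the source.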


You can use Apache NiFi if you want a GUI-driven solution.
