Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

Best practice to copy data and tables from one cluster to another

Expert Contributor

What is the best practice for copying data, along with Hive managed and external tables, from one cluster to another?

Are there any good tools to automate the process, or scripts to validate the data?

1 ACCEPTED SOLUTION
3 REPLIES


@PJ

I think we have two options for moving data from cluster to cluster:

1) DistCp: this is the basic and more common method for moving the data.

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_Sys_Admin_Guides/content/ref-4d13c401-c6...

2) Falcon: this is another option. It gives you a nice GUI and lets you schedule the jobs as well, but on the backend Falcon also runs DistCp.

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-.3.0/bk_data_governance/content/ch_hdp_data_gover...
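
To make the DistCp option and the validation question a bit more concrete, here is a minimal Python sketch, not a definitive implementation. It assembles a `hadoop distcp` command line and compares two file listings by size. The helper names (`build_distcp_command`, `diff_listings`), the NameNode addresses, and the warehouse path are all hypothetical; the `{path: size}` dictionaries are assumed to come from whatever listing you collect on each cluster. Note that DistCp copies files only, so Hive table definitions in the metastore have to be recreated or migrated separately.

```python
import shlex


def build_distcp_command(src, dst, update=True, max_maps=None):
    """Assemble a `hadoop distcp` command as an argument list.

    src and dst are full URIs; the ones used below are placeholders.
    """
    cmd = ["hadoop", "distcp"]
    if update:
        # -update copies only files that are missing or differ on the target.
        cmd.append("-update")
    if max_maps is not None:
        # -m caps the number of simultaneous map tasks used for the copy.
        cmd += ["-m", str(max_maps)]
    cmd += [src, dst]
    return cmd


def diff_listings(source_sizes, target_sizes):
    """Compare {relative_path: size_in_bytes} listings from both clusters.

    Returns (missing, mismatched): paths absent on the target, and paths
    present on both sides whose sizes differ.
    """
    missing = sorted(p for p in source_sizes if p not in target_sizes)
    mismatched = sorted(
        p for p in source_sizes
        if p in target_sizes and source_sizes[p] != target_sizes[p]
    )
    return missing, mismatched


if __name__ == "__main__":
    # Hypothetical NameNode addresses and warehouse path.
    cmd = build_distcp_command(
        "hdfs://nn1:8020/apps/hive/warehouse/sales.db",
        "hdfs://nn2:8020/apps/hive/warehouse/sales.db",
        max_maps=20,
    )
    # Print the command for review instead of running it.
    print(shlex.join(cmd))
```

A size comparison like this only catches missing or truncated files; comparing row counts per table on both clusters would be a stronger check.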

I hope this helps. :)

Expert Contributor

Here's an updated Falcon doc link for HDP 2.5.3.