Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Best practice to copy data and tables from one cluster to another

avatar
Expert Contributor

What is the best practice to copy data and hive managed and external tables from one cluster to another?

Any good tools for that to automate process or scripts to validate the data?

1 ACCEPTED SOLUTION
3 REPLIES 3

avatar

@PJ

i think we have two option for moving data from Cluster to Cluster

1) Distcp --this is basic and more conman method for moving the data

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_Sys_Admin_Guides/content/ref-4d13c401-c6...

2) Falcon --Falcon is another option and we will get nice GUI and we can scheduler the jobs as well. but backend falcon is also running on Distcp.

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-.3.0/bk_data_governance/content/ch_hdp_data_gover...

I hope this will help you:)

avatar
Expert Contributor

Here's an updated Falcon doc link for HDP 2.5.3.