Created 02-06-2017 10:48 PM
What is the best practice to copy data and hive managed and external tables from one cluster to another?
Any good tools for that to automate process or scripts to validate the data?
Created 02-07-2017 06:22 AM
Created 02-06-2017 11:33 PM
i think we have two option for moving data from Cluster to Cluster
1) Distcp --this is basic and more conman method for moving the data
2) Falcon --Falcon is another option and we will get nice GUI and we can scheduler the jobs as well. but backend falcon is also running on Distcp.
I hope this will help you:)
Created 02-07-2017 06:34 PM
Here's an updated Falcon doc link for HDP 2.5.3.
Created 02-07-2017 06:22 AM
Please have a look at below threads
https://community.hortonworks.com/questions/4496/how-to-migrate-hive-data-over-to-new-cluster.html
Hope this helps!