- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Best practice to copy data and tables from one cluster to another
- Labels:
-
Apache Hadoop
Created ‎02-06-2017 10:48 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
What is the best practice to copy data and hive managed and external tables from one cluster to another?
Any good tools for that to automate process or scripts to validate the data?
Created ‎02-07-2017 06:22 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Created ‎02-06-2017 11:33 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
i think we have two option for moving data from Cluster to Cluster
1) Distcp --this is basic and more conman method for moving the data
2) Falcon --Falcon is another option and we will get nice GUI and we can scheduler the jobs as well. but backend falcon is also running on Distcp.
I hope this will help you:)
Created ‎02-07-2017 06:34 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Here's an updated Falcon doc link for HDP 2.5.3.
Created ‎02-07-2017 06:22 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Please have a look at below threads
https://community.hortonworks.com/questions/4496/how-to-migrate-hive-data-over-to-new-cluster.html
Hope this helps!
