Can some one help me in what are the best practices and ways to do the Data replication in Prod cluster.
Use falcon to mirror HDFS data.
Falcon can be used to mirror HIVE metadata.
Hope that helps!
Thanks for your answer. Do you know any best practices for using Distcp. I am looking for best practices other than Falcon.
With distcp you will not be able to replicate Hive metadata.Only HDFS data can be replicated!.
If you have any further question then feel free to ask.
If you like my answer then please select as best answer!
Hive Metastore replication
Once all the metastores are in HBase
Timothy, Thank you for your response. But I am looking for best ways to replicate HDFS also using Distcp.