I am planning to setup a Hadoop Cluster (A) with Cluster replication (B). so that once data is reached to Cluster A it will replicated to Cluster D. I am having one question if i delete data from Cluster A on the basis of Time like one month old data is it also removed from Cluster B. if yes how i can avoid this.
What i want to achieve.
1. Once data is reached to Cluster A it will automatically replicated to Cluster B.
2. After one year old data from Cluster A remove automatically but not from Cluster B.
I hope the below links will help you - Cloudera BDR