Can anyone tell me whether deploying Hadoop across two different data centers on the same network will affect performance?
We are distributing the master nodes and data nodes across two data centers to avoid downtime.
However, both data centers are on the same network, so will this affect performance or not?
The Cloudera Distribution of Hadoop (CDH) can be deployed across data centers. Please take a look at this section of the Cloudera Bare Metal Reference Architecture:
Please pay close attention to the networking section. I generally do not recommend this topology because of the latency introduced when traffic crosses data centers.
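Before committing to this layout, it may help to get a rough number for the latency between the two sites. The following is only a minimal sketch, not part of any Cloudera tooling: it times plain TCP connects from one node to another and reports the median. The function name `tcp_connect_latency_ms` and its parameters are made up for illustration, and connect time is only a proxy for the round-trip latency Hadoop traffic would actually see.

```python
import socket
import statistics
import time


def tcp_connect_latency_ms(host, port, samples=5, timeout=2.0):
    """Return the median TCP connect time to host:port in milliseconds.

    Hypothetical helper for illustration: a TCP connect takes roughly one
    network round trip, so this is a coarse proxy for inter-node latency.
    """
    timings = []
    for _ in range(samples):
        start = time.perf_counter()
        # Open and immediately close a connection; only the handshake is timed.
        with socket.create_connection((host, port), timeout=timeout):
            pass
        timings.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(timings)
```

For example, running `tcp_connect_latency_ms("worker-dc2.example.com", 22)` from a node in the first data center, and the same call against a node in the local data center, gives two figures to compare. The hostname and port here are placeholders; substitute a host and an open port from your own cluster.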