According with the best practices of Cloudera to implement CDH on AWS, I have some questions.
We have a cluster with replica factor of 3. That means if one rack is down, my data would be available in another rack.
We have configured rack awareness based on the availability zone where this instance is located. For example.
instance a (east-1a) - > rack /a
instance b (east-1b) - > rack /b
instance c (east-1c) - > rack /c
In that case, we can't use placement groups, as placement group should have all the instances under the same availability zone.
So my question is, What is the best practice for this? If we use placement groups we can't use rack awareness.