Hi, I am looking for some recommendations for a typical CDH cluster filesystem layout. Can someone provide info on how the filesystems should be setup in the Util, Edge, Master and Data nodes for the various components like OS, logs, data etc. Thanks.
Assuming a very vanilla installation with average workload say 2 util/edge nodes, 3 master nodes, 10-20 data nodes. Data volume say 100-200TB, batch/realtime:50:50, 200-500users, yes HA needed. I am looking if someone can send me a df -h on their filesystem which was based on Cloudera recomendation. How the util,edge,master,datanode filesystems look like for a well-designed CDH cluster.