Can anyone provide me a link/video on how to tune and configure the hadoop like memory settings for MR, Yarn, hive, TEZ and advanced configurations (configuring high availability, zookeepers quorums, spark etc) understanding the properties based on the cluster size (computing resources, memory), understanding the architecture and other applications running along side hadoop. How to configure properties for different hadoop components. I would really appreciate if someone can guide me on understanding the advanced properties and adjusting its values according to the cluster size.
The short answer is that tuning a hadoop cluster is not as simple as turning a couple of knobs. The reality is there are many components within a cluster, often each of them need to be tuned independently. Additionally, tuning each component will vary widely on the usage pattern of the cluster and the respective component.
Here are a couple of articles that you may find helpful:
@Praneender Vuppala - In addition to above answer, I'm pasting few of my blogs which will help you.
Hope this information helps!