I know we have best practice for balance of # of cores and # of disks.
https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.4/bk_command-line-installation/content/determ...
# of containers = min (2*CORES, 1.8*DISKS, (Total available RAM) / MIN_CONTAINER_SIZE)
I believe, it means that slave nodes in "2*CORES = 1.8*DISKS" are best balanced one in point of CPUs and Disks.
Does anyone know the "CORES" means whether number of "physical" cores or number of "virtual" cores (i.e. Hyper-Threading Technology)?
If it means "physical" cores, number of physical CPU cores is nice to be 12 with 12 disks.
If it means "virtual" cores by for example Intel HT, 6 physical cores would be enough with 12 disks (best balanced node).
Also, I'm wandering, we should enable Hyper-Threading or not to get better "throughput".
Any reply, comment and suggestion will help me. Thanks!