I would like to know on what basis we decide a cluster's configuration, such as:
Number of nodes we need
RAM on each node
How many master and slave nodes we need
Is there a formula for calculating the specifications above? Suppose I have to build one cluster for a huge dataset and another for a small dataset: how do I decide which configuration factors to consider before building each cluster?