I am using Cloudbreak for Hortonworks Data Platform on Azure.
Initially I created a cluster with 1 master node and 2 worker nodes. Now I am adding 2 more worker nodes. After the cluster is resized to 4 worker nodes, does the data in HDFS get distributed across all 4 worker nodes? Also, is data processing performance affected by adding the 2 extra nodes?
I do not have much knowledge of how auto-scaling works.
One more thing: what is the performance difference between data processing on worker nodes and on compute nodes?