Hadoop distribute data equally across all the data nodes. Is there any way to change the way how it distribute the data. I want to give more data to that data node which has more processing power/memory.
I would probably stop the slower datanodes first and then run balancer, if you want to get sophisticated, you can try writing your own balancer job. Great question!