09-25-2017 06:25 AM
To improve read performance for a dataset, I would like to replicate the file's blocks to every datanode. It's a dimension dataset. One way would be to set the replication factor to a number higher than the number of datanodes, but I would like to know if there is a better way to do this.
Has anyone already done something like this?
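For reference, the replication-factor approach I mentioned could look roughly like the sketch below. It only wraps the standard `hdfs dfs -setrep` shell command; the helper names, the path `/data/dim/customers`, and the count of 5 datanodes are made-up placeholders, not values from my cluster. As far as I understand, HDFS keeps at most one replica of a block per datanode, so a factor equal to the number of datanodes should already put a copy on every node.

```python
import subprocess


def build_setrep_command(path, replication, wait=False):
    """Build the `hdfs dfs -setrep` command line for a file or directory.

    `path` and `replication` are supplied by the caller; nothing here is
    specific to a particular cluster.
    """
    if replication < 1:
        raise ValueError("replication factor must be at least 1")
    cmd = ["hdfs", "dfs", "-setrep"]
    if wait:
        # -w makes the command block until the target replication is reached.
        cmd.append("-w")
    cmd += [str(replication), path]
    return cmd


def set_replication(path, replication, wait=False):
    """Run the command. Assumes the `hdfs` CLI is on PATH and the cluster is reachable."""
    subprocess.run(build_setrep_command(path, replication, wait), check=True)


if __name__ == "__main__":
    # Hypothetical example: 5 datanodes, one dimension dataset.
    # Only prints the command; call set_replication(...) to actually run it.
    print(" ".join(build_setrep_command("/data/dim/customers", 5)))
```

I left `wait` off by default because, if the requested factor cannot be satisfied, waiting for it could take a very long time.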
09-25-2017 06:33 AM