Hi, dear experts!
I'm curious how I can control the read IO size in my MR jobs.
For example, I have a file in HDFS; under the hood it is stored as files in the Linux filesystem, e.g. /disk1/hadoop/.../.../blkXXX.
Ideally, the size of each such file should equal the block size (128-256 MB).
My question is: how can I set the IO size for read operations?