Hi All,
I'm trying to understand How hadoop 3 store data on HDFS by erasure encoding.
As per erasure encoding, currently six built-in policies are supported:
RS-3-2-1024k,RS-6-3-1024k, RS-10-4-1024k, RS-LEGACY-6-3-1024k, XOR-2-1-1024k and REPLICATION.
Replication is general term which was also using in hadoop2(replicate the data 3x).
How Reed Solomony RS-3-2-1024k(3 data blocks, 2 parity blocks and 1024k cell size) or RS-6-3-1-24k(6 data blocks, 3 parity blocks and 1024k cell size) store the data?
Suppose we are having 3 data nodes, 2 NNs, 1 Edge node. We have to store the 1GB file(abc.txt) and Block size is 128MB. How RS-3-2-1024k, RS-6-3-1024k works?
What is meaning of 6 data blocks, 1024K?
Is there any specific prerequisites for number of DATANODE's required, according to policy?
Will appreciable in advance to help me to understand the hadoop 3 concept.
Regards,
Vinay K