Support Questions

Find answers, ask questions, and share your expertise

Who agreed with this solution

avatar
Master Guru
It was either written with less repicas by the client, or someone changed it after it was written. For example Solr Tlogs I believe are written with a replica of 1. 
Each DFSClient has the ability to control the number of replicas.  As said, Solr uses 1 for Tlogs, MR uses (or used to use) 10 for job files for better chance of data locality.  It’s a decision made by whoever creates the client.  So it is expected that any file can have a different replication factor, within the limits of dfs.namenode.replication.min and dfs.replication.max which is enforced by the NameNode.

Cheers!
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

View solution in original post

Who agreed with this solution