If the jobs submitted using oozie and all DNs and NN has replication factor, i checked hdfs-site.xml and mapred-site.xml at all the cluster nodes and all has the value 2, which service i should restart after the change?
yes, I'm looking at /etc/hadoop/conf.
I already tired and restarted the oozie with no success.
I'm using hadoop version 2.0.0-cdh4.3.0, tried to check under /var/run/mapred dirs but find only pid file.
Under /var/run this is what i see:
Changed at all the cluster nodes and restarted all services at the cluster after.
It didn't solve the issue.
Looking at one of the running jobs conf and see the following with replication factor 3:
Any other ideas?
The more intersting in the issue that it's happens only for the output of specific jobs and notf or all the HDFS.
Is there any way to set that the new written files to specific dir to be with specific replication factor?
Digging down in the cluster, i found one of the application that runs outside of the hadoop cluster has clients that make hdfs dfs -put to the hadoop cluster, these clients weren't have hdfs-site.xml and it got the default replication factor for the cluster, what i did? tested the hdfs dfs -put from a cleint server in my cluster and the client out side the cluster and notice the client outside the cluster put files with replication factor 3, to solve the issue i added hdfs-site.xml to each of the clients outside the cluster and override the default replication factor at the file.