Created 10-18-2015 05:43 AM
Does Falcon support replication and mirroring for HDFS and Hive between two Kerberized clusters (Prod and DR)?
Created 10-19-2015 01:42 AM
Set
<property> <name>oozie.launcher.mapreduce.job.hdfs-servers</name> <value>${nameNode1},${nameNode2}</value> </property>
As this documentation suggests: http://oozie.apache.org/docs/4.1.0/DG_DistCpActionExtension.html
Created 10-18-2015 02:56 PM
Can you tell if both these env using same KDC server? DistCP (which is what is used by Falcon for data replication) supports copying data between two kerberos environments so its a matter of making sure that the clusters can communicate correctly.
Created 10-19-2015 02:10 AM
Both clusters will be using either local KDC or AD Kerberos
Created 10-19-2015 01:42 AM
Set
<property> <name>oozie.launcher.mapreduce.job.hdfs-servers</name> <value>${nameNode1},${nameNode2}</value> </property>
As this documentation suggests: http://oozie.apache.org/docs/4.1.0/DG_DistCpActionExtension.html
Created 10-23-2015 03:50 PM
Falcon will automatically add this property to oozie jobs in secure kerberized clusters. User does not have to add the property separately.
When defining a cluster entity, please make sure to add the following cluster property.
<property name="dfs.namenode.kerberos.principal" value="nn/$my.internal@EXAMPLE.COM"/>
Created 10-23-2015 03:43 PM
Replication works between clusters using Kerberos since Dal (and some additional scenarios using HCat works from Dal-M10