Created 10-03-2015 08:13 AM
When using the Falcon "Mirror Recipe", what happens if an instance is still running when the next is scheduled to start?
Created 10-27-2015 08:23 PM
If you are asking about using recipes for mirroring: both hdfs-replication-template.xml and hive-replication-template.xml set the properties below to ensure that only one instance runs at a time.
<parallel>1</parallel> <!-- Dir replication needs to run only once to catch up --> <order>LAST_ONLY</order>
--------------------------------------------------------------------------------------------------------------
In general, in Falcon this can be controlled using the <parallel> and <order> properties in the entity XML. Parallel sets how many replication instances can run concurrently at any given time, and order sets the execution order for replication instances: FIFO, LIFO or LAST_ONLY. A replication job needs to run only once to catch up, so setting the configs below in the entity XML will ensure this:
<parallel>1</parallel> <order>LAST_ONLY</order>
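To show where these two elements sit, here is a minimal sketch of a Falcon process entity. The entity name, cluster name, validity dates, frequency and workflow path are hypothetical placeholders, not values taken from the recipe templates; only <parallel> and <order> come from the answer above.

```xml
<!-- Hypothetical process entity: names, dates and paths are placeholders -->
<process name="sample-mirror" xmlns="uri:falcon:process:0.1">
    <clusters>
        <cluster name="primaryCluster">
            <validity start="2015-10-01T00:00Z" end="2016-10-01T00:00Z"/>
        </cluster>
    </clusters>
    <!-- At most one instance runs at any given time -->
    <parallel>1</parallel>
    <!-- Among the instances that are waiting, run only the most recent one -->
    <order>LAST_ONLY</order>
    <frequency>minutes(30)</frequency>
    <timezone>UTC</timezone>
    <workflow name="sample-mirror-wf" engine="oozie" path="/apps/sample/mirror-workflow.xml"/>
</process>
```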
Parallel maps to concurrency and order maps to execution in the Oozie coordinator. Please refer to the Oozie documentation for more details.
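As a rough illustration of that mapping, the corresponding settings live in the <controls> block of an Oozie coordinator. This is a sketch of the equivalent fragment, not the exact coordinator XML that Falcon generates:

```xml
<!-- Fragment of an Oozie coordinator-app definition -->
<controls>
    <!-- Falcon <parallel>1</parallel>: one coordinator action runs at a time -->
    <concurrency>1</concurrency>
    <!-- Falcon <order>LAST_ONLY</order>: older pending actions are skipped in favour of the latest -->
    <execution>LAST_ONLY</execution>
</controls>
```

With these values, an instance that becomes due while another is still running waits, and if several pile up only the latest one is executed.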
Created 10-23-2015 04:48 PM
The Falcon "Mirror" Recipe has "parallelism" set to 1 since replication needs to run only once to catch up. The next instance will wait for the current instance to complete.
Created 10-28-2015 11:02 AM
Good answer. Sowmya's was more complete, with a description of how it works.
Created 10-27-2015 10:25 PM
@Sowmya Ramesh - Good answer! Since this was specific to "recipes", can you move the recipe part of the question to the top of your answer and keep the further detail on "parallel" and "order" after that? Then I can accept.
Created 10-27-2015 11:21 PM
@Sean Roberts: Done!