Created 01-12-2017 09:33 PM
Hello, couple of quick questions on Processor group with multiple input ports.
1) within the processor group, how do you distinguish between flowfiles that are coming from the various input ports 2) in data provenance page, is there a way to tell which flowfiles are from which input ports
Thanks.
Created on 01-12-2017 10:44 PM - edited 08-19-2019 02:58 AM
If you have FlowFiles arriving via multiple input ports and then passing through some common set of components downstream from them, there is no way to tell by looking at the FlowFile in a given queue which input port it originated from.
Input and output ports within a process group do not create provenance events either since they do not modify the FlowFiles in anyway. The only way an input port or output port would generate a Provenance event is if it was on the root canvas level since inout would generate a "create" event and output ports would create a "drop" event.
Provenance will show a lineage for a FlowFile which will show any processor which routed or modified the FlowFile in some way. So by looking at the details of the various events in the provenance lineage graph you can see where the FlowFile traversed through your Flow. However, as I stated not all processors create provenance events.
When you query provenance, you can access the lineage for any of the query results by clicking the show lineage icon:
A lineage graph for the specific FlowFile will then be created and displayed:
The red dot show the event the lineage was calculate from. Every circle is another event in this particular FlowFiles life. You can right click on any of the events to view the details of the event including which specific processor in your flow produced that event.
Thanks,
Matt
Created on 01-12-2017 10:44 PM - edited 08-19-2019 02:58 AM
If you have FlowFiles arriving via multiple input ports and then passing through some common set of components downstream from them, there is no way to tell by looking at the FlowFile in a given queue which input port it originated from.
Input and output ports within a process group do not create provenance events either since they do not modify the FlowFiles in anyway. The only way an input port or output port would generate a Provenance event is if it was on the root canvas level since inout would generate a "create" event and output ports would create a "drop" event.
Provenance will show a lineage for a FlowFile which will show any processor which routed or modified the FlowFile in some way. So by looking at the details of the various events in the provenance lineage graph you can see where the FlowFile traversed through your Flow. However, as I stated not all processors create provenance events.
When you query provenance, you can access the lineage for any of the query results by clicking the show lineage icon:
A lineage graph for the specific FlowFile will then be created and displayed:
The red dot show the event the lineage was calculate from. Every circle is another event in this particular FlowFiles life. You can right click on any of the events to view the details of the event including which specific processor in your flow produced that event.
Thanks,
Matt
Created 01-13-2017 03:37 AM
Thanks @Matt