A couple of questions on process groups with multiple input ports

Expert Contributor

Hello, a couple of quick questions on process groups with multiple input ports.

1) Within the process group, how do you distinguish between FlowFiles coming from the various input ports?
2) On the Data Provenance page, is there a way to tell which FlowFiles came from which input port?

Thanks.

1 ACCEPTED SOLUTION

Master Mentor
@Raj B

If you have FlowFiles arriving via multiple input ports and then passing through some common set of components downstream from them, there is no way to tell by looking at the FlowFile in a given queue which input port it originated from.

Input and output ports within a process group do not create provenance events either, since they do not modify the FlowFiles in any way. The only way an input or output port would generate a provenance event is if it were at the root canvas level, since input ports would generate a "create" event and output ports would generate a "drop" event.

Provenance will show a lineage for a FlowFile, which includes any processor that routed or modified the FlowFile in some way. So by looking at the details of the various events in the provenance lineage graph, you can see where the FlowFile traveled through your flow. However, as noted above, not every component creates provenance events (ports inside a process group, for example, do not).
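To make the idea concrete, here is a minimal sketch of reading a FlowFile's path off its lineage events. It is a toy model only: the `ProvenanceEvent` record, the `path_of` helper, and the component names are all hypothetical and are not NiFi's actual event schema or API. It assumes each input port feeds its own processor before the common components.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ProvenanceEvent:
    # Hypothetical, simplified record; NOT NiFi's real event schema.
    event_id: int
    flowfile_uuid: str
    event_type: str       # e.g. "ROUTE", "ATTRIBUTES_MODIFIED"
    component_name: str   # component that emitted the event


def path_of(flowfile_uuid: str, events: List[ProvenanceEvent]) -> List[str]:
    """Return the components that emitted events for one FlowFile, in event order."""
    own = [e for e in events if e.flowfile_uuid == flowfile_uuid]
    return [e.component_name for e in sorted(own, key=lambda e: e.event_id)]


# Assumed layout: port A feeds "RouteFromPortA", port B feeds "RouteFromPortB",
# and both then pass through the shared "CommonUpdate" processor.
# The ports themselves emit no events, so they never appear in the path.
events = [
    ProvenanceEvent(1, "ff-a", "ROUTE", "RouteFromPortA"),
    ProvenanceEvent(2, "ff-b", "ROUTE", "RouteFromPortB"),
    ProvenanceEvent(3, "ff-a", "ATTRIBUTES_MODIFIED", "CommonUpdate"),
    ProvenanceEvent(4, "ff-b", "ATTRIBUTES_MODIFIED", "CommonUpdate"),
]

print(path_of("ff-a", events))
```

In this model the origin is only recoverable because each port has a distinct first processor. If both ports fed "CommonUpdate" directly, the two paths would be identical and the lineage would not reveal which port a FlowFile came from.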

When you query provenance, you can access the lineage for any of the query results by clicking the show lineage icon:

[Screenshot: 11380-screen-shot-2017-01-12-at-53917-pm.png]

A lineage graph for the specific FlowFile will then be created and displayed:

[Screenshot: 11401-screen-shot-2017-01-12-at-54143-pm.png]

The red dot shows the event the lineage was calculated from. Every circle is another event in this particular FlowFile's life. You can right-click any of the events to view its details, including which specific processor in your flow produced that event.

Thanks,

Matt


2 REPLIES

Expert Contributor

Thanks @Matt