<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Issue with Nifi Merge Content : Files stay in the queue infinitely ! in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151774#M114253</link>
    <description>&lt;A rel="user" href="https://community.cloudera.com/users/3486/cstanca.html" nodeid="3486"&gt;@Constantin Stanca&lt;/A&gt;&lt;P&gt;@Mohammed El Moumni&lt;/P&gt;&lt;P&gt;Queue thresholds are per node and will cause a queue to no longer accept additional FlowFiles, It will not prevent downstream processor from processing FlowFiles that are already in that queue.&lt;/P&gt;&lt;P&gt;
Had he received two 700MB CSV files on one node, then the 1GB threshold would have been exceeded thus preventing any additional FlowFiles from entering that queue (including the corresponding 70 byte header files). In that case you would be stuck, since merge would not have the files even on a single node needed to merge a bin.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Matt&lt;/P&gt;</description>
    <pubDate>Wed, 15 Mar 2017 01:10:15 GMT</pubDate>
    <dc:creator>MattWho</dc:creator>
    <dc:date>2017-03-15T01:10:15Z</dc:date>
    <item>
      <title>Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151766#M114245</link>
      <description>&lt;P&gt;I have a flow where I am using the Merge Content Processor. I noticed lately that some flowfiles stay infinitely in the queue just before the Merge Content. I can't figure out the issue so I am asking for your help !&lt;/P&gt;&lt;P&gt;This is the part of the flow that I am talking about :&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13492-1.png" style="width: 520px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21146i1F99754DCEE2BFA4/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13492-1.png" alt="13492-1.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The configuration of the merge content processor is here (merging in the attribute called "cle" and its value is the same for the 2 flowfiles in the queue ! But still they don't merge ) :&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13493-2.png" style="width: 1078px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21147i6892F43970890376/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13493-2.png" alt="13493-2.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Finally here is the content of the queue :&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13494-3.png" style="width: 1594px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21148i4E9305742CFEDD47/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13494-3.png" alt="13494-3.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Is this due to the first flowfile size (710 MB) ? is there a maximum size for a bin ? If yes why isn't it merged after reaching that size ?&lt;/P&gt;&lt;P&gt;Thank you for your help ! &lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 12:51:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151766#M114245</guid>
      <dc:creator>melmoumni_exter</dc:creator>
      <dc:date>2019-08-18T12:51:30Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151767#M114246</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/16168/melmoumniexterne.html" rel="nofollow noopener noreferrer" target="_blank"&gt;Mohammed El Moumni&lt;/A&gt;&lt;/P&gt;&lt;P&gt;A queue has a limit in size (1 GB) or 10,000 files by default.&lt;/P&gt;&lt;P&gt;To change the settings go to setting tab on "Configure" of that queue. See screenshot attached.&lt;/P&gt;&lt;P&gt;If it helps, please vote/accept response.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13505-screen-shot-2017-03-10-at-100448-am.png" style="width: 670px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21145iFC50C11A334CED6E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13505-screen-shot-2017-03-10-at-100448-am.png" alt="13505-screen-shot-2017-03-10-at-100448-am.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;It is also possible that downstream you may have another queue or processor stuck due to this limit set by default. You have to increase there and let the processor start processing to reduce the amount in the queue before your queue report may start to drain. Imagine all this flow like a river with all kind of streams and obstructions...&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 12:51:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151767#M114246</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2019-08-18T12:51:10Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151768#M114247</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16168/melmoumniexterne.html" nodeid="16168"&gt;@Mohammed El Moumni&lt;/A&gt;&lt;/P&gt;&lt;P&gt;If you take a look at the details of the flowfiles in the input queue for MergeContent, do you see the correlation attribute present on both flowfiles?  Is it possible that, elsewhere in the flow, a flowfile with a correlation ID the same as one of the two flowfiles in the incoming queue was sent to a failure relationship and had been dropped from the flow?  In the past, I have done a bit of processing of files from one of the Split* processors, and encountered errors processing one of the fragments.  Due to the way I had designed the flow, the fragment with the error was routed to a failure relationship to another processor that terminated the processing of that flowfile, so not all the fragments from the split were sent to MergeContent.  This caused all the other fragments to sit in the incoming queue of MergeContent indefinitely.&lt;/P&gt;</description>
      <pubDate>Sat, 11 Mar 2017 04:36:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151768#M114247</guid>
      <dc:creator>jts</dc:creator>
      <dc:date>2017-03-11T04:36:29Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151769#M114248</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/12547/jstorck.html" nodeid="12547"&gt;@Jeff Storck&lt;/A&gt;, the correlation attribute is present on both flowfiles and its value is the same. Also, I am sure that for a correlation attribute value, only two flowfiles will have that value. So with my settings : Minimum number of entries = 2, maximum number of entries = 2, I am sure that only those two flowfiles will merge. Still, in my case the two flowfiles in the screenshot stay infinitely in the queue ... I am pretty sure it's a size problem, but can't figure it out.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2017 14:42:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151769#M114248</guid>
      <dc:creator>melmoumni_exter</dc:creator>
      <dc:date>2017-03-13T14:42:31Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151770#M114249</link>
      <description>&lt;P&gt;Hi @&lt;A href="https://community.hortonworks.com/users/3486/cstanca.html" rel="nofollow noopener noreferrer" target="_blank"&gt;Constantin Stanca&lt;/A&gt;, I changed the back pressure data size to 2GB but the two flowfiles still don't merge ...&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13554-4.png" style="width: 489px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21144i7C64EF74E3FB11F1/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13554-4.png" alt="13554-4.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 12:51:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151770#M114249</guid>
      <dc:creator>melmoumni_exter</dc:creator>
      <dc:date>2019-08-18T12:51:02Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151771#M114250</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16168/melmoumniexterne.html" nodeid="16168"&gt;@Mohammed El Moumni&lt;/A&gt; Are other, smaller files merging?  I notice in both of your screenshots that the MergeContent processor is stopped, which will prevent files from being merged.  Was the processor stopped just to take the screenshots?&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2017 22:06:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151771#M114250</guid>
      <dc:creator>jts</dc:creator>
      <dc:date>2017-03-13T22:06:38Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151772#M114251</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/12547/jstorck.html" nodeid="12547"&gt;@Jeff Storck&lt;/A&gt; yes the processor was stopped just to take the screenshots (I left it for running for 1 day and the two files didn't merge). And yes smaller files merge (15MB files merge for example).&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2017 22:19:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151772#M114251</guid>
      <dc:creator>melmoumni_exter</dc:creator>
      <dc:date>2017-03-13T22:19:44Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151773#M114252</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16168/melmoumniexterne.html" nodeid="16168" target="_blank"&gt;@Mohammed El Moumni&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Each Node in a NiFi cluster runs its own copy of the dataflow and works on its own set of FlowFiles.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13628-screen-shot-2017-03-14-at-14329-pm.png" style="width: 938px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21143iDBBEC88DF2700085/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13628-screen-shot-2017-03-14-at-14329-pm.png" alt="13628-screen-shot-2017-03-14-at-14329-pm.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Looking at the screenshot you have above of your queue list, you can see that the two FlowFiles are not on the same node.  So each node is running a MergeContent processor and each node is waiting for another FlowFile to complete their bins. You will need to look back earlier in your dataflow to see how your data is being ingested by your nodes to make sure that the matching sets of files end up on the same node for merging.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Matt&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 12:50:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151773#M114252</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2019-08-18T12:50:55Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151774#M114253</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/3486/cstanca.html" nodeid="3486"&gt;@Constantin Stanca&lt;/A&gt;&lt;P&gt;@Mohammed El Moumni&lt;/P&gt;&lt;P&gt;Queue thresholds are per node and will cause a queue to no longer accept additional FlowFiles, It will not prevent downstream processor from processing FlowFiles that are already in that queue.&lt;/P&gt;&lt;P&gt;
Had he received two 700MB CSV files on one node, then the 1GB threshold would have been exceeded thus preventing any additional FlowFiles from entering that queue (including the corresponding 70 byte header files). In that case you would be stuck, since merge would not have the files even on a single node needed to merge a bin.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Matt&lt;/P&gt;</description>
      <pubDate>Wed, 15 Mar 2017 01:10:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151774#M114253</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2017-03-15T01:10:15Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151775#M114254</link>
      <description>&lt;P&gt;good eyes &lt;A rel="user" href="https://community.cloudera.com/users/525/mclark.html" nodeid="525"&gt;@Matt Clarke&lt;/A&gt; &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 15 Mar 2017 01:17:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151775#M114254</guid>
      <dc:creator>Raj_B</dc:creator>
      <dc:date>2017-03-15T01:17:01Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151776#M114255</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10100/rbolla.html" nodeid="10100"&gt;@Raj B&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;Thank you... Sometimes the most important piece of information is in the fine details.  Other give away that it was clustered was that both FlowFiles in that queue had same position "1".  Two FlowFiles in the same queue on the same node cannot occupy the same position.&lt;/P&gt;</description>
      <pubDate>Wed, 15 Mar 2017 01:25:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151776#M114255</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2017-03-15T01:25:39Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151777#M114256</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/88199/issue-with-nifi-merge-content-files-stay-in-the-qu.html#" rel="nofollow noopener noreferrer" target="_blank"&gt;@Matt Clarke&lt;/A&gt; This is an excellent answer, thank you very much. I am indeed using a cluster of nifi nodes, and my dataflow starts with a list/fetch as described by the answer of &lt;A href="https://community.hortonworks.com/questions/88199/issue-with-nifi-merge-content-files-stay-in-the-qu.html#" rel="nofollow noopener noreferrer" target="_blank"&gt;@Pierre Villard&lt;/A&gt; on this question : &lt;A href="https://community.hortonworks.com/questions/52112/nifi-load-distribution-in-getfile-processor.html" rel="nofollow noopener noreferrer" target="_blank"&gt;https://community.hortonworks.com/questions/52112/nifi-load-distribution-in-getfile-processor.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;So the beginning of my dataflow looks like this :&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13647-5.png" style="width: 835px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21142i1687C73C3A76FF90/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13647-5.png" alt="13647-5.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I am using the list/fetch pattern to take advantage of the cluster and improve the performance of the ingestion.&lt;/P&gt;&lt;P&gt;This leads me to ask the following question which is probably beyond the scope of the initial question and should be asked in the different post, but I am putting it here so that everyone in the same situation profits from your beautiful answers : does this mean that I can't use the merge content processor in these kind of dataflows (dataflows thar run on all nodes), as I don't have a way to control the node that will ingest a pair of matching flowfiles (flowfiles that have the same "cle" attribute) ? or could you think of a trick to handle this ?&lt;/P&gt;&lt;P&gt;Thanks again for your help !&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 12:50:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151777#M114256</guid>
      <dc:creator>melmoumni_exter</dc:creator>
      <dc:date>2019-08-18T12:50:47Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151778#M114257</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16168/melmoumniexterne.html" nodeid="16168" target="_blank"&gt;@Mohammed El Moumni&lt;/A&gt; 
Here is one possible dataflow design that can be used to make sure both FlowFiles in a pair end up on the same node after being distributed via the Remote Process Group (RPG):&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13715-screen-shot-2017-03-17-at-105928-am.png" style="width: 1158px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21141iFF9617CFD9A49F59/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13715-screen-shot-2017-03-17-at-105928-am.png" alt="13715-screen-shot-2017-03-17-at-105928-am.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;While it requires adding 5 additional processor to you flow, overhead is relatively light since you are dealing with very small FlowFiles all the way up to the point of the FetchFile processor.  You are still only fetching the ~700 MB content after cluster distribution.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Matt&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 12:50:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151778#M114257</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2019-08-18T12:50:39Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Nifi Merge Content : Files stay in the queue infinitely !</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151779#M114258</link>
      <description>&lt;P&gt;Great answer like usual ! Just tested your suggestion and it works perfectly ! Thank you so much !&lt;/P&gt;</description>
      <pubDate>Fri, 17 Mar 2017 23:46:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issue-with-Nifi-Merge-Content-Files-stay-in-the-queue/m-p/151779#M114258</guid>
      <dc:creator>melmoumni_exter</dc:creator>
      <dc:date>2017-03-17T23:46:17Z</dc:date>
    </item>
  </channel>
</rss>

