<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Why only flowfile repository disk is getting full and other repos are not? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Why-only-flowfile-repository-disk-is-getting-full-and-other/m-p/387780#M246426</link>
    <description>&lt;P&gt;Hello experts,&lt;/P&gt;&lt;P&gt;I am facing an issue on one of our NiFi servers, where we run multiple consume EventHub flows.&lt;/P&gt;&lt;P&gt;The flowfile repository disk is getting full, but the content and provenance repositories are not.&lt;/P&gt;&lt;P&gt;I have attached screenshots of the disk usage of all repositories and of the contents of the flowfile repository.&lt;/P&gt;&lt;P&gt;The journals folder is occupying a very large amount of space.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="hegdemahendra_0-1715418211654.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/40598i8C8C215BC636858E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="hegdemahendra_0-1715418211654.png" alt="hegdemahendra_0-1715418211654.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="hegdemahendra_1-1715418702964.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/40599iB65DE95771E5A3D0/image-size/medium?v=v2&amp;amp;px=400" role="button" title="hegdemahendra_1-1715418702964.png" alt="hegdemahendra_1-1715418702964.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;nifi.properties (related to the flowfile repository):&lt;BR /&gt;nifi.flowfile.repository.always.sync=false&lt;BR /&gt;nifi.flowfile.repository.checkpoint.interval=2 mins&lt;BR /&gt;nifi.flowfile.repository.directory=/flowfile&lt;BR /&gt;nifi.flowfile.repository.implementation=org.apache.nifi.controller.repository.WriteAheadFlowFileRepository&lt;BR /&gt;nifi.flowfile.repository.partitions=256&lt;BR /&gt;nifi.flowfile.repository.retain.orphaned.flowfiles=true&lt;BR /&gt;nifi.flowfile.repository.wal.implementation=org.apache.nifi.wali.SequentialAccessWriteAheadLog&lt;/P&gt;&lt;P&gt;Can anyone help me understand what the issue is and how to resolve it?&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;/P&gt;&lt;P&gt;Mahendra&lt;/P&gt;</description>
    <pubDate>Sat, 11 May 2024 09:13:56 GMT</pubDate>
    <dc:creator>hegdemahendra</dc:creator>
    <dc:date>2024-05-11T09:13:56Z</dc:date>
    <item>
      <title>Why only flowfile repository disk is getting full and other repos are not?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Why-only-flowfile-repository-disk-is-getting-full-and-other/m-p/387780#M246426</link>
      <description>&lt;P&gt;Hello experts,&lt;/P&gt;&lt;P&gt;I am facing an issue on one of our NiFi servers, where we run multiple consume EventHub flows.&lt;/P&gt;&lt;P&gt;The flowfile repository disk is getting full, but the content and provenance repositories are not.&lt;/P&gt;&lt;P&gt;I have attached screenshots of the disk usage of all repositories and of the contents of the flowfile repository.&lt;/P&gt;&lt;P&gt;The journals folder is occupying a very large amount of space.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="hegdemahendra_0-1715418211654.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/40598i8C8C215BC636858E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="hegdemahendra_0-1715418211654.png" alt="hegdemahendra_0-1715418211654.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="hegdemahendra_1-1715418702964.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/40599iB65DE95771E5A3D0/image-size/medium?v=v2&amp;amp;px=400" role="button" title="hegdemahendra_1-1715418702964.png" alt="hegdemahendra_1-1715418702964.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;nifi.properties (related to the flowfile repository):&lt;BR /&gt;nifi.flowfile.repository.always.sync=false&lt;BR /&gt;nifi.flowfile.repository.checkpoint.interval=2 mins&lt;BR /&gt;nifi.flowfile.repository.directory=/flowfile&lt;BR /&gt;nifi.flowfile.repository.implementation=org.apache.nifi.controller.repository.WriteAheadFlowFileRepository&lt;BR /&gt;nifi.flowfile.repository.partitions=256&lt;BR /&gt;nifi.flowfile.repository.retain.orphaned.flowfiles=true&lt;BR /&gt;nifi.flowfile.repository.wal.implementation=org.apache.nifi.wali.SequentialAccessWriteAheadLog&lt;/P&gt;&lt;P&gt;Can anyone help me understand what the issue is and how to resolve it?&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;/P&gt;&lt;P&gt;Mahendra&lt;/P&gt;</description>
      <pubDate>Sat, 11 May 2024 09:13:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Why-only-flowfile-repository-disk-is-getting-full-and-other/m-p/387780#M246426</guid>
      <dc:creator>hegdemahendra</dc:creator>
      <dc:date>2024-05-11T09:13:56Z</dc:date>
    </item>
    <item>
      <title>Re: Why only flowfile repository disk is getting full and other repos are not?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Why-only-flowfile-repository-disk-is-getting-full-and-other/m-p/387845#M246458</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/35454"&gt;@MattWho&lt;/a&gt;&amp;nbsp;- I would appreciate any comments you have on this issue. Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Tue, 14 May 2024 08:13:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Why-only-flowfile-repository-disk-is-getting-full-and-other/m-p/387845#M246458</guid>
      <dc:creator>hegdemahendra</dc:creator>
      <dc:date>2024-05-14T08:13:15Z</dc:date>
    </item>
    <item>
      <title>Re: Why only flowfile repository disk is getting full and other repos are not?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Why-only-flowfile-repository-disk-is-getting-full-and-other/m-p/388130#M246544</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/37332"&gt;@hegdemahendra&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;It is always very helpful to include the exact version of Apache NiFi, Cloudera HDF, or Cloudera CFM being used.&lt;BR /&gt;&lt;BR /&gt;My guess here would be one or both of the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;You have multiple FlowFiles, all pointing at the same content claims, queued in connections within your dataflow(s) on the canvas. As long as a FlowFile exists on the canvas, it will exist in the flowfile_repository. Users should avoid leaving FlowFiles queued in connections in NiFi. Some users tend to allow FlowFiles to accumulate at stopped processor components rather than auto-terminating them. Even if a FlowFile does not have any content, its FlowFile attributes/metadata still consume disk space.&lt;/LI&gt;&lt;LI&gt;You are extracting content from your FlowFiles into FlowFile attributes, resulting in large FlowFile attributes/metadata being stored in the flowfile_repository. Dataflow designers should avoid extracting large amounts of FlowFile content into the FlowFile's attributes. Instead, try to build dataflows and use components that read from the FlowFile's content rather than from FlowFile attributes.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;SPAN&gt;Thank you,&lt;BR /&gt;Matt&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 20 May 2024 21:01:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Why-only-flowfile-repository-disk-is-getting-full-and-other/m-p/388130#M246544</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2024-05-20T21:01:09Z</dc:date>
    </item>
  </channel>
</rss>

