<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question How to back up, store and read NiFi Provenance (Lucene) *.gz files in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-to-backup-store-and-read-Nifi-Provence-lucene-gz-files/m-p/413498#M254092</link>
    <description>&lt;P&gt;We are looking at storing the provenance *.gz (Lucene) files so that we have a record of all activity through NiFi, as our IT security people are fussy about such things.&lt;/P&gt;&lt;P&gt;It seems NiFi writes to multiple files at the same time, so this may not be possible. Is there a better way of doing this?&lt;/P&gt;&lt;P&gt;I would welcome thoughts on how we might do this, or suggestions for a different way to achieve the same logging outcome.&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
    <pubDate>Wed, 04 Feb 2026 02:59:02 GMT</pubDate>
    <dc:creator>zzzz77</dc:creator>
    <dc:date>2026-02-04T02:59:02Z</dc:date>
    <item>
      <title>How to back up, store and read NiFi Provenance (Lucene) *.gz files</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-backup-store-and-read-Nifi-Provence-lucene-gz-files/m-p/413498#M254092</link>
      <description>&lt;P&gt;We are looking at storing the provenance *.gz (Lucene) files so that we have a record of all activity through NiFi, as our IT security people are fussy about such things.&lt;/P&gt;&lt;P&gt;It seems NiFi writes to multiple files at the same time, so this may not be possible. Is there a better way of doing this?&lt;/P&gt;&lt;P&gt;I would welcome thoughts on how we might do this, or suggestions for a different way to achieve the same logging outcome.&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Feb 2026 02:59:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-backup-store-and-read-Nifi-Provence-lucene-gz-files/m-p/413498#M254092</guid>
      <dc:creator>zzzz77</dc:creator>
      <dc:date>2026-02-04T02:59:02Z</dc:date>
    </item>
    <item>
      <title>Re: How to back up, store and read NiFi Provenance (Lucene) *.gz files</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-backup-store-and-read-Nifi-Provence-lucene-gz-files/m-p/413503#M254096</link>
      <description>&lt;P&gt;Hello &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/136792"&gt;@zzzz77&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;Maybe this article can help you: &lt;A href="https://community.cloudera.com/t5/Community-Articles/Understanding-how-NiFi-s-Content-Repository-Archiving-works/ta-p/249418" target="_blank"&gt;https://community.cloudera.com/t5/Community-Articles/Understanding-how-NiFi-s-Content-Repository-Archiving-works/ta-p/249418&lt;/A&gt;&lt;BR /&gt;It explains how to handle the repository archive, which could work for what you need.&lt;/P&gt;&lt;P&gt;There are also other options, such as the Reporting Tasks documented here: &lt;A href="https://nifi.apache.org/docs/nifi-docs/" target="_blank"&gt;https://nifi.apache.org/docs/nifi-docs/&lt;/A&gt;&lt;BR /&gt;SiteToSiteProvenanceReportingTask is one option that fits your need.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Feb 2026 15:53:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-backup-store-and-read-Nifi-Provence-lucene-gz-files/m-p/413503#M254096</guid>
      <dc:creator>vafs</dc:creator>
      <dc:date>2026-02-04T15:53:33Z</dc:date>
    </item>
    <item>
      <title>Re: How to back up, store and read NiFi Provenance (Lucene) *.gz files</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-backup-store-and-read-Nifi-Provence-lucene-gz-files/m-p/413509#M254100</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/136792"&gt;@zzzz77&lt;/a&gt;&lt;BR /&gt;&lt;BR /&gt;Provenance can be very noisy, depending on the size of your dataflows and the number of FlowFiles being processed through them. The &lt;A href="https://nifi.apache.org/nifi-docs/administration-guide.html#provenance-repository" target="_blank" rel="noopener"&gt;provenance repo&lt;/A&gt; has age and size settings that trigger roll-off of old events, so you may never reach the retention age if you hit the size limit first. I would also not try to read provenance files while they are being written to.&lt;BR /&gt;&lt;BR /&gt;The &lt;A href="https://nifi.apache.org/components/org.apache.nifi.reporting.SiteToSiteProvenanceReportingTask/" target="_blank" rel="noopener"&gt;SiteToSiteProvenanceReportingTask&lt;/A&gt; might be the solution you are looking for in Apache NiFi. This reporting task sends all provenance events over the Site-to-Site protocol to a target NiFi, where you can then feed them into any long-term storage medium of your choice in a human-readable format.&lt;/P&gt;&lt;P&gt;Thank you,&lt;BR /&gt;Matt&lt;/P&gt;</description>
      <pubDate>Wed, 04 Feb 2026 18:03:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-backup-store-and-read-Nifi-Provence-lucene-gz-files/m-p/413509#M254100</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2026-02-04T18:03:40Z</dc:date>
    </item>
  </channel>
</rss>

