<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to recover or clean a corrupt NiFi FlowFile and/or Provenance Repository? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-to-recover-or-clean-a-corrupt-NiFi-FlowFile-and-or/m-p/182393#M144559</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/286/dwynne.html" nodeid="286"&gt;@Wynner&lt;/A&gt;, thanks for the answers. It would be really good to have some kind of tool that could at least dump out the state of the repositories, so that we can try to understand more about what's going on. Something for the future, perhaps.&lt;/P&gt;&lt;P&gt;I'm not sure the issue is with the MergeContent processor exactly. I certainly don't understand why the repo has entered this state, where FlowFiles either cannot be found or appear to be stale. The problem is that, even with the processor running, the queue does not get processed, because the FlowFiles for the given IDs can't be found or are stale.&lt;/P&gt;&lt;P&gt;We have the following settings: min/max group size: 64MB-256MB, min # of entries: 1, max # of entries: 10000, Max Bin Age: 1 min, max # of bins: 100, Delimiter Strategy: Text, Attribute Strategy: Keep All Unique Attributes, Run Schedule: 0 sec.&lt;/P&gt;</description>
    <pubDate>Wed, 19 Jul 2017 16:56:17 GMT</pubDate>
    <dc:creator>richard_d_corfi</dc:creator>
    <dc:date>2017-07-19T16:56:17Z</dc:date>
  </channel>
</rss>

