<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Nifi GetHDFS Warning - Could not remove from HDFS in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Nifi-GetHDFS-Warning-Could-not-remove-from-HDFS/m-p/310743#M224283</link>
    <description>&lt;P&gt;9 out of 10 times this message is caused because you run the GetHDFS on multiple nodes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Both nodes see it, perhaps even try to pick it up, but clearly not both of these can delete it.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In old versions of NiFi you can fix this by setting the GetHDFS to run only on the primary node.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, that will ofcourse burden the primary node more than it should.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So in recent versions (and likely yours) you will find the ListHDFS and FetchHDFS processors (and similar sets for different data sources). The lightweight List processor can then run on the primary node, and loadbalance to all nodes which will then Fetch.&lt;/P&gt;</description>
    <pubDate>Mon, 01 Feb 2021 14:27:09 GMT</pubDate>
    <dc:creator>DennisJaheruddi</dc:creator>
    <dc:date>2021-02-01T14:27:09Z</dc:date>
    <item>
      <title>Nifi GetHDFS Warning - Could not remove from HDFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Nifi-GetHDFS-Warning-Could-not-remove-from-HDFS/m-p/310579#M224222</link>
      <description>&lt;P&gt;I have created a process in Nifi to get a file from a first folder, compress it and delete the uncompressed file. I have used:&lt;/P&gt;&lt;P&gt;GetHDFS: to get the file, deleting it from the folder (Keep Source File is set to False)&lt;/P&gt;&lt;P&gt;PutHDFS: to compress the file and save in a second folder&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The process seems working, in fact the file is not anymore in the first folder and the compressed file is in the second folder.&lt;/P&gt;&lt;P&gt;The problem is that a warning message is displayed:&lt;/P&gt;&lt;P&gt;Could not remove &amp;lt;file path&amp;gt; from HDFS. Not ingesting this file...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So I have the doubt that the uncompressed file is still somewhere in the HDFS, but I don't know where.&lt;/P&gt;&lt;P&gt;What does it mean the warning message?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 28 Jan 2021 11:26:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Nifi-GetHDFS-Warning-Could-not-remove-from-HDFS/m-p/310579#M224222</guid>
      <dc:creator>sallyh</dc:creator>
      <dc:date>2021-01-28T11:26:56Z</dc:date>
    </item>
    <item>
      <title>Re: Nifi GetHDFS Warning - Could not remove from HDFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Nifi-GetHDFS-Warning-Could-not-remove-from-HDFS/m-p/310743#M224283</link>
      <description>&lt;P&gt;9 out of 10 times this message is caused because you run the GetHDFS on multiple nodes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Both nodes see it, perhaps even try to pick it up, but clearly not both of these can delete it.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In old versions of NiFi you can fix this by setting the GetHDFS to run only on the primary node.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, that will ofcourse burden the primary node more than it should.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So in recent versions (and likely yours) you will find the ListHDFS and FetchHDFS processors (and similar sets for different data sources). The lightweight List processor can then run on the primary node, and loadbalance to all nodes which will then Fetch.&lt;/P&gt;</description>
      <pubDate>Mon, 01 Feb 2021 14:27:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Nifi-GetHDFS-Warning-Could-not-remove-from-HDFS/m-p/310743#M224283</guid>
      <dc:creator>DennisJaheruddi</dc:creator>
      <dc:date>2021-02-01T14:27:09Z</dc:date>
    </item>
  </channel>
</rss>

