<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question NiFi GetHDFSFileInfo Directory Size in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372067#M241145</link>
    <description>&lt;P&gt;Very new to NiFi and I am trying to create a daily email with throughput totals from the previous day.&amp;nbsp; &amp;nbsp;I was getting nowhere figuring out how to capture and report on NiFi processor throughput so I figured I could start with a GetHDFSFileInfo process that could look at the various HDFS tables and report on the previous day's directory size.&amp;nbsp; &amp;nbsp;Similar to "hdfs dfs -du -h /warehouse/tablespace/managed/hive/table1" and then grabbing the partition directory size from the previous day.&amp;nbsp; &amp;nbsp;I could script it easy enough but I would like to keep everything in NiFi so I don't have to worry about scripts and cron jobs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;When I try to use GetHDFSFileInfo and do not recurse directories, I can get a list of all the partition directories but the length value is always 0.&amp;nbsp; &amp;nbsp;If I enable recurse, then I get every file (and they have the length value),&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Just curious if there was a way to have GetHDFSFileInfo provide partition level directory disk usage.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Jeff&lt;/P&gt;</description>
    <pubDate>Fri, 02 Jun 2023 21:15:27 GMT</pubDate>
    <dc:creator>JeffB</dc:creator>
    <dc:date>2023-06-02T21:15:27Z</dc:date>
    <item>
      <title>NiFi GetHDFSFileInfo Directory Size</title>
      <link>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372067#M241145</link>
      <description>&lt;P&gt;Very new to NiFi and I am trying to create a daily email with throughput totals from the previous day.&amp;nbsp; &amp;nbsp;I was getting nowhere figuring out how to capture and report on NiFi processor throughput so I figured I could start with a GetHDFSFileInfo process that could look at the various HDFS tables and report on the previous day's directory size.&amp;nbsp; &amp;nbsp;Similar to "hdfs dfs -du -h /warehouse/tablespace/managed/hive/table1" and then grabbing the partition directory size from the previous day.&amp;nbsp; &amp;nbsp;I could script it easy enough but I would like to keep everything in NiFi so I don't have to worry about scripts and cron jobs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;When I try to use GetHDFSFileInfo and do not recurse directories, I can get a list of all the partition directories but the length value is always 0.&amp;nbsp; &amp;nbsp;If I enable recurse, then I get every file (and they have the length value),&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Just curious if there was a way to have GetHDFSFileInfo provide partition level directory disk usage.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Jeff&lt;/P&gt;</description>
      <pubDate>Fri, 02 Jun 2023 21:15:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372067#M241145</guid>
      <dc:creator>JeffB</dc:creator>
      <dc:date>2023-06-02T21:15:27Z</dc:date>
    </item>
    <item>
      <title>Re: NiFi GetHDFSFileInfo Directory Size</title>
      <link>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372069#M241147</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/104872"&gt;@JeffB&lt;/a&gt;&amp;nbsp;Welcome to the Cloudera Community!&lt;BR /&gt;&lt;BR /&gt;To help you get the best possible solution, I have tagged our NiFi experts&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/35454"&gt;@MattWho&lt;/a&gt;&amp;nbsp;and&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/95503"&gt;@steven-matison&lt;/a&gt;&amp;nbsp; who may be able to assist you further.&lt;BR /&gt;&lt;BR /&gt;Please keep us updated on your post, and we hope you find a satisfactory solution to your query.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Jun 2023 22:39:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372069#M241147</guid>
      <dc:creator>DianaTorres</dc:creator>
      <dc:date>2023-06-02T22:39:20Z</dc:date>
    </item>
    <item>
      <title>Re: NiFi GetHDFSFileInfo Directory Size</title>
      <link>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372119#M241158</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Partition level HDFS directory disk usage is not avaible since this works on gievn direceoty path only and not at the disk level.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Thank you&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Jun 2023 11:32:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372119#M241158</guid>
      <dc:creator>ckumar</dc:creator>
      <dc:date>2023-06-05T11:32:40Z</dc:date>
    </item>
    <item>
      <title>Re: NiFi GetHDFSFileInfo Directory Size</title>
      <link>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372326#M241220</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/104872"&gt;@JeffB&lt;/a&gt;&amp;nbsp;Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future. Thanks.&lt;/P&gt;</description>
      <pubDate>Thu, 08 Jun 2023 15:08:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/NiFi-GetHDFSFileInfo-Directory-Size/m-p/372326#M241220</guid>
      <dc:creator>DianaTorres</dc:creator>
      <dc:date>2023-06-08T15:08:38Z</dc:date>
    </item>
  </channel>
</rss>

