<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Nifi to process flow file with sequence numbers in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Nifi-to-process-flow-file-with-sequence-numbers/m-p/281136#M209190</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am moving data between hdfs directories to pick the latest updated flow file. The code should verify on the source hdfs directories to pick the latest merged json files time greater than 2 hrs and process to the target along with the sub folders if the target directory doesn't have the directories available. Push the files and append sequence number for every new file received on the same date and delete it from the source directory after processing it. If new files received, then reprocess with new sequence number.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Source hdfs path:-&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;/data/json/incoming/year=2019/month=10/day=22/$flow-file&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Target hdfs path:-&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;/data/json/final/$path/$flow-file&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Filename(received) :- source_es_2019_10_21.jsonl&lt;/P&gt;&lt;P&gt;Filename(required post processing) :-&lt;/P&gt;&lt;P&gt;source_es_2019_10_21_1.jsonl&lt;/P&gt;&lt;P&gt;source_es_2019_10_21_2.jsonl&lt;/P&gt;&lt;P&gt;source_es_2019_10_21_3.jsonl&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am currently using nififlow&lt;/P&gt;&lt;P&gt;&amp;nbsp;listhdfs-&amp;gt;updateattribute-&amp;gt;puthdfs-&amp;gt;deletehdfs&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have completed all the activates except generating sequence number for each received flow file for the same date. Could you please check and help.&lt;/P&gt;</description>
    <pubDate>Wed, 23 Oct 2019 14:46:57 GMT</pubDate>
    <dc:creator>Deenag</dc:creator>
    <dc:date>2019-10-23T14:46:57Z</dc:date>
    <item>
      <title>Nifi to process flow file with sequence numbers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Nifi-to-process-flow-file-with-sequence-numbers/m-p/281136#M209190</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am moving data between hdfs directories to pick the latest updated flow file. The code should verify on the source hdfs directories to pick the latest merged json files time greater than 2 hrs and process to the target along with the sub folders if the target directory doesn't have the directories available. Push the files and append sequence number for every new file received on the same date and delete it from the source directory after processing it. If new files received, then reprocess with new sequence number.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Source hdfs path:-&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;/data/json/incoming/year=2019/month=10/day=22/$flow-file&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Target hdfs path:-&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;/data/json/final/$path/$flow-file&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Filename(received) :- source_es_2019_10_21.jsonl&lt;/P&gt;&lt;P&gt;Filename(required post processing) :-&lt;/P&gt;&lt;P&gt;source_es_2019_10_21_1.jsonl&lt;/P&gt;&lt;P&gt;source_es_2019_10_21_2.jsonl&lt;/P&gt;&lt;P&gt;source_es_2019_10_21_3.jsonl&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am currently using nififlow&lt;/P&gt;&lt;P&gt;&amp;nbsp;listhdfs-&amp;gt;updateattribute-&amp;gt;puthdfs-&amp;gt;deletehdfs&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have completed all the activates except generating sequence number for each received flow file for the same date. Could you please check and help.&lt;/P&gt;</description>
      <pubDate>Wed, 23 Oct 2019 14:46:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Nifi-to-process-flow-file-with-sequence-numbers/m-p/281136#M209190</guid>
      <dc:creator>Deenag</dc:creator>
      <dc:date>2019-10-23T14:46:57Z</dc:date>
    </item>
    <item>
      <title>Re: Nifi to process flow file with sequence numbers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Nifi-to-process-flow-file-with-sequence-numbers/m-p/281298#M209296</link>
      <description>&lt;P&gt;I have got the issue resolved by providing the following value by adding a new property as filename.&lt;/P&gt;&lt;P&gt;${filename:substringBeforeLast('.')}_${uuid}.${filename:substringAfter('.')}&lt;/P&gt;</description>
      <pubDate>Thu, 24 Oct 2019 18:10:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Nifi-to-process-flow-file-with-sequence-numbers/m-p/281298#M209296</guid>
      <dc:creator>Deenag</dc:creator>
      <dc:date>2019-10-24T18:10:10Z</dc:date>
    </item>
  </channel>
</rss>

