<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Transfer files to S3 based on file timestamp in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290969#M215218</link>
    <description>&lt;P&gt;1.&amp;nbsp;&lt;SPAN&gt;ListS3：List all object file path in the specific bucket.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;2.&amp;nbsp;RouteOnAttribute：Filter out unused file (optional).&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;3.&amp;nbsp;FetchS3Object&amp;nbsp;：Fetch file.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;4.&amp;nbsp;UpdateAttribute：Rename filename(file path in bucket) to the specific path.&amp;nbsp; ex：&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;5.&amp;nbsp;PutS3Object&amp;nbsp;：Put file to&amp;nbsp;the specific bucket.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;PS：Once execute&amp;nbsp;FetchS3Object, files will be load in memory. So, it's better to limit&amp;nbsp;Back Pressure Object Threshold or&amp;nbsp;Size Threshold&amp;nbsp;in the Connection after&amp;nbsp;FetchS3Object.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 04 Mar 2020 06:19:50 GMT</pubDate>
    <dc:creator>AustinLiu</dc:creator>
    <dc:date>2020-03-04T06:19:50Z</dc:date>
    <item>
      <title>Transfer files to S3 based on file timestamp</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290961#M215212</link>
      <description>&lt;P&gt;I have used case where i am reading files with timestamp and these files has to be transferred to S3 and create folder with respective dates. Ex: file names abcd.out.gz.20200303 ,&amp;nbsp;abcd.out.gz.20200302&lt;/P&gt;
&lt;P&gt;and the file&amp;nbsp;abcd.out.gz.20200303 need to be in S3 under /data/20200303&lt;/P&gt;
&lt;P&gt;and file&amp;nbsp;abcd.out.gz.20200302 under /data/20200302.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;How can i achieve this in NiFi.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Mar 2020 04:22:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290961#M215212</guid>
      <dc:creator>Gubbi</dc:creator>
      <dc:date>2020-03-04T04:22:19Z</dc:date>
    </item>
    <item>
      <title>Re: Transfer files to S3 based on file timestamp</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290969#M215218</link>
      <description>&lt;P&gt;1.&amp;nbsp;&lt;SPAN&gt;ListS3：List all object file path in the specific bucket.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;2.&amp;nbsp;RouteOnAttribute：Filter out unused file (optional).&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;3.&amp;nbsp;FetchS3Object&amp;nbsp;：Fetch file.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;4.&amp;nbsp;UpdateAttribute：Rename filename(file path in bucket) to the specific path.&amp;nbsp; ex：&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;5.&amp;nbsp;PutS3Object&amp;nbsp;：Put file to&amp;nbsp;the specific bucket.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;PS：Once execute&amp;nbsp;FetchS3Object, files will be load in memory. So, it's better to limit&amp;nbsp;Back Pressure Object Threshold or&amp;nbsp;Size Threshold&amp;nbsp;in the Connection after&amp;nbsp;FetchS3Object.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Mar 2020 06:19:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290969#M215218</guid>
      <dc:creator>AustinLiu</dc:creator>
      <dc:date>2020-03-04T06:19:50Z</dc:date>
    </item>
    <item>
      <title>Re: Transfer files to S3 based on file timestamp</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290971#M215220</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/75092"&gt;@AustinLiu&lt;/a&gt;&amp;nbsp;: But i need to transfer file abcd_20200303 to S3 folder 20200303 and respectively based on dates. Every day when the files arrive my processor should identify file based on date and push it to respective date folder in S3.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Mar 2020 06:31:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290971#M215220</guid>
      <dc:creator>Gubbi</dc:creator>
      <dc:date>2020-03-04T06:31:32Z</dc:date>
    </item>
    <item>
      <title>Re: Transfer files to S3 based on file timestamp</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290973#M215222</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/75092"&gt;@AustinLiu&lt;/a&gt;&amp;nbsp;Just to clarify, i am transferring files from linux box to S3.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Mar 2020 06:42:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/290973#M215222</guid>
      <dc:creator>Gubbi</dc:creator>
      <dc:date>2020-03-04T06:42:50Z</dc:date>
    </item>
    <item>
      <title>Re: Transfer files to S3 based on file timestamp</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/291048#M215272</link>
      <description>&lt;P&gt;I got solution for this. Had to use expression language in Object Key to fetch date from file and it worked. Below is expression :${filename:substringAfter('.gz.')}/${filename}&lt;/P&gt;</description>
      <pubDate>Wed, 04 Mar 2020 20:15:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Transfer-files-to-S3-based-on-file-timestamp/m-p/291048#M215272</guid>
      <dc:creator>Gubbi</dc:creator>
      <dc:date>2020-03-04T20:15:04Z</dc:date>
    </item>
  </channel>
</rss>

