<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: NIFI - ListSFTP / FETCHSFTP / PUTHDFS in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136755#M43834</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10363/maykiwogno.html" nodeid="10363" target="_blank"&gt;@mayki wogno&lt;/A&gt; if you want to use the S2S protocol to distribute the SFTP fetches over the 4 NiFi nodes, then it will be necessary to have an RPG. ListSFTP would be configured to only run on the primary node and would connect to the RPG, which would point back to the same NiFi cluster.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="8654-screen-shot-2016-10-18-at-102755-am.png" style="width: 714px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/22005i2D36703D3BC48D5B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="8654-screen-shot-2016-10-18-at-102755-am.png" alt="8654-screen-shot-2016-10-18-at-102755-am.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;You would then connect the associated input port with the process group containing the FetchSFTP and PutHDFS processors. &lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="8655-screen-shot-2016-10-18-at-103033-am.png" style="width: 672px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/22006i9D0407A221492B23/image-size/medium?v=v2&amp;amp;px=400" role="button" title="8655-screen-shot-2016-10-18-at-103033-am.png" alt="8655-screen-shot-2016-10-18-at-103033-am.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;In a NiFi cluster, each node is processing the same dataflow (with the exception of Isolated Processors like ListSFTP only run on the primary node). Without a distribution mechanism such as the S2S protocol, there is no means to partition the file listing metadata so that each processing node fetches a distinct subset of the files on the SFTP server. &lt;/P&gt;</description>
    <pubDate>Mon, 19 Aug 2019 08:48:50 GMT</pubDate>
    <dc:creator>slachterman</dc:creator>
    <dc:date>2019-08-19T08:48:50Z</dc:date>
    <item>
      <title>NIFI - ListSFTP / FETCHSFTP / PUTHDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136754#M43833</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I'm running cluster nifi with 4 nodes.&lt;/P&gt;&lt;P&gt;how I would like to setup a dataflow with sftp processors.&lt;/P&gt;&lt;P&gt;It is necessary to have RPG between listsftp and fetchsftp ?&lt;/P&gt;&lt;P&gt;Or can i simply make &lt;/P&gt;&lt;P&gt;listsftp (primary node) --&amp;gt; fetchsftp (all nodes) --&amp;gt; puthdfs (all nodes)&lt;/P&gt;&lt;P&gt;regards &lt;/P&gt;</description>
      <pubDate>Tue, 18 Oct 2016 21:41:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136754#M43833</guid>
      <dc:creator>maykiwogno</dc:creator>
      <dc:date>2016-10-18T21:41:33Z</dc:date>
    </item>
    <item>
      <title>Re: NIFI - ListSFTP / FETCHSFTP / PUTHDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136755#M43834</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10363/maykiwogno.html" nodeid="10363" target="_blank"&gt;@mayki wogno&lt;/A&gt; if you want to use the S2S protocol to distribute the SFTP fetches over the 4 NiFi nodes, then it will be necessary to have an RPG. ListSFTP would be configured to only run on the primary node and would connect to the RPG, which would point back to the same NiFi cluster.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="8654-screen-shot-2016-10-18-at-102755-am.png" style="width: 714px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/22005i2D36703D3BC48D5B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="8654-screen-shot-2016-10-18-at-102755-am.png" alt="8654-screen-shot-2016-10-18-at-102755-am.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;You would then connect the associated input port with the process group containing the FetchSFTP and PutHDFS processors. &lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="8655-screen-shot-2016-10-18-at-103033-am.png" style="width: 672px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/22006i9D0407A221492B23/image-size/medium?v=v2&amp;amp;px=400" role="button" title="8655-screen-shot-2016-10-18-at-103033-am.png" alt="8655-screen-shot-2016-10-18-at-103033-am.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;In a NiFi cluster, each node is processing the same dataflow (with the exception of Isolated Processors like ListSFTP only run on the primary node). Without a distribution mechanism such as the S2S protocol, there is no means to partition the file listing metadata so that each processing node fetches a distinct subset of the files on the SFTP server. &lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 08:48:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136755#M43834</guid>
      <dc:creator>slachterman</dc:creator>
      <dc:date>2019-08-19T08:48:50Z</dc:date>
    </item>
    <item>
      <title>Re: NIFI - ListSFTP / FETCHSFTP / PUTHDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136756#M43835</link>
      <description>&lt;P&gt;@Slachterman thanks.. For RPG, if with secured cluster Nifi that URL is used?&lt;/P&gt;&lt;P&gt;&lt;A href="Https://nifi001:9443/" target="_blank"&gt;Https://nifi001:9443/&lt;/A&gt;? &lt;/P&gt;</description>
      <pubDate>Tue, 18 Oct 2016 23:16:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136756#M43835</guid>
      <dc:creator>maykiwogno</dc:creator>
      <dc:date>2016-10-18T23:16:38Z</dc:date>
    </item>
    <item>
      <title>Re: NIFI - ListSFTP / FETCHSFTP / PUTHDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136757#M43836</link>
      <description>&lt;P&gt;That's right, the URL would specify HTTPS and the port on which NiFi is running on that host. With the new masterless architecture in HDF 2.0, the URL specified in the RPG can be any cluster node (in previous versions it had to be the NCM).&lt;/P&gt;&lt;P&gt;Please accept the above answer if it was helpful to you.&lt;/P&gt;</description>
      <pubDate>Tue, 18 Oct 2016 23:23:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136757#M43836</guid>
      <dc:creator>slachterman</dc:creator>
      <dc:date>2016-10-18T23:23:38Z</dc:date>
    </item>
    <item>
      <title>Re: NIFI - ListSFTP / FETCHSFTP / PUTHDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136758#M43837</link>
      <description>&lt;P&gt;thanks, I'll  try it and tell you it is ok.&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 14:35:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/NIFI-ListSFTP-FETCHSFTP-PUTHDFS/m-p/136758#M43837</guid>
      <dc:creator>maykiwogno</dc:creator>
      <dc:date>2016-10-20T14:35:21Z</dc:date>
    </item>
  </channel>
</rss>

