<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question ListSftp works but FetchSftp doesn't work in Cluster mode in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171085#M133382</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I deployed a 3 nodes cluster in AWS. One of them is NCM.&lt;/P&gt;&lt;P&gt;The embedded zookeeper servers are set in the two work nodes.&lt;/P&gt;&lt;P&gt;The data flow is: ListSftp -&amp;gt; FetchSftp -&amp;gt; PutFile.&lt;/P&gt;&lt;P&gt;The ListSftp is scheduled in the Primary node.&lt;/P&gt;&lt;P&gt;The issue is:&lt;/P&gt;&lt;P&gt;ListSftp works well. The test files are queued before coming into FetchSftp.&lt;/P&gt;&lt;P&gt;The error in FetchSftp is:&lt;/P&gt;&lt;P&gt;18:36:05 UTCERROR5cdfac90-2d07-443e-97b6-b06a1a883a22
172.31.48.155:8080FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22] FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22] failed to process due to org.apache.nifi.processor.exception.ProcessException: IOException thrown from FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22]: java.io.IOException: error; rolling back session: org.apache.nifi.processor.exception.ProcessException: IOException thrown from FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22]: java.io.IOException: error&lt;/P&gt;&lt;P&gt;I tried GetSftp -&amp;gt; PutFile with the same sftp setting. It works well.&lt;/P&gt;&lt;P&gt;I was wondering whether the issue is related with zookeeper or primary node talking with the other work node.&lt;/P&gt;&lt;P&gt;I didn't setup site-to-site property in nifi.properties.&lt;/P&gt;&lt;P&gt;Didn't setup distributed cache service.&lt;/P&gt;&lt;P&gt;How could I get more log details about this processor IOException?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
    <pubDate>Wed, 10 Aug 2016 02:01:24 GMT</pubDate>
    <dc:creator>alvinuw</dc:creator>
    <dc:date>2016-08-10T02:01:24Z</dc:date>
    <item>
      <title>ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171085#M133382</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I deployed a 3 nodes cluster in AWS. One of them is NCM.&lt;/P&gt;&lt;P&gt;The embedded zookeeper servers are set in the two work nodes.&lt;/P&gt;&lt;P&gt;The data flow is: ListSftp -&amp;gt; FetchSftp -&amp;gt; PutFile.&lt;/P&gt;&lt;P&gt;The ListSftp is scheduled in the Primary node.&lt;/P&gt;&lt;P&gt;The issue is:&lt;/P&gt;&lt;P&gt;ListSftp works well. The test files are queued before coming into FetchSftp.&lt;/P&gt;&lt;P&gt;The error in FetchSftp is:&lt;/P&gt;&lt;P&gt;18:36:05 UTCERROR5cdfac90-2d07-443e-97b6-b06a1a883a22
172.31.48.155:8080FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22] FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22] failed to process due to org.apache.nifi.processor.exception.ProcessException: IOException thrown from FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22]: java.io.IOException: error; rolling back session: org.apache.nifi.processor.exception.ProcessException: IOException thrown from FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22]: java.io.IOException: error&lt;/P&gt;&lt;P&gt;I tried GetSftp -&amp;gt; PutFile with the same sftp setting. It works well.&lt;/P&gt;&lt;P&gt;I was wondering whether the issue is related with zookeeper or primary node talking with the other work node.&lt;/P&gt;&lt;P&gt;I didn't setup site-to-site property in nifi.properties.&lt;/P&gt;&lt;P&gt;Didn't setup distributed cache service.&lt;/P&gt;&lt;P&gt;How could I get more log details about this processor IOException?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 02:01:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171085#M133382</guid>
      <dc:creator>alvinuw</dc:creator>
      <dc:date>2016-08-10T02:01:24Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171086#M133383</link>
      <description>&lt;P&gt;Alvin,&lt;/P&gt;&lt;P&gt;You should be able to get more details by adding the following line to your conf/logback.xml file:&lt;/P&gt;&lt;P&gt;&amp;lt;logger name="org.apache.nifi.processors.standard.FetchSFTP" level="DEBUG" /&amp;gt;&lt;/P&gt;&lt;P&gt;That will cause it to log the full stack trace so that you can see what's going on.&lt;/P&gt;&lt;P&gt;FetchSFTP does not interact with ZooKeeper or site-to-site, so you should be okay there. The Distributed Cache Service is also not necessary to use FetchSFTP.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 02:13:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171086#M133383</guid>
      <dc:creator>mpayne</dc:creator>
      <dc:date>2016-08-10T02:13:43Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171087#M133384</link>
      <description>&lt;P&gt;Alvin,&lt;/P&gt;&lt;P&gt;You should try setting the FetchSFTP to primary node also and see if that clears the error.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 02:36:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171087#M133384</guid>
      <dc:creator>Wynner</dc:creator>
      <dc:date>2016-08-10T02:36:45Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171088#M133385</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/367/mpayne.html" nodeid="367"&gt;@mpayne&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I added above logger setting in logback.xml. However, I still didn't find any hints.&lt;/P&gt;&lt;P&gt;The FetchSFTP processor is Time Driven. Only ListSFTP is on Primary Node.&lt;/P&gt;&lt;P&gt;Below is the details. Thanks.&lt;/P&gt;&lt;P&gt;2016-08-09 18:59:48,090 INFO [Clustering Tasks Thread-2] org.apache.nifi.cluster.heartbeat Heartbeat created at 2016-08-09 18:59:47,963 and sent at 2016-08-09 18:59:48,090; send took 0 millis
2016-08-09 18:59:48,091 ERROR [Timer-Driven Process Thread-10] o.a.nifi.processors.standard.FetchSFTP FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22] FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22] failed to process due to org.apache.nifi.processor.exception.ProcessException: IOException thrown from FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22]: java.io.IOException: error; rolling back session: org.apache.nifi.processor.exception.ProcessException: IOException thrown from FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22]: java.io.IOException: error
2016-08-09 18:59:48,093 ERROR [Timer-Driven Process Thread-10] o.a.nifi.processors.standard.FetchSFTP
org.apache.nifi.processor.exception.ProcessException: IOException thrown from FetchSFTP[id=5cdfac90-2d07-443e-97b6-b06a1a883a22]: java.io.IOException: error
        at org.apache.nifi.controller.repository.StandardProcessSession.write(StandardProcessSession.java:2013) ~[nifi-framework-core-0.7.0.jar:0.7.0]
        at org.apache.nifi.processors.standard.FetchFileTransfer.onTrigger(FetchFileTransfer.java:238) ~[na:na]
        at org.apache.nifi.processor.AbstractProcessor.onTrigger(AbstractProcessor.java:27) ~[nifi-api-0.7.0.jar:0.7.0]
        at org.apache.nifi.controller.StandardProcessorNode.onTrigger(StandardProcessorNode.java:1054) [nifi-framework-core-0.7.0.jar:0.7.0]
        at org.apache.nifi.controller.tasks.ContinuallyRunProcessorTask.call(ContinuallyRunProcessorTask.java:136) [nifi-framework-core-0.7.0.jar:0.7.0]
        at org.apache.nifi.controller.tasks.ContinuallyRunProcessorTask.call(ContinuallyRunProcessorTask.java:47) [nifi-framework-core-0.7.0.jar:0.7.0]
        at org.apache.nifi.controller.scheduling.TimerDrivenSchedulingAgent$1.run(TimerDrivenSchedulingAgent.java:127) [nifi-framework-core-0.7.0.jar:0.7.0]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_101]
        at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_101]
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_101]
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_101]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_101]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_101]
        at java.lang.Thread.run(Thread.java:745) [na:1.8.0_101]
Caused by: java.io.IOException: error
        at com.jcraft.jsch.ChannelSftp$2.read(ChannelSftp.java:1421) ~[na:na]
        at com.jcraft.jsch.ChannelSftp$2.read(ChannelSftp.java:1340) ~[na:na]
        at org.apache.nifi.stream.io.StreamUtils.copy(StreamUtils.java:35) ~[nifi-utils-0.7.0.jar:0.7.0]
        at org.apache.nifi.processors.standard.FetchFileTransfer$1.process(FetchFileTransfer.java:241) ~[na:na]
        at org.apache.nifi.controller.repository.StandardProcessSession.write(StandardProcessSession.java:1998) ~[nifi-framework-core-0.7.0.jar:0.7.0]
        ... 13 common frames omitted&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 02:53:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171088#M133385</guid>
      <dc:creator>alvinuw</dc:creator>
      <dc:date>2016-08-10T02:53:44Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171089#M133386</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/286/dwynne.html" nodeid="286"&gt;@dwynne&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I tried to set FetchSFTP on primary node too.&lt;/P&gt;&lt;P&gt;The error result is the same.&lt;/P&gt;&lt;P&gt;I was wondering whether I missed something to setup listSftp-&amp;gt;fetchSftp in cluster&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 02:57:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171089#M133386</guid>
      <dc:creator>alvinuw</dc:creator>
      <dc:date>2016-08-10T02:57:27Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171090#M133387</link>
      <description>&lt;P&gt;Alvin,&lt;/P&gt;&lt;P&gt;What run schedule do you have for the ListSFTP and FetchSFTP processors?&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 03:01:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171090#M133387</guid>
      <dc:creator>Wynner</dc:creator>
      <dc:date>2016-08-10T03:01:46Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171091#M133388</link>
      <description>&lt;P&gt;I didn't change the default one.&lt;/P&gt;&lt;P&gt;On both two processors:&lt;/P&gt;&lt;P&gt;Scheduling strategy: On primary node &lt;/P&gt;&lt;P&gt;Concurrent tasks: 1 &lt;/P&gt;&lt;P&gt;Run schedule: 0 sec&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 03:07:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171091#M133388</guid>
      <dc:creator>alvinuw</dc:creator>
      <dc:date>2016-08-10T03:07:06Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171092#M133389</link>
      <description>&lt;P&gt;I'm not sure if this is root cause but if you are using embedded zookeeper make sure you are not using the NCM as a zookeeper node since it will not start up the embedded zookeeper.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 03:09:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171092#M133389</guid>
      <dc:creator>jsequeiros</dc:creator>
      <dc:date>2016-08-10T03:09:30Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171093#M133390</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2436/jsequeiros.html" nodeid="2436"&gt;@jsequeiros&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The embedded zookeeper servers are setup only on two worker nodes. I didn't use NCM as a zookeeper node.&lt;/P&gt;&lt;P&gt;But I am not sure whether this is a permission issue.&lt;/P&gt;&lt;P&gt;Since I am in a dev cluster, I started nifi with sudo. Is it a issue?&lt;/P&gt;&lt;P&gt;Unfortunately, the log only shows "java.io.IOException: error" without details.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 03:30:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171093#M133390</guid>
      <dc:creator>alvinuw</dc:creator>
      <dc:date>2016-08-10T03:30:36Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171094#M133391</link>
      <description>&lt;P&gt;On FetchSFTP are you putting in:&lt;/P&gt;&lt;P&gt;"${path}/${filename}"&lt;/P&gt;&lt;P&gt;For Remote path setting?&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 03:52:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171094#M133391</guid>
      <dc:creator>jsequeiros</dc:creator>
      <dc:date>2016-08-10T03:52:13Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171095#M133392</link>
      <description>&lt;P&gt;Is it because I didn't setup State Provider?&lt;/P&gt;&lt;P&gt;In /conf/state-management.xml&lt;/P&gt;&lt;P&gt;&amp;lt;cluster-provider&amp;gt;&lt;/P&gt;&lt;P&gt;
        &amp;lt;id&amp;gt;zk-provider&amp;lt;/id&amp;gt; &lt;/P&gt;&lt;P&gt;        &amp;lt;class&amp;gt;org.apache.nifi.controller.state.providers.zookeeper.ZooKeeperStateProvider&amp;lt;/class&amp;gt; &lt;/P&gt;&lt;P&gt;        &amp;lt;property name="Connect String"&amp;gt;&amp;lt;/property&amp;gt; &lt;/P&gt;&lt;P&gt;        &amp;lt;property name="Root Node"&amp;gt;/nifi&amp;lt;/property&amp;gt; &lt;/P&gt;&lt;P&gt;        &amp;lt;property name="Session Timeout"&amp;gt;30 seconds&amp;lt;/property&amp;gt; &lt;/P&gt;&lt;P&gt;        &amp;lt;property name="Access Control"&amp;gt;CreatorOnly&amp;lt;/property&amp;gt; &lt;/P&gt;&lt;P&gt;        &amp;lt;property name="Username"&amp;gt;nifi&amp;lt;/property&amp;gt;&lt;/P&gt;&lt;P&gt;
        &amp;lt;property name="Password"&amp;gt;nifi&amp;lt;/property&amp;gt;&lt;/P&gt;&lt;P&gt;
    &amp;lt;/cluster-provider&amp;gt;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 03:52:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171095#M133392</guid>
      <dc:creator>alvinuw</dc:creator>
      <dc:date>2016-08-10T03:52:17Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171096#M133393</link>
      <description>&lt;P&gt;You certainly need to configure state-management.xml and fill in the "Connect String"&lt;/P&gt;&lt;P&gt;The Admin guide of the NIFI docs under "Help" link from within the UI has the steps to stand up the embedded zookeeper.&lt;/P&gt;&lt;P&gt;Also not sure if you saw my previous comment it could be you need that:&lt;/P&gt;&lt;P&gt;On FetchSFTP are you putting in:&lt;/P&gt;&lt;P&gt;"${path}/${filename}"&lt;/P&gt;&lt;P&gt;For Remote path setting?&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 03:57:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171096#M133393</guid>
      <dc:creator>jsequeiros</dc:creator>
      <dc:date>2016-08-10T03:57:01Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171097#M133394</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2436/jsequeiros.html" nodeid="2436"&gt;@jsequeiros&lt;/A&gt;&lt;/P&gt;&lt;P&gt;These two settings solved the issue for me.&lt;/P&gt;&lt;P&gt;The IOException refers to zookeeper state reading.&lt;/P&gt;&lt;P&gt;One more question:&lt;/P&gt;&lt;P&gt;In my test, the listSftp(on primary node) sends a list of 4 files to fetchSftp.&lt;/P&gt;&lt;P&gt;However, I found only the primary node fetches all the 4 files rather than evenly distributing tasks to two workers.&lt;/P&gt;&lt;P&gt;Any idea about the task allocation from listSftp to fetchSftp?&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 20:29:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171097#M133394</guid>
      <dc:creator>alvinuw</dc:creator>
      <dc:date>2016-08-10T20:29:16Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171098#M133395</link>
      <description>&lt;P&gt;In order to distribute your data among your cluster nodes you need to use site to site.&lt;/P&gt;&lt;P&gt;Basically primary gets a list &amp;gt; send the list to a remote processing group ( RPG ) &amp;gt; configure an input port ( Name it something distinctive ) &amp;gt; send that relation to FetchSFTP&lt;/P&gt;&lt;P&gt;*** In order to see your Input port when you connect to the RPG, make sure you have configured the input port.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/articles/16120/how-do-i-distribute-data-across-a-nifi-cluster.html" target="_blank"&gt;https://community.hortonworks.com/articles/16120/how-do-i-distribute-data-across-a-nifi-cluster.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.hortonworks.com/HDPDocuments/HDF1/HDF-1.2/bk_UserGuide/content/site-to-site.html" target="_blank"&gt;https://docs.hortonworks.com/HDPDocuments/HDF1/HDF-1.2/bk_UserGuide/content/site-to-site.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2016 21:37:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171098#M133395</guid>
      <dc:creator>jsequeiros</dc:creator>
      <dc:date>2016-08-10T21:37:45Z</dc:date>
    </item>
    <item>
      <title>Re: ListSftp works but FetchSftp doesn't work in Cluster mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171099#M133396</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2436/jsequeiros.html" nodeid="2436"&gt;@jsequeiros&lt;/A&gt;&lt;/P&gt;&lt;P&gt;After setting up site-to-site properties on all nodes, the fetchSftp works as expected. Both worker nodes are involved in file fetching.&lt;/P&gt;&lt;P&gt;How do we distribute tasks evenly among workers? since I found sometimes one worker took all the files. Sometimes, one took 3, while the other one took 1. I have total 4 test files.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Aug 2016 01:13:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ListSftp-works-but-FetchSftp-doesn-t-work-in-Cluster-mode/m-p/171099#M133396</guid>
      <dc:creator>alvinuw</dc:creator>
      <dc:date>2016-08-11T01:13:39Z</dc:date>
    </item>
  </channel>
</rss>

