<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: sftp transfer to hdfs in spark as opposed to using a command in a script in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/sftp-transfer-to-hdfs-in-spark-as-opposed-to-using-a-command/m-p/51034#M48620</link>
    <description>&lt;P&gt;I don't know specifically, but yes, it is most likely because the libraries used were not built for distributed system. &amp;nbsp;For instance, if you had three executors running the code in the library then all three would be reading from the sftp side and directory all vying for the same files and copying them to the destination. &amp;nbsp;It would be a mess.&lt;/P&gt;</description>
    <pubDate>Thu, 16 Feb 2017 20:38:07 GMT</pubDate>
    <dc:creator>mbigelow</dc:creator>
    <dc:date>2017-02-16T20:38:07Z</dc:date>
  </channel>
</rss>

