<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to copy files from remote  system to HDFS using HTTPFS in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91317#M45750</link>
    <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="httpfs_kerberos_curl_error" style="width: 600px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/5685i7B1A136459331377/image-size/large?v=v2&amp;amp;px=999" role="button" title="httpfs_kerberos_error.png" alt="httpfs_kerberos_curl_error" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;httpfs_kerberos_curl_error&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am getting a 404 error when I try to get the file status through HttpFS at httpfs_ip:14000.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With the WebHDFS port 50070, however, I get the result. The first command below succeeds, but the request against HttpFS port 14000 does not work.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;*****Working***&lt;/P&gt;&lt;P&gt;WEBHDFS(50070):&amp;nbsp;&lt;/P&gt;&lt;P&gt;curl -i --negotiate -u : "&lt;A href="http://gateway1.rev.com:50070/webhdfs/v1/user/root/ratemp/?op=LISTSTATUS" target="_blank"&gt;http://gateway1.rev.com:50070/webhdfs/v1/user/root/ratemp/?op=LISTSTATUS&lt;/A&gt;"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;*****Not working****&lt;/P&gt;&lt;P&gt;HTTPFS(14000):&lt;/P&gt;&lt;P&gt;curl --negotiate -u : -b ~/cookiejar.txt -c ~/cookiejar.txt &lt;A href="http://gateway1.rev.com:14000/webhdfs/v1/user/root/ratemp/test.txt?op=LISTSTATUS" target="_blank"&gt;http://gateway1.rev.com:14000/webhdfs/v1/user/root/ratemp/test.txt?op=LISTSTATUS&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am using Cloudera Manager; is it required to change&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 07 Jun 2019 05:31:40 GMT</pubDate>
    <dc:creator>Reavidence</dc:creator>
    <dc:date>2019-06-07T05:31:40Z</dc:date>
    <item>
      <title>How to copy files from remote windows system to HDFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/42205#M45743</link>
      <description>&lt;P&gt;Currently I am using the&amp;nbsp;spooldir source to copy files from the local file system to HDFS, but I want to copy files from a remote Windows system.&lt;/P&gt;&lt;P&gt;Can someone suggest which Flume source I can use to copy files from a remote Windows system to HDFS, where I can specify the username and password?&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:26:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/42205#M45743</guid>
      <dc:creator>Raghava9</dc:creator>
      <dc:date>2022-09-16T10:26:41Z</dc:date>
    </item>
    <item>
      <title>Re: How to copy files from remote windows system to HDFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/42637#M45744</link>
      <description>&lt;P&gt;It may be a bit of a long shot, but you could mount the directories of your remote server on your local server using Samba and then copy the files to HDFS from the command line.&lt;/P&gt;</description>
      <pubDate>Wed, 06 Jul 2016 15:19:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/42637#M45744</guid>
      <dc:creator>MVERVUURT</dc:creator>
      <dc:date>2016-07-06T15:19:08Z</dc:date>
    </item>
    <item>
      <title>Re: How to copy files from remote windows system to HDFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/43692#M45745</link>
      <description>&lt;P&gt;Among the HDFS roles there is an "NFS gateway service" that lets you mount an NFS image of HDFS.&lt;/P&gt;&lt;P&gt;That is one way (you can copy files to it directly), but check the performance.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hue (web UI) also lets you upload files into HDFS (this is a more manual approach).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In our enterprise, for an automated process, we use a custom Java application that uses the HCatWriter API to write into Hive tables.&lt;/P&gt;&lt;P&gt;But you can also use HttpFS or WebHDFS.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Aug 2016 08:00:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/43692#M45745</guid>
      <dc:creator>mathieu.d</dc:creator>
      <dc:date>2016-08-09T08:00:17Z</dc:date>
    </item>
    <item>
      <title>Re: How to copy files from remote windows system to HDFS using HTTPFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/90950#M45746</link>
      <description>&lt;P&gt;&lt;A href="http://httpfs.server.com:14000/webhdfs/v1/user/rakesh/abc.csv?op=CREATE&amp;amp;user.name=hdfs" target="_blank"&gt;http://httpfs.server.com:14000/webhdfs/v1/user/rakesh/abc.csv?op=CREATE&amp;amp;user.name=hdfs&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;With this URL I am able to create only small files.&lt;BR /&gt;&lt;BR /&gt;How can I increase the buffer size to upload files that are several GB in size?&lt;/P&gt;</description>
      <pubDate>Tue, 28 May 2019 10:22:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/90950#M45746</guid>
      <dc:creator>Reavidence</dc:creator>
      <dc:date>2019-05-28T10:22:42Z</dc:date>
    </item>
    <item>
      <title>Re: How to copy files from remote windows system to HDFS using HTTPFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/90972#M45747</link>
      <description>I didn't quite understand your requirement.&lt;BR /&gt;Could you provide some more information?</description>
      <pubDate>Tue, 28 May 2019 14:45:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/90972#M45747</guid>
      <dc:creator>csguna</dc:creator>
      <dc:date>2019-05-28T14:45:45Z</dc:date>
    </item>
    <item>
      <title>Re: How to copy files from remote  system to HDFS using HTTPFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91242#M45748</link>
      <description>&lt;P&gt;HTTPFS:&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;**************Working with private IP and public IP, irrespective of file size********&lt;BR /&gt;curl -X PUT -L -b cookie.jar "&lt;A href="http://192.168.1.3:14000/webhdfs/v1/user/abc.csv?op=CREATE&amp;amp;data=true&amp;amp;user.name=hdfs" target="_blank"&gt;http://192.168.1.3:14000/webhdfs/v1/user/abc.csv?op=CREATE&amp;amp;data=true&amp;amp;user.name=hdfs&lt;/A&gt;" --header "Content-Type:application/octet-stream" --header "Transfer-Encoding:chunked" -T "abc.csv"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The above command is for a non-Kerberized cluster. I have now enabled Kerberos; what parameters should I pass to put a file into HDFS?&lt;/P&gt;</description>
      <pubDate>Wed, 05 Jun 2019 10:07:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91242#M45748</guid>
      <dc:creator>Reavidence</dc:creator>
      <dc:date>2019-06-05T10:07:56Z</dc:date>
    </item>
    <item>
      <title>Re: How to copy files from remote  system to HDFS using HTTPFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91276#M45749</link>
      <description>&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/33244"&gt;@Reavidence&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;HTTPFS with Kerberos requires SPNEGO authentication to be used. Per &lt;A href="https://www.cloudera.com/documentation/enterprise/latest/topics/cdh_sg_httpfs_security.html" target="_blank"&gt;https://www.cloudera.com/documentation/enterprise/latest/topics/cdh_sg_httpfs_security.html&lt;/A&gt;, for curl (after kinit) this can be done by passing the below two parameters:&lt;BR /&gt;&lt;BR /&gt;"""&lt;BR /&gt;The '--negotiate' option enables SPNEGO in curl.&lt;BR /&gt;The '-u :' option is required but the username is ignored (the principal that has been specified for kinit is used).&lt;BR /&gt;"""</description>
      <pubDate>Thu, 06 Jun 2019 01:24:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91276#M45749</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2019-06-06T01:24:05Z</dc:date>
    </item>
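    <item>
      <description><![CDATA[
A minimal sketch of how the two SPNEGO options quoted in the reply above might be combined with the earlier non-Kerberized upload command. The function names httpfs_create_url and httpfs_put, the cookie jar path, and the principal in the usage comment are assumptions made for illustration and do not come from the thread; the host, port, and HDFS path are the ones posted earlier. The user.name query parameter is left out on the assumption that the principal obtained with kinit identifies the user.

```shell
#!/bin/sh
# Sketch: upload a local file to HDFS through HttpFS on a Kerberized cluster.
# Function names and the cookie jar location are hypothetical.

# Build the HttpFS CREATE URL for a target HDFS path.
#   $1 = host:port of the HttpFS server
#   $2 = absolute HDFS path of the file to create
httpfs_create_url() {
  printf 'http://%s/webhdfs/v1%s?op=CREATE&data=true' "$1" "$2"
}

# Upload a file using SPNEGO. A valid Kerberos ticket must already exist
# (run kinit first); "-u :" is required, but the username is ignored.
#   $1 = host:port, $2 = local file, $3 = absolute HDFS path
httpfs_put() {
  curl --negotiate -u : -X PUT -L \
    -b "$HOME/cookiejar.txt" -c "$HOME/cookiejar.txt" \
    --header "Content-Type: application/octet-stream" \
    --header "Transfer-Encoding: chunked" \
    -T "$2" "$(httpfs_create_url "$1" "$3")"
}

# Example usage (not executed here; the principal is a placeholder):
#   kinit someuser@EXAMPLE.COM
#   httpfs_put gateway1.rev.com:14000 abc.csv /user/root/ratemp/abc.csv
```

Keeping the URL construction in its own function makes it easy to print and inspect the request target before sending any data to the cluster.
]]></description>
    </item>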
    <item>
      <title>Re: How to copy files from remote  system to HDFS using HTTPFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91317#M45750</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="httpfs_kerberos_curl_error" style="width: 600px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/5685i7B1A136459331377/image-size/large?v=v2&amp;amp;px=999" role="button" title="httpfs_kerberos_error.png" alt="httpfs_kerberos_curl_error" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;httpfs_kerberos_curl_error&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am getting a 404 error when I try to get the file status through HttpFS at httpfs_ip:14000.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With the WebHDFS port 50070, however, I get the result. The first command below succeeds, but the request against HttpFS port 14000 does not work.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;*****Working***&lt;/P&gt;&lt;P&gt;WEBHDFS(50070):&amp;nbsp;&lt;/P&gt;&lt;P&gt;curl -i --negotiate -u : "&lt;A href="http://gateway1.rev.com:50070/webhdfs/v1/user/root/ratemp/?op=LISTSTATUS" target="_blank"&gt;http://gateway1.rev.com:50070/webhdfs/v1/user/root/ratemp/?op=LISTSTATUS&lt;/A&gt;"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;*****Not working****&lt;/P&gt;&lt;P&gt;HTTPFS(14000):&lt;/P&gt;&lt;P&gt;curl --negotiate -u : -b ~/cookiejar.txt -c ~/cookiejar.txt &lt;A href="http://gateway1.rev.com:14000/webhdfs/v1/user/root/ratemp/test.txt?op=LISTSTATUS" target="_blank"&gt;http://gateway1.rev.com:14000/webhdfs/v1/user/root/ratemp/test.txt?op=LISTSTATUS&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am using Cloudera Manager; is it required to change&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2019 05:31:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91317#M45750</guid>
      <dc:creator>Reavidence</dc:creator>
      <dc:date>2019-06-07T05:31:40Z</dc:date>
    </item>
    <item>
      <title>Re: How to copy files from remote  system to HDFS using HTTPFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91323#M45751</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 600px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/5687i0AA79D132A0929A1/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I had added the values above, and they were causing HttpFS to shut down. After I deleted those values, it started and is working fine now.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks,&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/213"&gt;@Harsh J&lt;/a&gt;,&amp;nbsp;for your reply.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2019 07:00:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-copy-files-from-remote-windows-system-to-HDFS/m-p/91323#M45751</guid>
      <dc:creator>Reavidence</dc:creator>
      <dc:date>2019-06-07T07:00:25Z</dc:date>
    </item>
  </channel>
</rss>

