<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Unable to move data to a S3 bucket using last CDH (5.14.0) in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Unable-to-move-data-to-a-S3-bucket-using-last-CDH-5-14-0/m-p/64253#M74219</link>
    <description>&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to move data from HDFS to a &lt;STRONG&gt;S3 bucket&lt;/STRONG&gt;. I am using last version of CM/CDH (5.14.0). I have been able to copy data using the tool aws:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;aws s3api put-object&lt;/PRE&gt;&lt;P&gt;And also with the python SDK but I cannot copy data with hadoop distcp. I have added the following extra properties to core-site.xml in HDFS service.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="s3a.png" style="width: 600px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/3835iF62ACFC46EC012C7/image-size/large?v=v2&amp;amp;px=999" role="button" title="s3a.png" alt="s3a.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;PRE&gt;&amp;lt;property&amp;gt;
    &amp;lt;name&amp;gt;fs.s3a.access.key&amp;lt;/name&amp;gt;
    &amp;lt;value&amp;gt;X&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;
&amp;lt;property&amp;gt;
    &amp;lt;name&amp;gt;fs.s3a.secret.key&amp;lt;/name&amp;gt;
    &amp;lt;value&amp;gt;X&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;
&amp;lt;property&amp;gt;
    &amp;lt;name&amp;gt;fs.s3a.endpoint&amp;lt;/name&amp;gt;
    &amp;lt;value&amp;gt;s3.us-east-2.amazonaws.com&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;&lt;/PRE&gt;&lt;P&gt;Nothing happens when I execute a command like&lt;/P&gt;&lt;PRE&gt;hadoop distcp /blablabla s3a://bucket-name/&lt;/PRE&gt;&lt;P&gt;but it hangs for a while (I guess is trying several times). Same thing when I try to just list files in the bucket with&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;hadoop fs -ls s3a://bucket-name&lt;/PRE&gt;&lt;P&gt;I am sure it is not a credentials problem since I can connect using the same access and secret key with the python SKD and aws tool.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Anyone facing a similar issue? Thanks!&lt;/P&gt;</description>
    <pubDate>Fri, 02 Feb 2018 08:14:11 GMT</pubDate>
    <dc:creator>iyanmv</dc:creator>
    <dc:date>2018-02-02T08:14:11Z</dc:date>
    <item>
      <title>Unable to move data to a S3 bucket using last CDH (5.14.0)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Unable-to-move-data-to-a-S3-bucket-using-last-CDH-5-14-0/m-p/64253#M74219</link>
      <description>&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to move data from HDFS to a &lt;STRONG&gt;S3 bucket&lt;/STRONG&gt;. I am using last version of CM/CDH (5.14.0). I have been able to copy data using the tool aws:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;aws s3api put-object&lt;/PRE&gt;&lt;P&gt;And also with the python SDK but I cannot copy data with hadoop distcp. I have added the following extra properties to core-site.xml in HDFS service.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="s3a.png" style="width: 600px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/3835iF62ACFC46EC012C7/image-size/large?v=v2&amp;amp;px=999" role="button" title="s3a.png" alt="s3a.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;PRE&gt;&amp;lt;property&amp;gt;
    &amp;lt;name&amp;gt;fs.s3a.access.key&amp;lt;/name&amp;gt;
    &amp;lt;value&amp;gt;X&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;
&amp;lt;property&amp;gt;
    &amp;lt;name&amp;gt;fs.s3a.secret.key&amp;lt;/name&amp;gt;
    &amp;lt;value&amp;gt;X&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;
&amp;lt;property&amp;gt;
    &amp;lt;name&amp;gt;fs.s3a.endpoint&amp;lt;/name&amp;gt;
    &amp;lt;value&amp;gt;s3.us-east-2.amazonaws.com&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;&lt;/PRE&gt;&lt;P&gt;Nothing happens when I execute a command like&lt;/P&gt;&lt;PRE&gt;hadoop distcp /blablabla s3a://bucket-name/&lt;/PRE&gt;&lt;P&gt;but it hangs for a while (I guess is trying several times). Same thing when I try to just list files in the bucket with&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;hadoop fs -ls s3a://bucket-name&lt;/PRE&gt;&lt;P&gt;I am sure it is not a credentials problem since I can connect using the same access and secret key with the python SKD and aws tool.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Anyone facing a similar issue? Thanks!&lt;/P&gt;</description>
      <pubDate>Fri, 02 Feb 2018 08:14:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Unable-to-move-data-to-a-S3-bucket-using-last-CDH-5-14-0/m-p/64253#M74219</guid>
      <dc:creator>iyanmv</dc:creator>
      <dc:date>2018-02-02T08:14:11Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to move data to a S3 bucket using last CDH (5.14.0)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Unable-to-move-data-to-a-S3-bucket-using-last-CDH-5-14-0/m-p/64435#M74220</link>
      <description>&lt;P&gt;Distcp can take some time to complete depending on your source data.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;One thing to try would be to list a public bucket.&amp;nbsp; I believe if you have no credentials set you'll see an error, but if you have any valid credentials you should be able to list it:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;hadoop fs -ls s3a://landsat-pds/&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Also make sure you've deployed your client configs in Cloudera Manager (CM).&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 07 Feb 2018 23:59:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Unable-to-move-data-to-a-S3-bucket-using-last-CDH-5-14-0/m-p/64435#M74220</guid>
      <dc:creator>AaronFabbri</dc:creator>
      <dc:date>2018-02-07T23:59:07Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to move data to a S3 bucket using last CDH (5.14.0)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Unable-to-move-data-to-a-S3-bucket-using-last-CDH-5-14-0/m-p/65113#M74221</link>
      <description>&lt;P&gt;Hi Aaron! Thanks for answering.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;At the end it wasn't a problem with Hadoop or the configuration (credentials were correct and config files deploy in all nodes). It was just that IT was blocking all the traffic to the private bucket. Even after asking them to allow those IPs it didn't work so I install CNLM in all nodes and specified the proxy using:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;-Dfs.s3a.proxy.host="localhost" -Dfs.s3a.proxy.port="3128"&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;After that I was able to move 3 TB in less than a day.&lt;/P&gt;</description>
      <pubDate>Mon, 05 Mar 2018 11:38:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Unable-to-move-data-to-a-S3-bucket-using-last-CDH-5-14-0/m-p/65113#M74221</guid>
      <dc:creator>iyanmv</dc:creator>
      <dc:date>2018-03-05T11:38:32Z</dc:date>
    </item>
  </channel>
</rss>

