<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127480#M51508</link>
    <description>&lt;P&gt;Getting there....I missed a colon in my previous attempt......&lt;/P&gt;&lt;P&gt;[hdfs@test232 ~]$ hadoop distcp -Dhadoop.security.credential.provider.path=jceks://hdfs/aws.jceks /nsswitch.conf s3a://kartik-test.s3-us-west-2.amazonaws.com &lt;/P&gt;&lt;P&gt;17/01/14 15:12:31 INFO s3a.S3AFileSystem: Caught an AmazonServiceException, which means your request made it to Amazon S3, but was rejected with an error response for some reason. &lt;/P&gt;&lt;P&gt;17/01/14 15:12:31 INFO s3a.S3AFileSystem: Error Message: Status Code: 403, AWS Service: Amazon S3, AWS Request ID: 3094C5772AA3B4C0, AWS Error Code: SignatureDoesNotMatch, AWS Error Message: The request signature we calculated does not match the signature you provided. Check your key and signing method. &lt;/P&gt;&lt;P&gt;17/01/14 15:12:31 INFO s3a.S3AFileSystem: HTTP Status Code: 403
17/01/14 15:12:31 INFO s3a.S3AFileSystem: AWS Error Code: SignatureDoesNotMatch &lt;/P&gt;&lt;P&gt;17/01/14 15:12:31 INFO s3a.S3AFileSystem: Error Type: Client
17/01/14 15:12:31 INFO s3a.S3AFileSystem: Request ID: 3094C5772AA3B4C0
17/01/14 15:12:31 INFO s3a.S3AFileSystem: Class Name: com.amazonaws.services.s3.model.AmazonS3Exception&lt;/P&gt;</description>
    <pubDate>Sun, 15 Jan 2017 08:07:51 GMT</pubDate>
    <dc:creator>ddolecki</dc:creator>
    <dc:date>2017-01-15T08:07:51Z</dc:date>
    <item>
      <title>How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127474#M51502</link>
      <description>&lt;P&gt;How to install the hadoop-aws module to copy from on premist hdfs to s3 aws, I need the command s3DistCp&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jan 2017 04:25:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127474#M51502</guid>
      <dc:creator>ddolecki</dc:creator>
      <dc:date>2017-01-13T04:25:35Z</dc:date>
    </item>
    <item>
      <title>Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127475#M51503</link>
      <description>&lt;P&gt;distcp recognizes the s3[a] protocols from the default libraries already available in Hadoop.  &lt;/P&gt;&lt;P&gt;For example: Moving data from Hadoop to S3.&lt;/P&gt;&lt;P&gt;hadoop distcp &amp;lt;current_cluster_folder&amp;gt; s3[a]://&amp;lt;bucket_info&amp;gt;&lt;/P&gt;&lt;P&gt;If you're looking for ways to manage access (via AWS Keys) to S3 Buckets in Hadoop, this article is a great secure way to do that.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/articles/59161/using-hadoop-credential-api-to-store-aws-secrets.html" target="_blank"&gt;https://community.hortonworks.com/articles/59161/using-hadoop-credential-api-to-store-aws-secrets.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jan 2017 23:46:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127475#M51503</guid>
      <dc:creator>dstreev</dc:creator>
      <dc:date>2017-01-13T23:46:23Z</dc:date>
    </item>
    <item>
      <title>Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127476#M51504</link>
      <description>&lt;P&gt;When I run: &lt;/P&gt;&lt;OL&gt;
&lt;LI&gt;hadoop credential create fs.s3a.access.key -provider localjceks://file/path/to/aws.jceks&lt;/LI&gt;&lt;LI&gt;&amp;lt;enter AccessKey value at prompt&amp;gt;&lt;/LI&gt;&lt;LI&gt;hadoop credential create fs.s3a.secret.key -provider localjceks://file/path/to/aws.jceks&lt;/LI&gt;&lt;LI&gt;&amp;lt;enter SecretKey value at prompt&amp;gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;It prompts me for a password:&lt;/P&gt;&lt;P&gt;[root@test232 conf]# hadoop credential create fs.s3a.access.key -provider localjceks://file/var/tmp/aws.jceks &lt;/P&gt;&lt;P&gt;Enter password: &lt;/P&gt;&lt;P&gt;Enter password again:&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jan 2017 00:34:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127476#M51504</guid>
      <dc:creator>ddolecki</dc:creator>
      <dc:date>2017-01-15T00:34:13Z</dc:date>
    </item>
    <item>
      <title>Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127477#M51505</link>
      <description>&lt;P&gt;When I add the access key and the secret in the prompt for the password, I get this:&lt;/P&gt;&lt;P&gt;[hdfs@test232 ~]$ hdfs dfs -Dhadoop.security.credential.provider.path=jceks://hdfs/aws/aws.jceks -ls s3a://s3-us-west-2.amazonaws.com/kartik-test
17/01/14 07:51:00 INFO s3a.S3AFileSystem: Caught an AmazonServiceException, which means your request made it to Amazon S3, but was rejected with an error response for some reason.
17/01/14 07:51:00 INFO s3a.S3AFileSystem: Error Message: Status Code: 403, AWS Service: Amazon S3, AWS Request ID: C3EFA25EC200D255, AWS Error Code: null, AWS Error Message: Forbidden
17/01/14 07:51:00 INFO s3a.S3AFileSystem: HTTP Status Code: 403
17/01/14 07:51:00 INFO s3a.S3AFileSystem: AWS Error Code: null
17/01/14 07:51:00 INFO s3a.S3AFileSystem: Error Type: Client
17/01/14 07:51:00 INFO s3a.S3AFileSystem: Request ID: C3EFA25EC200D255
17/01/14 07:51:00 INFO s3a.S3AFileSystem: Class Name: com.amazonaws.services.s3.model.AmazonS3Exception
-ls: Fatal internal error&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jan 2017 00:46:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127477#M51505</guid>
      <dc:creator>ddolecki</dc:creator>
      <dc:date>2017-01-15T00:46:25Z</dc:date>
    </item>
    <item>
      <title>Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127478#M51506</link>
      <description>&lt;P&gt;[hdfs@test232 ~]$ curl &lt;A href="http://kartik-test.s3-us-west-2.amazonaws.com" target="_blank"&gt;http://kartik-test.s3-us-west-2.amazonaws.com&lt;/A&gt;
&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;
&amp;lt;ListBucketResult xmlns="http://s3.amazonaws.com/doc/2006-03-01/"&amp;gt;&amp;lt;Name&amp;gt;kartik-test&amp;lt;/Name&amp;gt;&amp;lt;Prefix&amp;gt;&amp;lt;/Prefix&amp;gt;&amp;lt;Marker&amp;gt;&amp;lt;/Marker&amp;gt;&amp;lt;MaxKeys&amp;gt;1000&amp;lt;/MaxKeys&amp;gt;&amp;lt;IsTruncated&amp;gt;false&amp;lt;/IsTruncated&amp;gt;&amp;lt;Contents&amp;gt;&amp;lt;Key&amp;gt;hosts&amp;lt;/Key&amp;gt;&amp;lt;LastModified&amp;gt;2017-01-12T19:48:14.000Z&amp;lt;/LastModified&amp;gt;&amp;lt;ETag&amp;gt;"881dc3861c3c8a28e213790785a940b7"&amp;lt;/ETag&amp;gt;&amp;lt;Size&amp;gt;44&amp;lt;/Size&amp;gt;&amp;lt;StorageClass&amp;gt;STANDARD&amp;lt;/StorageClass&amp;gt;&amp;lt;/Contents&amp;gt;&amp;lt;Contents&amp;gt;&amp;lt;Key&amp;gt;logs/&amp;lt;/Key&amp;gt;&amp;lt;LastModified&amp;gt;2017-01-14T17:01:56.000Z&amp;lt;/LastModified&amp;gt;&amp;lt;ETag&amp;gt;"d41d8cd98f00b204e9800998ecf8427e"&amp;lt;/ETag&amp;gt;&amp;lt;Size&amp;gt;0&amp;lt;/Size&amp;gt;&amp;lt;StorageClass&amp;gt;STANDARD&amp;lt;/StorageClass&amp;gt;&amp;lt;/Contents&amp;gt;&amp;lt;/ListBucketResult&amp;gt;[hdfs@test232 ~]$&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jan 2017 03:38:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127478#M51506</guid>
      <dc:creator>ddolecki</dc:creator>
      <dc:date>2017-01-15T03:38:45Z</dc:date>
    </item>
    <item>
      <title>Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127479#M51507</link>
      <description>&lt;P&gt;I tried: hadoop distcp -Dhadoop.security.credential.provider.path=jceks://hdfs/aws.jceks /nsswitch.conf s3a//kartik-test.s3-us-west-2.amazonaws.com&lt;/P&gt;&lt;P&gt;and it created a s3a folder in my hdfs:&lt;/P&gt;&lt;P&gt;[hdfs@test232 ~]$ hdfs dfs -ls &lt;/P&gt;&lt;P&gt;Found 3 items&lt;/P&gt;&lt;P&gt;drwx------   - hdfs hdfs          0 2017-01-14 07:47 .Trash &lt;/P&gt;&lt;P&gt;drwx------   - hdfs hdfs          0 2017-01-14 12:07 .staging &lt;/P&gt;&lt;P&gt;drwx------   - hdfs hdfs          0 2017-01-14 12:07 s3a&lt;/P&gt;&lt;P&gt;[hdfs@test232 ~]$
&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jan 2017 05:02:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127479#M51507</guid>
      <dc:creator>ddolecki</dc:creator>
      <dc:date>2017-01-15T05:02:48Z</dc:date>
    </item>
    <item>
      <title>Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127480#M51508</link>
      <description>&lt;P&gt;Getting there....I missed a colon in my previous attempt......&lt;/P&gt;&lt;P&gt;[hdfs@test232 ~]$ hadoop distcp -Dhadoop.security.credential.provider.path=jceks://hdfs/aws.jceks /nsswitch.conf s3a://kartik-test.s3-us-west-2.amazonaws.com &lt;/P&gt;&lt;P&gt;17/01/14 15:12:31 INFO s3a.S3AFileSystem: Caught an AmazonServiceException, which means your request made it to Amazon S3, but was rejected with an error response for some reason. &lt;/P&gt;&lt;P&gt;17/01/14 15:12:31 INFO s3a.S3AFileSystem: Error Message: Status Code: 403, AWS Service: Amazon S3, AWS Request ID: 3094C5772AA3B4C0, AWS Error Code: SignatureDoesNotMatch, AWS Error Message: The request signature we calculated does not match the signature you provided. Check your key and signing method. &lt;/P&gt;&lt;P&gt;17/01/14 15:12:31 INFO s3a.S3AFileSystem: HTTP Status Code: 403
17/01/14 15:12:31 INFO s3a.S3AFileSystem: AWS Error Code: SignatureDoesNotMatch &lt;/P&gt;&lt;P&gt;17/01/14 15:12:31 INFO s3a.S3AFileSystem: Error Type: Client
17/01/14 15:12:31 INFO s3a.S3AFileSystem: Request ID: 3094C5772AA3B4C0
17/01/14 15:12:31 INFO s3a.S3AFileSystem: Class Name: com.amazonaws.services.s3.model.AmazonS3Exception&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jan 2017 08:07:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127480#M51508</guid>
      <dc:creator>ddolecki</dc:creator>
      <dc:date>2017-01-15T08:07:51Z</dc:date>
    </item>
    <item>
      <title>Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127481#M51509</link>
      <description>&lt;P&gt;Regenerated keys, updated the aws.jceks entry&lt;/P&gt;&lt;P&gt;[hdfs@test232 ~]$ hadoop distcp -Dhadoop.security.credential.provider.path=jceks://hdfs/aws/aws.jceks /nsswitch.conf s3a://kartik-test.s3-us-west-2.amazonaws.com
17/01/14 20:14:59 ERROR tools.DistCp: Invalid arguments:
java.io.IOException: Bucket kartik-test.s3-us-west-2.amazonaws.com does not exist&lt;/P&gt;&lt;P&gt;But I am able to browse the bucket in http&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jan 2017 09:16:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127481#M51509</guid>
      <dc:creator>ddolecki</dc:creator>
      <dc:date>2017-01-15T09:16:53Z</dc:date>
    </item>
    <item>
      <title>Re: How to install the hadoop-aws module to copy from on premist hdfs to s3 aws</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127482#M51510</link>
      <description>&lt;P&gt;This worked: [hdfs@test232 ~]$ hadoop distcp -Dhadoop.security.credential.provider.path=jceks://hdfs/aws/aws.jceks /test s3a://kartik-test/&lt;/P&gt;&lt;P&gt;Thanks for all your help!!&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jan 2017 23:00:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-install-the-hadoop-aws-module-to-copy-from-on-premist/m-p/127482#M51510</guid>
      <dc:creator>ddolecki</dc:creator>
      <dc:date>2017-01-15T23:00:36Z</dc:date>
    </item>
  </channel>
</rss>

