Member since
09-09-2015
5
Posts
2
Kudos Received
0
Solutions
02-05-2019
10:32 AM
Hi @VenkateshB, You may want to take a look at the doc here where provides some distcp examples: https://www.cloudera.com/documentation/enterprise/6/latest/topics/cdh_admin_distcp_data_cluster_migrate.html#distcp_and_s3 Thanks and hope it helps, Li
... View more
03-05-2018
03:38 AM
Hi Aaron! Thanks for answering. At the end it wasn't a problem with Hadoop or the configuration (credentials were correct and config files deploy in all nodes). It was just that IT was blocking all the traffic to the private bucket. Even after asking them to allow those IPs it didn't work so I install CNLM in all nodes and specified the proxy using: -Dfs.s3a.proxy.host="localhost" -Dfs.s3a.proxy.port="3128" After that I was able to move 3 TB in less than a day.
... View more
02-01-2018
10:57 PM
STS should work. I would try (1) using s3a, not s3n, and (2) building your spark app with the same AWS SDK version as used in the cluster.
... View more