Created 06-08-2021 02:53 PM
Hi,
I am looking to import data from Cloudera Data Lake to S3.
Have anyone implemented using AWS Glue? Can you provide some steps?
Thanks
Bala
Created 06-15-2021 11:41 PM
1. We don't have any official document to connect directly to AWS Glue.
2. We have a document to connect to S3 and move data between S3 and Data lake cluster.
3. you can use BDR and Distcp to achieve the same , the below documents shows the same.
https://blog.cloudera.com/using-amazon-s3-with-cloudera-bdr/
https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/admin_s3_docs_ref.html
https://docs.cloudera.com/runtime/7.2.9/scaling-namespaces/topics/hdfs-distcp-with-amazon-s3.html