Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant.

Does anyone know how to execute an AWS CLI command to remove S3 objects from within an Oozie workflow? We would need to run something like "aws s3 rm s3://PathToObjects --recursive" from inside the workflow.

1 ACCEPTED SOLUTION

You can simply run HDFS CLI commands to delete objects on S3:

hdfs dfs -rm -r -f -skipTrash s3a://S3_BUCKET_NAME/S3_PATH

This, however, will only work if you have specified the S3 properties (fs.s3a.access.key, fs.s3a.secret.key) in core-site.xml.
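
For reference, the two properties mentioned above would typically look something like the following in core-site.xml. This is only a minimal sketch: the values YOUR_ACCESS_KEY and YOUR_SECRET_KEY are placeholders, not real settings, and your cluster may supply credentials through some other mechanism instead.

```xml
<!-- Sketch of the S3A credential properties in core-site.xml.
     YOUR_ACCESS_KEY and YOUR_SECRET_KEY are placeholder values. -->
<configuration>
  <property>
    <name>fs.s3a.access.key</name>
    <value>YOUR_ACCESS_KEY</value>
  </property>
  <property>
    <name>fs.s3a.secret.key</name>
    <value>YOUR_SECRET_KEY</value>
  </property>
</configuration>
```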

3 REPLIES

We need to run the command inside an Oozie workflow, so I am assuming the command above would work if the Oozie workflow XML looks like what is shown below. Am I understanding that correctly?

<action name="remove-s3">
  <fs>
    <hdfs dfs -rm -r -f -skipTrash s3a://S3_BUCKET_NAME/S3_PATH />
  </fs>
</action>

Yes, that is correct
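
One caveat worth checking: as far as the documented Oozie schema goes, the `<fs>` action takes elements such as `<delete>`, `<mkdir>`, and `<move>` rather than a raw command line, so the snippet above may need adjusting before Oozie will accept it. Below is a minimal sketch of one way to run the same command, assuming a shell action is acceptable and that the `hdfs` binary is on the PATH of the nodes where the action runs. The workflow name `remove-s3-wf`, the node names `fail` and `end`, and the `${jobTracker}` and `${nameNode}` parameters are illustrative assumptions, not taken from the original post.

```xml
<!-- Sketch only: runs the hdfs delete command through an Oozie shell action.
     Names and parameters here are assumptions for illustration. -->
<workflow-app name="remove-s3-wf" xmlns="uri:oozie:workflow:0.5">
  <start to="remove-s3"/>

  <action name="remove-s3">
    <shell xmlns="uri:oozie:shell-action:0.3">
      <job-tracker>${jobTracker}</job-tracker>
      <name-node>${nameNode}</name-node>
      <!-- Each command-line token goes in its own argument element -->
      <exec>hdfs</exec>
      <argument>dfs</argument>
      <argument>-rm</argument>
      <argument>-r</argument>
      <argument>-f</argument>
      <argument>-skipTrash</argument>
      <argument>s3a://S3_BUCKET_NAME/S3_PATH</argument>
    </shell>
    <ok to="end"/>
    <error to="fail"/>
  </action>

  <kill name="fail">
    <message>S3 delete failed: [${wf:errorMessage(wf:lastErrorNode())}]</message>
  </kill>
  <end name="end"/>
</workflow-app>
```

An alternative that stays within the `<fs>` action would be `<delete path="s3a://S3_BUCKET_NAME/S3_PATH"/>`, though whether an s3a path is accepted there likely depends on which filesystems your Oozie server is configured to allow.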