Created 05-25-2022 11:57 PM
We are planning to integrate Ranger with a SQL query engine so that authorization is checked before a SQL query is sent for execution. We want to ensure that the user is authorized to access the referenced tables and columns before we execute the query.
ASSUMPTION: The data resides in S3, and the user submits a SQL query involving certain tables/columns to the query engine for processing.
Below is what I could not find a definitive answer to.
1. Can Ranger be used to authorize access to data located in cloud storage such as S3 or GCS? If yes, can user/role policies be configured to control table-level and column-level access to data in such cloud storage?
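To make the intended flow concrete, here is a minimal sketch of the pre-execution check described above. It is not Ranger's API: `ColumnPolicy` and `denied_columns` are hypothetical names, the policies live in a plain in-memory list, and the table/column references are assumed to have already been extracted from the SQL by the query engine's parser.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ColumnPolicy:
    """Hypothetical allow-policy: one user, one table, a set of columns."""
    user: str
    table: str
    columns: frozenset  # the entry "*" grants every column of the table


def denied_columns(policies, user, referenced):
    """Return (table, column) pairs the user may NOT read.

    `referenced` maps each table named in the query to the set of
    columns the query touches (assumed to come from the SQL parser).
    An empty result means the query may be sent for execution.
    """
    denied = []
    for table, columns in referenced.items():
        # Union of all columns granted to this user on this table.
        granted = set()
        for policy in policies:
            if policy.user == user and policy.table == table:
                granted |= policy.columns
        if "*" in granted:
            continue  # wildcard policy covers the whole table
        denied.extend((table, c) for c in sorted(columns) if c not in granted)
    return denied


if __name__ == "__main__":
    policies = [ColumnPolicy("alice", "sales", frozenset({"id", "amount"}))]
    query_refs = {"sales": {"id", "ssn"}}
    blocked = denied_columns(policies, "alice", query_refs)
    if blocked:
        print("Query rejected, unauthorized columns:", blocked)
    else:
        print("Query authorized, forwarding to the engine")
```

In a real integration the in-memory list would be replaced by whatever policy lookup Ranger provides for the chosen service type; the point here is only that the check runs on the parsed table/column references before the query reaches the engine.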
Created 05-26-2022 12:27 AM
Created 05-26-2022 12:56 AM
Thanks @Shelton for your response.
I have come across this link, but it is about EMR's integration. I am specifically seeking clarification on the Ranger APIs that handle authorization of data stored in S3.
I have not been able to get a clear enough picture of this to begin integrating with Ranger.