Support Questions

cjervis · ‎10-13-2021

I can run a query through Spark using spark.sql, but when I try the same query via Beeline, Ranger blocks the access because of the policy. How does it work for the Spark query? Doesn't it check HDFS-level permissions?

ChethanYM · ‎10-15-2021

Hi,

I found another community article that addresses your concern. Please see below:

That sounds like everything is working as designed/implemented, since Ranger does not currently (as of HDP 2.4) have a supported plug-in for Spark. When Spark reads Hive tables, it does not go through the "front door" of Hive to actually run queries; it reads the table files from HDFS directly.

That said, the underlying HDFS authorization policies (with or without Ranger) will be honored if they are in place.

Article: https://community.cloudera.com/t5/Support-Questions/Does-Spark-job-honor-Ranger-hive-policies/td-p/1...
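To make the difference between the two paths concrete, here is a small toy model in plain Python. It is only a sketch of the behavior described above, not Ranger's or Spark's actual API: the `Cluster` class, its fields, the user `alice`, the table `sales`, and the path `/warehouse/sales` are all hypothetical, and it assumes HiveServer2 reads the files as a `hive` service user.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    # Hypothetical Hive-level policy: (user, table) pairs allowed to SELECT.
    hive_select_allowed: set = field(default_factory=set)
    # Hypothetical HDFS-level authorization: (user, path) pairs allowed to read.
    hdfs_read_allowed: set = field(default_factory=set)
    # Table name -> HDFS location of its data files.
    table_locations: dict = field(default_factory=dict)

    def read_files(self, user, path):
        """HDFS-level check, applied regardless of which engine asks."""
        if (user, path) not in self.hdfs_read_allowed:
            raise PermissionError(f"HDFS: {user} may not read {path}")
        return f"rows from {path}"

    def query_via_beeline(self, user, table):
        """Beeline -> HiveServer2: the Hive policy is checked at the 'front door'."""
        if (user, table) not in self.hive_select_allowed:
            raise PermissionError(f"Hive policy: {user} may not SELECT from {table}")
        # Assumption: HiveServer2 then reads the files as the 'hive' service user.
        return self.read_files("hive", self.table_locations[table])

    def query_via_spark(self, user, table):
        """Spark reads the table files directly; no Hive policy is consulted."""
        return self.read_files(user, self.table_locations[table])


# alice has HDFS read access to the files but no Hive-level SELECT policy.
cluster = Cluster(
    hive_select_allowed=set(),
    hdfs_read_allowed={("alice", "/warehouse/sales"), ("hive", "/warehouse/sales")},
    table_locations={"sales": "/warehouse/sales"},
)

spark_result = cluster.query_via_spark("alice", "sales")  # succeeds

try:
    cluster.query_via_beeline("alice", "sales")
    beeline_blocked = False
except PermissionError:
    beeline_blocked = True  # blocked by the Hive-level policy
```

In this model the Spark path succeeds because only the HDFS check applies, while the Beeline path is rejected by the Hive-level policy. Removing `("alice", "/warehouse/sales")` from `hdfs_read_allowed` would make the Spark path fail as well.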

Please mark this as resolved if it helps.

Regards,

Chethan YM
