Spark query vs beeline query
Labels: Apache Ranger, Apache Spark, HDFS
I can run a query through Spark using spark.sql, but when I try the same query via beeline, Ranger blocks the access because of the policy. How does it work for the Spark query? Doesn't it check HDFS-level permissions?
Created 10-15-2021 08:58 PM
Hi,
I found another community article that addresses your concern. Please see the excerpt below:
That sounds like everything is working as designed/implemented: Ranger does not currently (as of HDP 2.4) have a supported plug-in for Spark, and when Spark reads Hive tables it does not go through the "front door" of Hive to actually run queries. It reads the table files from HDFS directly.
That said, the underlying HDFS authorization policies (with or without Ranger) will be honored if they are in place.
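To make the difference between the two paths concrete, here is a small toy model in Python. It is only a sketch of the behaviour described above, not how Ranger, HiveServer2, or Spark are actually implemented. The `Cluster` class, both functions, the user `alice`, and the path `/warehouse/sales` are hypothetical names made up for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    # Hypothetical, heavily simplified policy stores (real policies are much richer).
    hive_policy_allow: set = field(default_factory=set)  # (user, table) allowed by a Hive-level policy
    hdfs_allow: set = field(default_factory=set)         # (user, path) readable at the HDFS level
    table_location: dict = field(default_factory=dict)   # table name -> HDFS path of its files


def query_via_beeline(cluster, user, table):
    """Beeline goes through Hive's "front door", so the Hive-level policy is checked."""
    if (user, table) not in cluster.hive_policy_allow:
        return "denied by Hive policy"
    return "ok"


def query_via_spark(cluster, user, table):
    """Spark reads the table files directly, so only HDFS authorization applies."""
    path = cluster.table_location[table]
    if (user, path) not in cluster.hdfs_allow:
        return "denied by HDFS"
    return "ok"


# Assumed setup: alice has no Hive-level policy, but can read the files in HDFS.
cluster = Cluster(
    hive_policy_allow=set(),
    hdfs_allow={("alice", "/warehouse/sales")},
    table_location={"sales": "/warehouse/sales"},
)

beeline_result = query_via_beeline(cluster, "alice", "sales")
spark_result = query_via_spark(cluster, "alice", "sales")
print("beeline:", beeline_result)
print("spark:  ", spark_result)
```

In this model the beeline query is rejected while the Spark query succeeds, which matches what you are seeing. If HDFS-level access to the table files were also restricted for that user, the Spark query would be denied as well.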
Do mark it resolved if it really helps you.
Regards,
Chethan YM
