- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Ranger Dynamic query rewrite available for hive?
- Labels:
-
Apache Hive
-
Apache Ranger
-
Apache Spark
Created 11-14-2016 09:12 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Is dynamic query rewrite avialable for hive as it is In HDP 2.5 dynamic query rewrite via Ranger for SparkSQL?
Created on 11-15-2016 03:56 PM - edited 08-18-2019 03:22 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Sunile Manjee Answer is Yes. In HDP 2.5 Spark Column Security is available with LLAP and Ranger integration
You get Fine-Grained Column Level Access Control for SparkSQL. Fully dynamic policies per user. Doesn’t require views. Use Standard Ranger policies and tools to control access and masking policies.
Flow:
1.SparkSQL gets data locations known as “splits” from HiveServer and plans query.
2.HiveServer2 authorizes access using Ranger. Per-user policies like row filtering are applied.
3.Spark gets a modified query plan based on dynamic security policy.
4.Spark reads data from LLAP. Filtering / masking guaranteed by LLAP server.
Created 11-15-2016 12:37 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Ranger supports row-level filtering for Hive in 2.5, and accomplishes this by dynamically rewriting the query. I believe LLAP is a dependency for row-level filtering in SparkSQL.
Created on 11-15-2016 03:56 PM - edited 08-18-2019 03:22 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Sunile Manjee Answer is Yes. In HDP 2.5 Spark Column Security is available with LLAP and Ranger integration
You get Fine-Grained Column Level Access Control for SparkSQL. Fully dynamic policies per user. Doesn’t require views. Use Standard Ranger policies and tools to control access and masking policies.
Flow:
1.SparkSQL gets data locations known as “splits” from HiveServer and plans query.
2.HiveServer2 authorizes access using Ranger. Per-user policies like row filtering are applied.
3.Spark gets a modified query plan based on dynamic security policy.
4.Spark reads data from LLAP. Filtering / masking guaranteed by LLAP server.
