05-02-2017
11:17 PM
Hello - I've implemented Spark security using LLAP and am seeing an error in one specific scenario. Here is what I did:

1 - Logged in to the Spark Thrift Server as user spark.
2 - Created a Ranger policy specifying that user spark does not have access to column storekey in table factsales.
3 - Ran the query: select count(1) from factsales;

The error is shown below. Please note that this seems to happen only when I try to get a count of rows. When I query specific columns, I get the expected result (based on whether I have access to the column or not).

------------------------------- QUERY WHERE I GET ERROR (in getting count) --------------------
[alanka01@nwk2-bdp-hadoop-06 ~]$ beeline -u jdbc:hive2://nwk2-bdp-hadoop-08.gdcs-qa.apple.com:10015/default -n spark
Connecting to jdbc:hive2://nwk2-bdp-hadoop-08.gdcs-qa.apple.com:10015/default
Connected to: Spark SQL (version 1.6.2)
Driver: Hive JDBC (version 1.2.1000.2.5.3.0-37)
Transaction isolation: TRANSACTION_REPEATABLE_READ
Beeline version 1.2.1000.2.5.3.0-37 by Apache Hive
0: jdbc:hive2://nwk2-bdp-hadoop-08.gdcs-qa.ap> select count(1) from factsales;
Error: org.apache.spark.sql.catalyst.errors.package$TreeNodeException: execute, tree:
TungstenAggregate(key=[], functions=[(count(1),mode=Final,isDistinct=false)], output=[_c0#402L])
+- TungstenExchange SinglePartition, None
   +- TungstenAggregate(key=[], functions=[(count(1),mode=Partial,isDistinct=false)], output=[count#405L])
      +- Scan LlapRelation(org.apache.spark.sql.hive.llap.LlapContext@32b7eb41,Map(table -> default.factsales, url -> jdbc:hive2://nwk2-bdp-hadoop-06.gdcs-qa.apple.com:10500))[] (state=,code=0)

-------------------------QUERIES WITH EXPECTED RESULT -1 -------------------------------------
0: jdbc:hive2://nwk2-bdp-hadoop-08.gdcs-qa.ap> select saleskey from factsales limit 10;
+-----------+--+
| saleskey |
+-----------+--+
| 3343549 |
| 2822385 |
| 2764012 |
| 3289348 |
| 2531906 |
| 3055870 |
| 2530527 |
| 2880758 |
| 2297049 |
| 3356058 |
+-----------+--+

-------------------------QUERIES WITH EXPECTED RESULT -2 (since user spark does not have access to column storekey) -------------------------
0: jdbc:hive2://nwk2-bdp-hadoop-09.gdcs-qa.ap> select saleskey, storekey from factsales limit 10;
Error: java.io.IOException: org.apache.hive.service.cli.HiveSQLException: java.io.IOException: org.apache.hadoop.hive.ql.metadata.HiveException: Failed to compile query: org.apache.hadoop.hive.ql.security.authorization.plugin.HiveAccessControlException: Permission denied: user [spark] does not have [SELECT] privilege on [default/factsales/saleskey,storekey] (state=,code=0)
Labels: Apache Hive, Apache Spark
05-01-2017
10:36 PM
@Gour Saha - thanks, that setting fixed the issue!
05-01-2017
09:45 PM
@Neeraj Sabharwal, @mqureshi, @Artem Ervits - looping you in, any ideas on this?
05-01-2017
09:38 PM
application-attempt-appattempt-1493666070215-0013.pdf

Hi - I'm running a Beeline query and it is not getting launched.

Query:
beeline -u jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.apple.com:10000/default -n hive -e "INSERT INTO factsales SELECT * FROM factsales_tmp"

The error on the Resource Manager UI is:
--------------------------------
[Mon May 01 21:31:00 +0000 2017] Application is added to the scheduler and is not yet activated. Queue's AM resource limit exceeded. Details : AM Partition = <DEFAULT_PARTITION>; AM Resource Request = <memory:2048, vCores:1>; Queue Resource Limit for AM = <memory:2048, vCores:1>; User AM Resource Limit of the queue = <memory:2048, vCores:1>; Queue AM Resource Usage = <memory:2048, vCores:1>;
--------------------------------

The execution engine is MapReduce, and the query runs in the default queue. I have another queue, llap, which is used to run HiveServer2 Interactive (% of Cluster Capacity = 40%).

Any ideas on what needs to be done or tweaked for this?
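For anyone reading the diagnostics above, here is a minimal Python sketch that pulls the numbers out of the message and checks whether one more ApplicationMaster fits in the queue. The function names are made up for illustration, and the memory-only comparison is a simplifying assumption (the real scheduler's comparison depends on the configured resource calculator). The thread does not name the setting that resolved the problem. In the Capacity Scheduler, the AM share of a queue is governed by yarn.scheduler.capacity.maximum-am-resource-percent, so that is one plausible candidate.

```python
import re

# Diagnostics text copied from the Resource Manager UI message quoted in the post.
DIAGNOSTICS = (
    "Details : AM Partition = <DEFAULT_PARTITION>; "
    "AM Resource Request = <memory:2048, vCores:1>; "
    "Queue Resource Limit for AM = <memory:2048, vCores:1>; "
    "User AM Resource Limit of the queue = <memory:2048, vCores:1>; "
    "Queue AM Resource Usage = <memory:2048, vCores:1>;"
)

# Matches "Some Label = <memory:N, vCores:M>" entries in the message.
RESOURCE_RE = re.compile(
    r"(?P<label>[^;=]+?)\s*=\s*<memory:(?P<mem>\d+),\s*vCores:(?P<vcores>\d+)>"
)


def parse_am_diagnostics(text):
    """Return {label: (memory_mb, vcores)} for each resource in the message."""
    resources = {}
    for match in RESOURCE_RE.finditer(text):
        label = match.group("label").strip()
        resources[label] = (int(match.group("mem")), int(match.group("vcores")))
    return resources


def am_memory_headroom(resources):
    """Memory (MB) still available for ApplicationMasters in the queue."""
    limit_mb = resources["Queue Resource Limit for AM"][0]
    used_mb = resources["Queue AM Resource Usage"][0]
    return limit_mb - used_mb


def am_request_fits(resources):
    """Hypothetical memory-only check: does the requested AM fit in the headroom?"""
    request_mb = resources["AM Resource Request"][0]
    return request_mb <= am_memory_headroom(resources)


if __name__ == "__main__":
    parsed = parse_am_diagnostics(DIAGNOSTICS)
    print("headroom MB:", am_memory_headroom(parsed))
    print("request fits:", am_request_fits(parsed))
```

With the values in the message, the queue's AM limit (2048 MB) is already fully used by a running AM, so a second 2048 MB AM has no room until the limit is raised or the running application finishes.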
Labels: Apache Hive, Apache YARN
04-28-2017
11:10 PM
@ksuresh, @mqureshi - thanks, that fixed this issue!
04-28-2017
09:58 PM
@mqureshi - thanks, however that did not help. Any ideas? How do I debug this?
---------------------------------
[hive@nwk2-bdp-hadoop-06 ~]$ beeline -u jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.apple.com:10000
Connecting to jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.apple.com:10000
Connected to: Apache Hive (version 1.2.1000.2.5.3.0-37)
Driver: Hive JDBC (version 1.2.1000.2.5.3.0-37)
Transaction isolation: TRANSACTION_REPEATABLE_READ
Beeline version 1.2.1000.2.5.3.0-37 by Apache Hive
0: jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.ap> use default
0: jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.ap> show tables
0: jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.ap>
[root@nwk2-bdp-hadoop-06 ~]# beeline -u jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.apple.com:10000/default
Connecting to jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.apple.com:10000/default
Connected to: Apache Hive (version 1.2.1000.2.5.3.0-37)
Driver: Hive JDBC (version 1.2.1000.2.5.3.0-37)
Transaction isolation: TRANSACTION_REPEATABLE_READ
Beeline version 1.2.1000.2.5.3.0-37 by Apache Hive
0: jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.ap> show tables
0: jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.ap>
04-28-2017
09:36 PM
@mqureshi, @Sunile Manjee, @Shyam Sunder Rai - looping you in, any ideas on this?
04-28-2017
09:23 PM
Hello - I have HiveServer2 enabled on my cluster, and I'm able to connect to it using Beeline. However, when I run show tables or show databases, nothing shows up. When I launch the Hive CLI, I am able to see the tables and databases. Any ideas on what is causing this?
---------------------------------------------------------
ON HiveCLI:
[hive@nwk2-bdp-hadoop-06 bin]$ hive
Logging initialized using configuration in file:/etc/hive/2.5.3.0-37/0/hive-log4j.properties
hive> show tables;
OK
factsales
factsales_tmp

ON BEELINE:
beeline -u 'jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.apple.com:10000/'
Connecting to jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.apple.com:10000/
Connected to: Apache Hive (version 1.2.1000.2.5.3.0-37)
Driver: Hive JDBC (version 1.2.1000.2.5.3.0-37)
Transaction isolation: TRANSACTION_REPEATABLE_READ
Beeline version 1.2.1000.2.5.3.0-37 by Apache Hive
0: jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.ap> show tables
0: jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.ap> show databases
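
One visible difference between the two sessions above is that the Hive CLI statement ends with a semicolon and the Beeline statements do not. Whether that is the cause is not confirmed in this thread. As a way to rule it out, here is a small Python sketch that builds a non-interactive Beeline invocation with a terminated statement. The helper name build_beeline_command is made up for illustration; the -u, -n and -e flags are the ones already used elsewhere in these posts.

```python
import subprocess


def build_beeline_command(jdbc_url, statement, user=None):
    """Build an argv list for a one-shot Beeline run (hypothetical helper)."""
    sql = statement.strip()
    # Make sure the statement is explicitly terminated.
    if not sql.endswith(";"):
        sql += ";"
    argv = ["beeline", "-u", jdbc_url]
    if user is not None:
        argv += ["-n", user]
    argv += ["-e", sql]
    return argv


if __name__ == "__main__":
    cmd = build_beeline_command(
        "jdbc:hive2://nwk2-bdp-hadoop-07.gdcs-qa.apple.com:10000/default",
        "show tables",
        user="hive",
    )
    print(" ".join(cmd))
    # Uncomment on a host where beeline is installed and the server is reachable:
    # subprocess.run(cmd, check=True)
```

Running the statement this way takes the interactive prompt out of the picture, which makes it easier to compare against the Hive CLI output.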
Labels: Apache Hive
04-19-2017
09:58 PM
@Void Messiah - a basic question: how did you uninstall HiveServer2 Interactive? Through Ambari? Please let me know, I'm facing a similar issue. Thanks!
04-19-2017
07:05 PM
@Alena Melnikova, @Huahua Wei - I'm getting the same issue. On my cluster, llap_heap_size is ~80% of hive.llap.daemon.yarn.container.mb, but HiveServer Interactive is not starting up. Any ideas on what else needs to be done?
------error ---------
method(env)
File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/scripts/hive_server_interactive.py", line 123, in start
raise Fail("Skipping START of Hive Server Interactive since LLAP app couldn't be STARTED.")
resource_management.core.exceptions.Fail: Skipping START of Hive Server Interactive since LLAP app couldn't be STARTED.
----------------------
hiveserver2-interactive-notstartingup-0419.pdf