Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Hive Batch Queries (Hive on Tez) and Concurrency

Highlighted

Hive Batch Queries (Hive on Tez) and Concurrency

New Contributor

Hi,

We have started using HDP 3.0 and we are connecting to hive through beeline but we are encountering concurrency issues. All sessions are using the hive user to connect and we are finding that when a second session tries to execute a query it remains in a pending status until the second session is closed. Any ideas where to start looking to determine why two or more sessions using the same user cannot execute queries simultaneously?

Here is the configuration details of our HDP 3.0 cluster:

  1. Number of nodes 5 (2 master, 3 data)
  2. 16GB memory on each node
  3. 12GB memory for YARN containers per data node
  4. Maximum container size memory is set to 12GB to satisfy Hive LLAP requirements
  5. Maximum container size vcores is set to 3

Thanks

Paul

1 REPLY 1

Re: Hive Batch Queries (Hive on Tez) and Concurrency

1. What is concurrency set for LLAP, how may TEZ AM can you see in the resource Manager UI ?
2. Can you kindly look at the TEZ UI and graphana LLAP to find if the first query is occupying all the LLAP executors.

As your LLAP size is quite small I believe even premepetion is not helping to launch two jobs simulataneously.

Don't have an account?
Coming from Hortonworks? Activate your account here