Support Questions
Find answers, ask questions, and share your expertise
Announcements
Check out our newest addition to the community, the Cloudera Innovation Accelerator group hub.

Hive Batch Queries (Hive on Tez) and Concurrency

New Contributor

Hi,

We have started using HDP 3.0 and we are connecting to hive through beeline but we are encountering concurrency issues. All sessions are using the hive user to connect and we are finding that when a second session tries to execute a query it remains in a pending status until the second session is closed. Any ideas where to start looking to determine why two or more sessions using the same user cannot execute queries simultaneously?

Here is the configuration details of our HDP 3.0 cluster:

  1. Number of nodes 5 (2 master, 3 data)
  2. 16GB memory on each node
  3. 12GB memory for YARN containers per data node
  4. Maximum container size memory is set to 12GB to satisfy Hive LLAP requirements
  5. Maximum container size vcores is set to 3

Thanks

Paul

1 REPLY 1

1. What is concurrency set for LLAP, how may TEZ AM can you see in the resource Manager UI ?
2. Can you kindly look at the TEZ UI and graphana LLAP to find if the first query is occupying all the LLAP executors.

As your LLAP size is quite small I believe even premepetion is not helping to launch two jobs simulataneously.