Support Questions

Find answers, ask questions, and share your expertise

Hive's taking to much time. It is normal?

avatar
Contributor

Hi,

I'm trying to do a select count(*) from tweets; using Hive CLI. But as you can see, it is taking to much time (i guess)

It is normal?

Any tip?

10007-hive-sql.png

Regards

1 ACCEPTED SOLUTION

avatar
Rising Star

The Tez job has not started: both the mapper and the reducer are in "pending" state and haven't yet been launched. Once launched they would enter "running" state.

Check yarn to ensure there is enough room in your queue to fit the containers (http://sandbox:8088). There isn't much ram in a sandbox, and it could all be taken up by a Spark instance or by the Tez instances of hiveserver2.

View solution in original post

14 REPLIES 14

avatar
Rising Star

The Tez job has not started: both the mapper and the reducer are in "pending" state and haven't yet been launched. Once launched they would enter "running" state.

Check yarn to ensure there is enough room in your queue to fit the containers (http://sandbox:8088). There isn't much ram in a sandbox, and it could all be taken up by a Spark instance or by the Tez instances of hiveserver2.

avatar
Contributor

Jean-Phillippe, as you may guess, i am new with the Sandbox. How can i know if there's enough room in my queue to fit containers?

These are my setting.

10008-hive-sql-config.png

avatar
Rising Star

Since you are in a sandbox, you need to reduce the amount of memory taken by each component so that they can fit (try running with 12 GB if you can). Reduce the memory footprint as follows:

  • Tez container size = 1024 MB
  • Map join, per Map memory = 256 MB
  • Metastore heap = 512 MB
  • Client heap = 512 MB
  • Tez Client->tez.am.resource.memory.mb = 512

YARN will need to fit at least one Tez AM (512 MB) and a couple Tez containers (512MB *2). You can check how much memory is allocated to YARN on the YARN config page "Memory allocated for all YARN containers on a node".

avatar
Contributor

Dead Jean-Phillipe.

I tried to follow your instructions. But, as you can see is giving me headaches

"YARN will need to fit at least one Tez AM (512 MB) and a couple Tez containers (512MB *2). You can check how much memory is allocated to YARN on the YARN config page "Memory allocated for all YARN containers on a node"."

Can you please guide me please.

Regards

avatar
Rising Star

Hi Cristian, the amount of memory that yarn can allocate is controlled by the setting "Memory allocated for all YARN containers on a node" under YARN. Set this to 3 GB, and that should give enough room for Tez to run (it needs 2.5 GB if you follow the settings above.).

avatar
Contributor

I've placed more ram, added jar path missing and worked perfect.

Thanks for your help

avatar
Contributor

Job is not started at all. Please share more info for debugging(cluster size, query, data size, hive log, etc)

avatar
Contributor

Hi Zyang,

Can you tell me where can i find that information?

Regards

avatar
Contributor

Cluster size: yarn node -list

Hive log: /tmp/<username>/hive.log