Options
- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Restrict memory for multiple hive query
Labels:
- Labels:
-
Apache Hive
-
Apache YARN
Contributor
Created 01-16-2019 03:14 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi All,
We have total of 3.5 TB of RAM and We are facing a problem of memory utilization .
Problem description :
We have 5 HQL (Each HQL has multiple sub query ) running in parallel .
One of the query is occupying 3.3 TB of memory and rest are sitting idle in the queue and taking lot of time to complete .
We need some recommendation or tuning so that all queries gets equal chunk of memory .
Parameters set in our query
SET hive.exec.compress.output=true ;SET hive.exec.compress.intermediate=true ; SET mapred.output.compress=true ; SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec ; SET mapred.map.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec ; SET io.seqfile.compression.type=BLOCK; SET io.sort.mb=500 ; SET dfs.block.size=536870912; SET io.file.buffer.size=131072; SET mapred.compress.map.output=true; SET mapred.output.compression.type=BLOCK; SET hive.auto.convert.join=true; SET mapreduce.map.memory.mb=12288; SET mapreduce.map.java.opts =-Xmx9831m; SET mapreduce.reduce.java.opts=-Xmx8192m; SET mapreduce.reduce.memory.mb=10240; SET hive.exec.dynamic.partition.mode=nonstrict; SET hive.allow-drop-table=true;
Attached screen shot as well for reference .
Quick Help is appreciated !!
1 REPLY 1
Explorer
Created 02-19-2021 08:22 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi ,
Where you able to get this issue sorted out.
Regards
Sam
