Member since
07-16-2018
15
Posts
2
Kudos Received
0
Solutions
01-16-2019
03:14 PM
Hi All, We have total of 3.5 TB of RAM and We are facing a problem of memory utilization . Problem description : We have 5 HQL (Each HQL has multiple sub query ) running in parallel . One of the query is occupying 3.3 TB of memory and rest are sitting idle in the queue and taking lot of time to complete . We need some recommendation or tuning so that all queries gets equal chunk of memory . Parameters set in our query SET hive.exec.compress.output=true ;SET hive.exec.compress.intermediate=true ;
SET mapred.output.compress=true ;
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec ;
SET mapred.map.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec ;
SET io.seqfile.compression.type=BLOCK;
SET io.sort.mb=500 ;
SET dfs.block.size=536870912;
SET io.file.buffer.size=131072;
SET mapred.compress.map.output=true;
SET mapred.output.compression.type=BLOCK;
SET hive.auto.convert.join=true;
SET mapreduce.map.memory.mb=12288;
SET mapreduce.map.java.opts =-Xmx9831m;
SET mapreduce.reduce.java.opts=-Xmx8192m;
SET mapreduce.reduce.memory.mb=10240;
SET hive.exec.dynamic.partition.mode=nonstrict;
SET hive.allow-drop-table=true; Attached screen shot as well for reference . memory.jpg Quick Help is appreciated !!
... View more
Labels:
- Labels:
-
Apache Hive
-
Apache YARN
07-16-2018
07:22 AM
Hi All, When i am ruining hive quries some times i am facing error: Insufficient space for shared memory file . After few tries it throws Exception in thread "main" java.io.IOException: No space left on device Help Appreciated .
... View more
Labels:
- Labels:
-
Apache Hive
07-11-2018
10:36 AM
Hi All, Is there any way we can skip to the next row based on some filter conditions in hive ? Ex: I have a table landing hive> desc landing;
OK
id string
iud string
time timestamp
content 1 I 2018-07-11 07:10:17
1 D 2018-07-11 07:11:16
2 I 2018-07-11 07:29:17
2 D 2018-07-11 07:31:21
3 I 2018-07-11 08:22:22
3 U 2018-07-11 09:12:22
5 I 2018-07-11 09:22:22 I want to insert into table B with following conditions -- When the 1) id has D flag then it should not be inserted (Skipped) , 2) when it has only U or I flag then insert , 3) when id has I,U followed by D flag then it should not insert (Skip) Output 3 U 2018-07-11 09:12:22
5 I 2018-07-11 09:22:22 I have written a code which is using row_number to eliminate duplicate records , but not satisfying the above conditions. select * from (select *,row_number() over(partition by id order by time desc) AS rowid FROM landing where iud <> 'D') r where r.rowid=1; 1 I 2018-07-11 07:10:17 1
2 I 2018-07-11 07:29:17 1
3 U 2018-07-11 09:12:22 1
5 I 2018-07-11 09:22:22 1
Help is appreciated .
... View more
Labels:
- Labels:
-
Apache Hive
06-25-2018
05:21 AM
Hi Vinicius, Yes its 1) Managed table 2) Engine is MR 3) Its a text format hive> show create table payout_curreny_master_temp ;
OK
CREATE TABLE `payout_curreny_master_temp`(
`partner_id` int,
`program_name` string,
`program_type` string,
`payout_currency` string,
`delete_flag` string)
ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
WITH SERDEPROPERTIES (
'field.delim'=',',
'serialization.format'=',')
STORED AS INPUTFORMAT
'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
'hdfs://nameservice1/user/hive/warehouse/xyz/payout_curreny_master_temp'
TBLPROPERTIES (
'COLUMN_STATS_ACCURATE'='true',
'numFiles'='1',
'numRows'='0',
'rawDataSize'='0',
'totalSize'='110912',
'transient_lastDdlTime'='1529665548')
... View more
06-23-2018
04:30 PM
Hi All, My table has 3776 rows . 1) when i do : explain select * from table Statistics: Num rows: 274 Data size: 110912 Basic stats: COMPLETE Column stats: NONE 2) when i do : explain select count(*) from table; Statistics: Num rows: 1 Data size: 8 Basic stats: COMPLETE Column stats: COMPLETE 3) when i do : explain select partner_id from table: Statistics: Num rows: 27728 Data size: 110912 Basic stats: COMPLETE Column stats: NONE Appreciate your help
... View more
Labels:
- Labels:
-
Apache Hive
03-09-2018
10:28 AM
@rtrivedi Yes got some hive logs .. 2018-03-06 03:42:42,951 INFO [HiveServer2-Background-Pool: Thread-1132047]: exec.MoveTask (MoveTask.java:execute(309)) - Partition is: {region_code=null, fiscal_year_week_code=null}
at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:298)
2018-03-06 03:49:24,600 ERROR [HiveServer2-Background-Pool: Thread-1132295]: ql.Driver (SessionState.java:printError(962)) - FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask
... View more
03-08-2018
02:34 PM
@rtrivedi : Thanks But 1) The same job will run successful when u re-run the job (so there ll be no space and permission issues ) 2) Its a small table
... View more
03-08-2018
12:22 PM
Hi , Its a automated job (running through talend ) and its a common table where we will be performing insertions , sometimes it works fine and sometimes we get org.apache.hadoop.hive.ql.exec.MoveTask We are not able to figure it out the exact reason . Please help us . Table structure : CREATE EXTERNAL TABLE `lz_control_zyme_file_load_tracker`(
`target_stage_name` string,
`input_file_sequential_number` int,
`source_system_file_type_name` string,
`region_code` string,
`source_file_name` string,
`process_date` string)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY '|'
STORED AS INPUTFORMAT
'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
'hdfs://HDPPRODNN/sales/channel/loading/lz_control_zyme_file_load_tracker/data'
TBLPROPERTIES (
'COLUMN_STATS_ACCURATE'='true',
'numFiles'='43739',
'numRows'='4706',
'rawDataSize'='443398',
'totalSize'='1842648',
'transient_lastDdlTime'='1520496190')
... View more
Labels:
- Labels:
-
Apache Hive
04-28-2017
01:46 PM
1 Kudo
Thanks a lot .. It works Consolidated answer
ALTER TABLE abc.Stage_xact_personall_region_of_Sale DROP PARTITION (region_code='TYES'); Then do an overwrite on it insert overwrite table abc.Stage_xact_personall_region_of_Sale partition(region_code) select * from xyz.Region_Sale where region_code='TYES'; -Thanks Satish
... View more
04-28-2017
07:27 AM
1 Kudo
Hi All, Can we truncate partition external table ? Sharing an example where i faced the issue . hive> truncate table abc.Stage_xact_personall_region_of_Sale partition (region_code='TYES'); FAILED: SemanticException [Error 10146]: Cannot truncate non-managed table abc.Stage_xact_personall_region_of_Sale . Help will be much appreciated ASAP . -Thanks Satish
... View more
Labels:
- Labels:
-
Apache Hive