About newbieone

ggangadharan · ‎10-03-2023

It seems that the query involves dynamic partitioning, but the dynamic partition column is not included in either the select statement or the Common Table Expression (CTE). Please add the dynamic partition column 'date' to the select statement and validate it in Beeline.

ChethanYM · ‎08-29-2022

Hi, After you run the query you need to look at the query profile to analyse the complete memory. look for “Per node peak memory usage” in the profile to understand how much memory each host or impala daemon used to run this query. For above snippet from your side it looks like this query has the 3gb max limit to run the query, this can be set at session level or in impala admission control pool. If you provide the complete query profile i think we can get more details. Regards, Chethan YM

Bilbao · ‎08-22-2022

Hi, I have some points to that questions: The order of the parameter does not matter If you do not persist the settings in the configuration, you have to apply them at the start of each session Those parameters are not the holy grail. Vectorized execution can lead to errors and wrong results under specific circumstances and should only be used if it is required and known to work with the used UDFs. Using CBO/fetching stats can improve your performance. Under the wrong circumstances - it can lead to a long gathering period for stats at the end of a query that maybe is worse than the performance gain. Auto convert join should only be used if you know the sizes of the tables. Setting this property to true will trigger mapjoin only if one table fits in your memory otherwise there will be no use in setting this to true and you will not find any optimization in your runtime. I really can recommend you that article by a fellow Clouderan: https://community.cloudera.com/t5/Community-Articles/Hive-on-Tez-Performance-Tuning-Determining-Reducer-Counts/ta-p/245680 If you have concrete questions to optimize a specific query do not hesitate to ask.

newbieone · ‎08-18-2022

Can we set 100g? Or maybe 1000g?

VidyaSargur · ‎07-27-2022

@newbieone, Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future.

newbieone · ‎07-21-2022

Hi @EricL, If we are using company laptop and not allowed to place these files, is there any other solutions? Thanks.

VidyaSargur · ‎07-18-2022

@newbieone, as this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.

newbieone · ‎07-17-2022

Hi, Can i use CTE in Hive script? Thanks.

newbieone · ‎07-17-2022

Hi @colorsoflife Consider using CTE is possible, with the sequence as below for your references. This is Hive script incorporate into Ozzie workflow. set hive.exec.dynamic.partition=true; set hive.exec.dynamic.partition.mode=nonstrict; with test_CTE as (select * from table1), testone_CTE as (select col1, col2, col3 from test_CTE) insert into table mytablename partition(biz_dt) select col1 as name1, col2 as name2, col3 as name3 from testOne_CTE

newbieone · ‎07-15-2022

Could help to explain what's the meaning of first 2 sentence? Set Hive..... What would happen if we don't have 2 sentences? How would that impact our query? Thanks.

Online	Offline
Last Visited	‎08-27-2024 07:32 PM

Member Since	‎07-15-2022 12:40 AM
Last Visited	‎08-27-2024 07:32 PM
Posts	11

Cloudera Community

Re: Ozzie Workflow for CTE and Insert Statement

Re: How to Read Impala Memory Summary

Re: Hive Performance Tuning Parameters

Re: Impala - Memory limit exceeded

Re: Ozzie Hive Script

Re: FAILED: Execution Error, return code 2 from or...

Re: Compiling statement: FAILED: ParseException : ...

Re: Hive ACID Merge by Example

Re: ParseException line 1:162 cannot recognize inp...

Re: Hive partitioning