Member since
07-15-2022
11
Posts
0
Kudos Received
0
Solutions
10-03-2023
02:15 AM
It seems that the query involves dynamic partitioning, but the dynamic partition column is not included in either the select statement or the Common Table Expression (CTE). Please add the dynamic partition column 'date' to the select statement and validate it in Beeline.
... View more
08-29-2022
05:21 AM
2 Kudos
Hi, After you run the query you need to look at the query profile to analyse the complete memory. look for “Per node peak memory usage” in the profile to understand how much memory each host or impala daemon used to run this query. For above snippet from your side it looks like this query has the 3gb max limit to run the query, this can be set at session level or in impala admission control pool. If you provide the complete query profile i think we can get more details. Regards, Chethan YM
... View more
08-22-2022
02:35 PM
Hi, I have some points to that questions: The order of the parameter does not matter If you do not persist the settings in the configuration, you have to apply them at the start of each session Those parameters are not the holy grail. Vectorized execution can lead to errors and wrong results under specific circumstances and should only be used if it is required and known to work with the used UDFs. Using CBO/fetching stats can improve your performance. Under the wrong circumstances - it can lead to a long gathering period for stats at the end of a query that maybe is worse than the performance gain. Auto convert join should only be used if you know the sizes of the tables. Setting this property to true will trigger mapjoin only if one table fits in your memory otherwise there will be no use in setting this to true and you will not find any optimization in your runtime. I really can recommend you that article by a fellow Clouderan: https://community.cloudera.com/t5/Community-Articles/Hive-on-Tez-Performance-Tuning-Determining-Reducer-Counts/ta-p/245680 If you have concrete questions to optimize a specific query do not hesitate to ask.
... View more
07-27-2022
07:44 AM
@newbieone, Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future.
... View more
07-21-2022
02:39 AM
Hi @EricL, If we are using company laptop and not allowed to place these files, is there any other solutions? Thanks.
... View more
07-18-2022
02:31 AM
@newbieone, as this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.
... View more
07-17-2022
03:58 AM
Hi @colorsoflife Consider using CTE is possible, with the sequence as below for your references. This is Hive script incorporate into Ozzie workflow. set hive.exec.dynamic.partition=true;
set hive.exec.dynamic.partition.mode=nonstrict;
with test_CTE as
(select * from table1),
testone_CTE as
(select col1, col2, col3 from test_CTE)
insert into table mytablename partition(biz_dt)
select col1 as name1, col2 as name2, col3 as name3 from testOne_CTE
... View more
07-15-2022
07:25 PM
Could help to explain what's the meaning of first 2 sentence? Set Hive..... What would happen if we don't have 2 sentences? How would that impact our query? Thanks.
... View more