Member since 03-19-2019
18 Posts
0 Kudos Received
0 Solutions
04-17-2019
10:28 PM
Thank you very much, Tim. The link you provided has clarified my doubt.
04-17-2019
08:49 AM
Hi Friends, I have a small doubt about Impala using the fair scheduler to launch jobs. I have been reading about Impala for 3 months, but I have never come across Impala using the fair scheduler; instead, it has its own mechanism for resource allocation. Is there any situation where Impala uses the fair scheduler while executing a query? I have one more doubt, about Impala using YARN. I want to know the scenario or condition in which we have to use YARN with Impala, because Impala has its own execution engine. When do we have to use YARN with Impala? I think Llama is meant only for that.
Labels: Apache Impala, Apache YARN
04-17-2019
04:40 AM
Thanks Tim. How long does Impala keep (cache) this metadata? If statistics for the tables participating in a query are not available, will they be available after the first run? If I run the query again after a long interval, will the metadata still be in the cache? What if my cluster or Impala is restarted? If statistics are not available, does Impala do some work the first time to collect statistics for all participating tables and keep them in the metastore or somewhere else in a DB?
04-16-2019
07:57 AM
Hi Friends, I am running Impala queries with query options and analyzing the resulting query attributes. What I have observed is that the first run of a query takes longer than the second and later runs. I want to know the reason for this difference in time. Does this happen only when table statistics are not available, or does it happen all the time? Is my observation right?
Labels: Apache Impala, Cloudera Manager
04-10-2019
05:02 AM
Thank you very much, Tim, for providing this insight. I had assumed that the MEM_LIMIT option requests that amount of memory for the query.
04-04-2019
10:22 PM
Thanks Tim. This limit (3gb) only works when the impalad's mem_limit is greater than 3gb. I increased the impalad's mem_limit both by invoking the REST API and by manually changing the Impala configuration, but either way the Impala server has to be restarted before the new mem_limit takes effect. What I cannot understand is this: if the impalad's mem_limit is 1gb and I pass a higher mem_limit (query option) in the JDBC URL, it does not work. So what is the point of providing this query option? (1) If my query needs 3gb of memory, the impalad's mem_limit is 1gb, and I pass mem_limit=3gb in the JDBC URL, it will not work; I have to change the impalad's mem_limit and restart the server. (2) If my query needs 500mb of memory and the impalad's mem_limit is 1gb, I do not need to pass mem_limit at all, because the query is going to execute in any case. I hope you understand my point. My conclusion is that this query option can prevent a query from taking the entire memory of the impalad; it does not allocate the memory the query requires.
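To make the two cases above concrete, here is a minimal sketch of the behaviour described in this post, assuming the query-level mem_limit acts only as a cap underneath the impalad's process-wide mem_limit. It is a simplified model, not Impala's actual memory-tracking code; the names `parse_mem` and `effective_query_limit` are hypothetical, and the unit parsing is an illustration rather than Impala's own parser.

```python
import re

# Binary multipliers for the suffixes used in this thread (illustrative only).
UNITS = {"": 1, "b": 1, "kb": 1024, "mb": 1024 ** 2, "gb": 1024 ** 3}


def parse_mem(text):
    """Convert a string such as '3gb' or '500mb' into a number of bytes."""
    match = re.fullmatch(r"(\d+(?:\.\d+)?)\s*([a-z]*)", text.strip().lower())
    if match is None or match.group(2) not in UNITS:
        raise ValueError(f"unrecognised memory value: {text!r}")
    return int(float(match.group(1)) * UNITS[match.group(2)])


def effective_query_limit(process_limit, query_limit=None):
    """Return the memory a query can use on one daemon under this model.

    The query option can lower the ceiling, but it can never raise it above
    the daemon's own limit.
    """
    process_bytes = parse_mem(process_limit)
    if query_limit is None:
        return process_bytes
    return min(process_bytes, parse_mem(query_limit))


# Case (1): the daemon is capped at 1gb, so asking for 3gb changes nothing.
case_one = effective_query_limit("1gb", "3gb")
# Case (2): with no query option, the daemon's 1gb limit is the ceiling.
case_two = effective_query_limit("1gb")
# The option only has an effect when it is below the daemon's limit.
case_three = effective_query_limit("4gb", "3gb")
```

Under this model the option is a guard against one query consuming the whole daemon, which matches the conclusion above.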
04-04-2019
03:12 AM
Hi Team, I am running some tests on Impala so that I can analyze query performance under different configurations. I am using the TPC-DS queries and firing them through JDBC calls. To change configuration values for a query at run time, for the current session only, I am using Impala query options, and I analyze the query attribute values after execution. In one of the JDBC URLs I use the "mem_limit" query option with a value of 3gb (mem_limit=3gb), but I cannot see this value being applied to the current session. I am getting the error below: "Memory limit exceeded"
This is how I am using the query option:
jdbc:impala://host:21050/tpcds_bin_partitioned_textfile_40;AuthMech=1;KrbRealm=test.com;KrbHostFQDN=host;KrbServiceName=impala;mem_limit=3gb;
But when I change the value (mem_limit=3gb) from Cloudera Manager -> Impala -> Configuration, it works fine.
What am I doing wrong here?
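Since stray spaces in a hand-written connection string are easy to miss, here is a small sketch that assembles the same URL from its parts. It assumes, as the URL in this post does, that the driver accepts query options such as mem_limit as extra key=value pairs after the database name; the helper name `build_impala_jdbc_url` is hypothetical and is not part of any driver.

```python
def build_impala_jdbc_url(host, port, database, properties):
    """Join connection properties and query options into one JDBC URL.

    Every piece is joined with ';' and nothing else, so no whitespace can
    end up between the properties.
    """
    parts = [f"jdbc:impala://{host}:{port}/{database}"]
    parts += [f"{key}={value}" for key, value in properties.items()]
    return ";".join(parts)


# Values copied from the post; host and realm are the post's placeholders.
url = build_impala_jdbc_url(
    "host",
    21050,
    "tpcds_bin_partitioned_textfile_40",
    {
        "AuthMech": 1,
        "KrbRealm": "test.com",
        "KrbHostFQDN": "host",
        "KrbServiceName": "impala",
        "mem_limit": "3gb",  # query option passed along with the driver settings
    },
)
print(url)
```

The resulting string can then be handed to the JDBC client unchanged.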
Labels: Apache Impala, Cloudera Manager