Member since: 07-29-2015
Posts: 535
Kudos Received: 141
Solutions: 103
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 7651 | 12-18-2020 01:46 PM |
| | 4997 | 12-16-2020 12:11 PM |
| | 3810 | 12-07-2020 01:47 PM |
| | 2483 | 12-07-2020 09:21 AM |
| | 1624 | 10-14-2020 11:15 AM |
04-18-2018
02:41 PM
Do you have the JVM error dump file (/var/run/cloudera-scm-agent/process/13339-impala-IMPALAD/hs_err_pid13065.log)? I filed https://issues.apache.org/jira/browse/IMPALA-6882 to investigate the issue. I took a look at the code and it doesn't look like anything has changed, so it probably requires deeper investigation.
04-18-2018
02:31 PM
In Impala 2.11 we actually capped the max batch_size setting. Before that you could set it to an arbitrarily high value, which could have strange consequences. It's still a bit of a use-at-your-own-risk setting since it can have consequences for memory consumption and performance. The real fix for this would be https://issues.apache.org/jira/browse/IMPALA-1618. Setting batch_size is just a workaround that may or may not work for you.
04-17-2018
05:09 PM
What version of CDH were you running before the upgrade? Were you running on the same hardware? Can you include the CPU info from your impalad.INFO log? It looks something like this:
I0417 17:05:31.064653 8873 init.cc:237] Cpu Info:
Model: Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz
Cores: 8
Max Possible Cores: 8
L1 Cache: 32.00 KB (Line: 64.00 B)
L2 Cache: 256.00 KB (Line: 64.00 B)
L3 Cache: 8.00 MB (Line: 64.00 B)
Hardware Supports:
ssse3
sse4_1
sse4_2
popcnt
avx
avx2
pclmulqdq
Numa Nodes: 1
Numa Nodes of Cores: 0->0 | 1->0 | 2->0 | 3->0 | 4->0 | 5->0 | 6->0 | 7->0 |
03-28-2018
09:43 AM
@alpertankut the current link is https://www.cloudera.com/documentation/enterprise/latest/topics/impala_analytic_functions.html#row_number
03-27-2018
04:46 PM
Hi @ludof, "INVALIDATE METADATA" doesn't return a result set (i.e. result rows), so I think you want to use the execute() method instead of executeQuery(). - Tim
03-27-2018
10:58 AM
I think this is an area where the mismatch between HBase's data model and the traditional relational model causes some weirdness. From what I understand, HBase doesn't have a notion of an arbitrary order of columns in a table, and I think Impala is not preserving the creation order when reloading the table. There are some known issues in this area, so maybe we could do better: https://issues.apache.org/jira/browse/IMPALA-886
03-26-2018
02:47 PM
Are there any NULLs in idvar? If so, you could be getting tripped up by the interaction between NOT IN and NULL values. One interesting quirk of SQL is that in some cases IN and NOT IN can both be false for the same row and subquery. For example, I can recreate a similar scenario if the only value in the subquery is a NULL.
[localhost:21000] > select count(distinct int_col) from functional.alltypestiny;
+-------------------------+
| count(distinct int_col) |
+-------------------------+
| 2 |
+-------------------------+
[localhost:21000] > select count(distinct int_col) from functional.alltypestiny t1 where int_col in (select distinct int_col from functional.alltypesagg where int_col is null);
+-------------------------+
| count(distinct int_col) |
+-------------------------+
| 0 |
+-------------------------+
[localhost:21000] > select count(distinct int_col) from functional.alltypestiny t1 where int_col not in (select distinct int_col from functional.alltypesagg where int_col is null);
+-------------------------+
| count(distinct int_col) |
+-------------------------+
| 0 |
+-------------------------+
I suspect it might be easier to understand if you use a NOT EXISTS. It is almost equivalent to NOT IN but the handling of NULL values is more intuitive.
[localhost:21000] > select count(distinct int_col) from functional.alltypestiny t1 where not exists(select distinct int_col from functional.alltypesagg t2 where int_col is null and t1.int_col = t2.int_col);
+-------------------------+
| count(distinct int_col) |
+-------------------------+
| 2 |
+-------------------------+
[localhost:21000] > select count(distinct int_col) from functional.alltypestiny t1 where exists(select distinct int_col from functional.alltypesagg t2 where int_col is null and t1.int_col = t2.int_col);
+-------------------------+
| count(distinct int_col) |
+-------------------------+
| 0 |
+-------------------------+
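The same NULL behaviour can be reproduced without an Impala cluster. The following is only a minimal sketch using Python's built-in sqlite3 module as a stand-in engine; the tables t1 and t2 are hypothetical miniatures, not the functional.alltypestiny and functional.alltypesagg tables used above, and it assumes SQLite follows the same standard three-valued logic for IN and NOT IN.

```python
import sqlite3

# In-memory database as a stand-in engine (assumption: not Impala).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical tables: t1 holds two distinct values, t2 holds only a NULL.
cur.execute("CREATE TABLE t1 (int_col INTEGER)")
cur.execute("CREATE TABLE t2 (int_col INTEGER)")
cur.executemany("INSERT INTO t1 VALUES (?)", [(0,), (1,)])
cur.execute("INSERT INTO t2 VALUES (NULL)")


def count_where(predicate):
    """Count the distinct t1.int_col values that satisfy the predicate."""
    sql = f"SELECT count(DISTINCT int_col) FROM t1 WHERE {predicate}"
    return cur.execute(sql).fetchone()[0]


# Comparing any value with NULL yields NULL (unknown), so both IN and
# NOT IN evaluate to NULL for every row and the WHERE clause drops them all.
in_count = count_where("int_col IN (SELECT int_col FROM t2)")
not_in_count = count_where("int_col NOT IN (SELECT int_col FROM t2)")

# EXISTS only asks whether the correlated subquery returns a row, so it is
# always plain true or false and NOT EXISTS keeps the unmatched rows.
exists_count = count_where(
    "EXISTS (SELECT 1 FROM t2 WHERE t1.int_col = t2.int_col)")
not_exists_count = count_where(
    "NOT EXISTS (SELECT 1 FROM t2 WHERE t1.int_col = t2.int_col)")

print(in_count, not_in_count, exists_count, not_exists_count)
```

If idvar can contain NULLs, either filter them out of the subquery (where idvar is not null) or switch to the NOT EXISTS form.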
03-22-2018
09:11 AM
Hi @Impala_issues, I think this is more of an SSRS question than an Impala question, although there may be people on this forum using Impala in a similar way. I'm assuming that SSRS uses the Impala ODBC driver. The Impala ODBC driver supports the standard SQLBindParameter() API to substitute parameters for each ? in the query text. I have no idea if SSRS has some kind of additional parameter substitution logic on top of that.
References:
https://www.cloudera.com/documentation/other/connectors/impala-odbc/latest/Cloudera-ODBC-Driver-for-Impala-Install-Guide.pdf
https://docs.microsoft.com/en-us/sql/odbc/reference/syntax/sqlbindparameter-function
02-19-2018
04:51 PM
It looks like it's referencing a Java class "org.apache.hive.jdbc.HiveDriver" that's not available on the Impala JVM's classpath. In Hive I assume that the class is deliberately or accidentally on the classpath of the Java tasks spawned for the Hive job. Impala doesn't have that class on its classpath. It seems pretty weird for a UDF to depend on or use a database driver, so my first thought is to modify the UDF to remove the need for the dependency. Otherwise, generally, if you want to use Java classes from your UDF they would need to be built into the UDF JAR.
02-09-2018
03:36 PM
@siddesh210491 the simplest solution might be to set the safety valve globally as above. That will apply to all clients, but it may be a reasonable setting for clients other than Hue too. Otherwise, another option is to use the query_timeout_s query option. You can set a default value for that option (or any query option) if you have dynamic resource pools set up, with all Hue queries going into a pool.
https://www.cloudera.com/documentation/enterprise/latest/topics/cm_mc_resource_pools.html#concept_xkk_l1d_wr__impala_dynamic_pool_settings