Member since
10-16-2013
307
Posts
77
Kudos Received
59
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
11112 | 04-17-2018 04:59 PM | |
6092 | 04-11-2018 10:07 PM | |
3514 | 03-02-2018 09:13 AM | |
22071 | 03-01-2018 09:22 AM | |
2606 | 02-27-2018 08:06 AM |
07-02-2018
02:52 AM
So this begs the question how does Cloudera Manager produce the following chart: any reasson we cant use the SQL: select query_duration from DEFAULT.IMPALA_QUERIES where serviceName="impala" and (stats_missing=true or stats_corrupt=true) ??
... View more
05-10-2018
07:54 PM
@alex.behmwrote: To debug wrong results, it's very helpful for us to get an Impala query profile of the query that returns wrong results. Would you be able to provide that to help us debug? Please see on this url for impala query profile. Thanks
... View more
04-18-2018
01:13 AM
Thank you all, I added.. PARTITIONED BY (`date` STRING, hour INT) ..to the CREATE External TABLE sentence and then.. ALTER TABLE processed_data_p ADD PARTITION (`date`="2015-08-19", hour=20) LOCATION '/user/myuser/spark/proyecto1/date=2015-08-19/hour=20'; One option to avoid adding all partitions manually is: alter table processed_data_p recover partitions;
... View more
04-12-2018
06:26 AM
DROP TABLE IF EXISTS kudu_db.fact_patientencounter_kudu; CREATE TABLE kudu_db.fact_patientencounter_kudu PRIMARY KEY (pk_patientencounterid, fk_agencyid) PARTITION BY HASH(fk_agencyid) PARTITIONS 3 STORED AS KUDU AS SELECT pk_patientencounterid, fk_agencyid FROM default.fact_patientencounter The above does work. Thanks for the quick replies on this.
... View more
02-27-2018
08:06 AM
1 Kudo
Kudu has the capability to evaluate simple filters natively, e.g. using the primary index of a table, so Impala will push such filters directly to Kudu. More complex filters (e.g. those involving UDFs) are evaluated by Impala after receiving rows from Kudu. Impala clearly distinguishes the filters evaluated by Kudu and those by Impala in the explain plan.
... View more
02-09-2018
11:46 PM
Thanks for you reply. I try to SET MT_DOP=0; before compute stats and it works! The impalad does not crash any more althougth compute stats still fail due to incompataible schema.
... View more
02-09-2018
03:36 PM
@siddesh210491the simplest solution might be to set the safety valve globally as above. That will apply globally but may be a reasonable setting for clients other than hue too. Otherwise another option is to use the query_timeout_s query option. You can set a default value for that option (or any query options) if you have dynamic resource pools set up, with all Hue queries going into a pool. https://www.cloudera.com/documentation/enterprise/latest/topics/cm_mc_resource_pools.html#concept_xkk_l1d_wr__impala_dynamic_pool_settings
... View more
02-05-2018
11:59 AM
1 Kudo
Thank you a lot for quick response! -------EDIT---------- Also, I have found a possible walk around with using split_part() function instead of substr() - it will work for my particular case
... View more
01-22-2018
01:25 PM
Hello everyone, I was wondering like Iamwan if it was still impossible today (in January 2018) to auto increment an ID (an int, a bigint or whatever) in a table (in Impala or Hive) when data are added. Would you have further information please ? I didn't find anything about it here : https://www.cloudera.com/documentation/enterprise/5-9-x/topics/impala_datatypes.html Have a nice day !
... View more