About alex.behm

Tim Armstrong · ‎07-13-2018

@mcalnd it's in CDH5.8 onwards

rafeski · ‎07-02-2018

So this begs the question: how does Cloudera Manager produce the following chart? Is there any reason we can't use the SQL: select query_duration from DEFAULT.IMPALA_QUERIES where serviceName="impala" and (stats_missing=true or stats_corrupt=true)?

hendry · ‎05-10-2018

@alex.behm wrote: To debug wrong results, it's very helpful for us to get an Impala query profile of the query that returns wrong results. Would you be able to provide that to help us debug? Please see this URL for the Impala query profile. Thanks

messenjah00 · ‎04-18-2018

Thank you all. I added PARTITIONED BY (`date` STRING, hour INT) to the CREATE EXTERNAL TABLE statement and then ran: ALTER TABLE processed_data_p ADD PARTITION (`date`="2015-08-19", hour=20) LOCATION '/user/myuser/spark/proyecto1/date=2015-08-19/hour=20'; One option to avoid adding all partitions manually is: alter table processed_data_p recover partitions;

AKB · ‎04-12-2018

DROP TABLE IF EXISTS kudu_db.fact_patientencounter_kudu;
CREATE TABLE kudu_db.fact_patientencounter_kudu
PRIMARY KEY (pk_patientencounterid, fk_agencyid)
PARTITION BY HASH(fk_agencyid) PARTITIONS 3
STORED AS KUDU
AS SELECT pk_patientencounterid, fk_agencyid
FROM default.fact_patientencounter;
The above does work. Thanks for the quick replies on this.

alex.behm · ‎02-27-2018

Kudu has the capability to evaluate simple filters natively, e.g. using the primary index of a table, so Impala will push such filters directly to Kudu. More complex filters (e.g. those involving UDFs) are evaluated by Impala after receiving rows from Kudu. Impala clearly distinguishes the filters evaluated by Kudu and those by Impala in the explain plan.
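As a rough illustration of the distinction described above, the two statements below can be compared in impala-shell. This is only a sketch: the table `kudu_db.events`, its columns `id` and `name`, and the function `my_udf` are hypothetical names, not taken from the thread. In the plan, the Kudu scan node is the place to look: filters pushed to Kudu are typically listed as `kudu predicates`, while those evaluated by Impala appear as plain `predicates`.

```sql
-- Hypothetical Kudu table, columns, and UDF; for illustration only.

-- A simple comparison on a primary key column is the kind of filter
-- Impala can push down to Kudu.
EXPLAIN SELECT id, name FROM kudu_db.events WHERE id = 42;

-- A filter that goes through a UDF is the kind of filter Impala
-- evaluates itself, after receiving the rows from Kudu.
EXPLAIN SELECT id, name FROM kudu_db.events WHERE my_udf(name) = 'x';
```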

linhao · ‎02-09-2018

Thanks for your reply. I tried SET MT_DOP=0; before compute stats and it works! The impalad does not crash any more, although compute stats still fails due to an incompatible schema.

Tim Armstrong · ‎02-09-2018

@siddesh210491 the simplest solution might be to set the safety valve globally as above. That will apply globally but may be a reasonable setting for clients other than Hue too. Otherwise, another option is to use the query_timeout_s query option. You can set a default value for that option (or any query option) if you have dynamic resource pools set up, with all Hue queries going into a pool. https://www.cloudera.com/documentation/enterprise/latest/topics/cm_mc_resource_pools.html#concept_xkk_l1d_wr__impala_dynamic_pool_settings
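For a quick per-session check of the query option mentioned above, something like the following could be run from impala-shell or Hue. The value of 600 seconds is an arbitrary assumption chosen for the example, not a recommendation from the thread.

```sql
-- Idle timeout for queries in this session, in seconds.
-- 600 is an illustrative value; pick one that suits your clients.
SET QUERY_TIMEOUT_S=600;
```

Setting the same option as a pool-level default, as described in the linked documentation, avoids having to issue it in every session.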

laanajar · ‎02-05-2018

Thanks a lot for the quick response! -------EDIT---------- Also, I have found a possible workaround: using the split_part() function instead of substr(). It will work for my particular case.

Meldin · ‎01-22-2018

Hello everyone, I was wondering, like Iamwan, whether it is still impossible today (in January 2018) to auto-increment an ID (an int, a bigint or whatever) in a table (in Impala or Hive) when data is added. Would you have any further information, please? I didn't find anything about it here: https://www.cloudera.com/documentation/enterprise/5-9-x/topics/impala_datatypes.html Have a nice day!

Last Visited	‎05-10-2018 06:52 PM

Member Since	‎10-16-2013 11:04 AM
Posts	307
Kudos received	77

Cloudera Community

Re: External Table from Parquet folder returns emp...

Re: Impala SQL for KUDU does not work

Re: Impalad logs diskspace full

Re: Impala round function does not return expected...

Re: Is Impala a proces engine when I use kudu?

Re: Impalad logs diskspace full

Re: Missing stats in Impala

Re: Select in impala has different value with hive

Re: External Table from Parquet folder returns emp...

Re: Impala SQL for KUDU does not work

Re: Is Impala a proces engine when I use kudu?

Re: Impalad exit when execute 'compute stats' on ...

Re: Can the impalad "idle_query_timeout" parameter...

Re: IllegalStateException: null using first_value ...

Re: Is it possiable Auto Increment columns in Impa...