About alex.behm

alex.behm · ‎09-07-2016

It looks like your table metadata is in a strange state. How did you create the table exactly? Did you alter the table (e.g. add/remove columns)?

alex.behm · ‎09-07-2016

Just realized the docs do not yet have info on manually setting column stats: https://issues.cloudera.org/browse/IMPALA-3369 Here's the JIRA that enabled that feature, you can find the syntax in the comments. Feel free to ask questions if you are considering this workaround 🙂

alex.behm · ‎09-07-2016

Sorry this is causing you so much pain. One workaround is to update the table and column stats manually. The main idea is this: You can cut down the time for computing stats significantly by manually computing and setting the stats for those columns that actually need them. The columns used in predicates (including join predicates) should have stats. The column can be updated relatively infrequently. Setting the column stats manually is a relatively new feature. The table stats can also be computed and set manually, so e.g., if you've just added a new partition you can do a count(*) on that partition and set the #rows manually. You can find more info in the docs here: http://www.cloudera.com/documentation/enterprise/5-7-x/topics/impala_perf_stats.html#perf_stats

alex.behm · ‎08-26-2016

If you want to delete a whole database you can: drop database <dbname> cascade; That will drop the database and all its tables.

alex.behm · ‎08-01-2016

Did you enable short-cirtuit reads via these configurations? http://www.cloudera.com/documentation/enterprise/latest/topics/admin_hdfs_short_circuit_reads.html

alex.behm · ‎07-29-2016

Filed the JIRA: https://issues.cloudera.org/browse/IMPALA-3938 Thanks a lot for your detailed report and easy reproduction!

alex.behm · ‎07-29-2016

Still investigating and working on the JIRA. In the meantimg, I think the query that you mean is this: select t.id, l.pos as location_number, m.key, m.value from mytable t join t.locations l join l.item m order by id, l.pos; The bug is that those queries returning wrong results are not semantically correct but Impala runs them anyway and gives "arbitrary" results.

alex.behm · ‎07-26-2016

Hi Thomas, in your example query the table alias 'a' has a 'pos' pseudo-column that refers to the element-position within the ARRAY, so I think it's indeed what you want. However, I think there is a bug here that may lead to confusion. The query you wrote "should" be illegal because you should not be able to access a.key and a.value without referencing the nested map in the FROM clause. I will follow-up with a JIRA, stay tuned. Can you try running this and see if you get the expected results? select c.id, a.pos from mytable c left join c.`location` a order by c.id, a.pos; Alternatively try this query if you also want to explode the items within the nested MAP: select c.id, a.pos, a.key, a.value from mytable c left join c.`location` a, a.item order by c.id, a.pos;

alex.behm · ‎07-15-2016

Please take a look at this thread for a response: http://community.cloudera.com/t5/Interactive-Short-cycle-SQL/Comma-delimited-string-to-individual-rows/m-p/41402#M1781

alex.behm · ‎07-15-2016

Hi! Good question. Today, Impala is not aware of the heterogeneity and will split the work evenly among all available nodes - regardless of how much cpu/memory those nodes have.

Online	Offline
Last Visited	‎05-10-2018 06:52 PM

Member Since	‎10-16-2013 11:04 AM
Last Visited	‎05-10-2018 06:52 PM
Posts	307
Kudos received	77

Cloudera Community

Re: External Table from Parquet folder returns emp...

Re: Impala SQL for KUDU does not work

Re: Impalad logs diskspace full

Re: Impala round function does not return expected...

Re: Is Impala a proces engine when I use kudu?

Re: Impala AVRO schema throws error while Querying

Re: Incremental stats size estimate exceeds 200.00...

Re: Incremental stats size estimate exceeds 200.00...

Re: Drop multiples tables

Re: Impala error during install

Re: Impala Complex types: position in ARRAY of MAP

Re: Impala Complex types: position in ARRAY of MAP

Re: Impala Complex types: position in ARRAY of MAP

Re: String to array

Re: How does Impala handle heterogeneous hardware?