About Tim Armstrong

Tim Armstrong · ‎06-05-2018

@mauriciothanks for the profile. I think you might be better off tweaking DISABLE_CODEGEN_ROWS_THRESHOLD instead of using the big hammer of DISABLE_CODEGEN. The way that option works is that codegen is disabled automatically if the planner detects that no point in the query plan processes that number of rows per backend. The default is 50,000. E.g. if your query scans 100,000 rows split across three backends (33,333 per backend), it will disable codegen automatically. Instead of setting DISABLE_CODEGEN, I'd suggest increasing the value first. Based on the profile you sent me, it looks like something like 400000 might be sufficient for that query at least.

Tim Armstrong · ‎06-05-2018

@mauricioI agree it's not great to turn it on globally. I'd be interested in seeing the query profile to understand what happened. We've made some codegen time improvements but there are still remaining issues so would be good to see if it's something we've fixed or not.

Tim Armstrong · ‎05-23-2018

@Hrishi1did you consider setting a default SCRATCH_LIMIT at the resource pool level so that queries will fail if they spill too much data? I know a lot of cluster admins do things like that to prevent runaway queries, and also so that users will come to them if they're trying to run big queries instead of them having to contact users. I understand that it's not exactly what you're looking for but I've seen people have success with it.

Tim Armstrong · ‎05-21-2018

I looked into it and we don't currently support per-query alerts. I passed along this feedback to the Cloudera Manager team. I guess we already covered it, but my two suggestions would be: Set a default scratch_limit per-pool or globally so that users don't accidentally write queries that spill a lot of data Set up monitoring for some aggregate threshold, then use the queries page to discover the spilling queries. My philosophy on this is that spilling queries are nothing to be concerned about as long as queries are completing fast enough for your needs.

Tim Armstrong · ‎05-16-2018

I'm planning to get back to you with an answer - just haven't been able to find the time yet 🙂

Tim Armstrong · ‎05-14-2018

Depending on exactly what you want to trigger on, you can use the generic function in CM to trigger based on any tsquery expression: https://www.cloudera.com/documentation/enterprise/latest/topics/cm_dg_triggers_usecases.html . There are a number of metrics tracking spill-to-disk: https://www.cloudera.com/documentation/enterprise/latest/topics/cm_metrics_impala.html I don't fully understand the goal though - generally spill-to-disk happens transparently as part of normal query processing when memory is constrained and isn't cause for concern. If your aim is to prevent runaway spilling, the scratch_limit query option is a direct way to do that: https://www.cloudera.com/documentation/enterprise/latest/topics/impala_scratch_limit.html . You can set the default query option globally or set default query options per-resource-pool via the "Dynamic Resource Pools" UI in CM. https://www.cloudera.com/documentation/enterprise/latest/topics/impala_disable_unsafe_spills.html is also occasionally useful.

Tim Armstrong · ‎05-11-2018

The CM queries tab keeps track of "Memory Spilled" per query. You can choose to display it via "select attributes" and also search for queries based on memory_spilled in the search box. If you click the down array next to the query and look at "query details", the information is in there too. The "Utilization Report" UI also has some aggregate information about memory spilled per resource pool.

Tim Armstrong · ‎05-02-2018

There are also --idle_query_timeout and --idle_session_timeout startup flags that set an upper bound on the expiration. They might also be set.

Tim Armstrong · ‎04-27-2018

https://issues.apache.org/jira/browse/IMPALA-6882 is easy to rule out since it only occurs on > 5 year old processors.

Tim Armstrong · ‎04-26-2018

I agree we'd need more info to diagnose. Based on the correlation with querying a nested types table, it could be https://issues.apache.org/jira/browse/IMPALA-6489 which is fixed in the 5.14.2 maintenance release.

Online	Offline
Last Visited	‎02-11-2021 06:07 PM

Member Since	‎07-29-2015 04:07 PM
Last Visited	‎02-11-2021 06:07 PM
Posts	535
Kudos received	141

Cloudera Community

Re: Impala Queries which were previously working a...

Re: Impala queries are not distributing to all the...

Re: impala - `recover partitions` points to old da...

Re: impala catalog server JVM

Re: Impala - On-demand metadata

Re: Very slow CodeGen taking 80% of runtime

Re: Very slow CodeGen taking 80% of runtime

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: Query blah expired due to client inactivity (t...

Re: Impala critical bug in CDH 5.14.0

Re: Impala critical bug in CDH 5.14.0