Member since
03-19-2019
18
Posts
0
Kudos Received
0
Solutions
04-19-2019
02:30 PM
1 Kudo
https://www.cloudera.com/documentation/enterprise/latest/topics/impala_explain_plan.html#explain_plan is our high level doc. I would recommend starting with summary to understand where time is spent, then using the profile to drill down into individual nodes. WorkloadXM can help a lot automate the analysis process to understand bottlenecks.
... View more
04-17-2019
10:28 PM
Thank you very much Tim. Provided link has clarified my doubt.
... View more
04-17-2019
06:00 PM
1 Kudo
In it's default configuration, metadata is cached until an "INVALIDATE METADATA" command evicts the table from the cache. Or until the catalog is restarted. In 5.16 and 6.1+ there are some non-default options that will evict metadata after a particular timeout. At some point these will become the defaults. Table stats are collected and stored in the hive metastore when you run a "compute stats" command. They are then just part of the table metadata.
... View more
04-10-2019
05:02 AM
Thank you very much Tim for providing this insight. I have assumption that MEM_LIMIT option is asking for that amount of space for query.
... View more