- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
COMPUTE Stats or Analyze table
- Labels:
-
Apache Hive
-
Apache Impala
Created on 04-18-2019 01:20 AM - edited 09-16-2022 07:19 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi All,
We have created a Hive database and have some external tables on with files placed in S3 bucket. Our Application accesses this data using Impala.
Tables are not partitioned and the files are in text/csv format.
How should we collect stats using Impala or Hive ?
Is it possible to collect stats for Hive tables using Impala and HIve tables stats should only be created to hive. Appreciate your help.
Regards.
Sandeep Suman
Created 04-18-2019 10:09 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
If you are mainly accessing the table using Impala, I'd recommend Impala's compute stats for best performance of Impala.
There are some subtle differences in the stats collected (whether they're partition or table-level). The engines can interoperate but Impala can generally generate better plans with the full set of stats from "COMPUTE STATS"
Created 04-18-2019 10:09 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
If you are mainly accessing the table using Impala, I'd recommend Impala's compute stats for best performance of Impala.
There are some subtle differences in the stats collected (whether they're partition or table-level). The engines can interoperate but Impala can generally generate better plans with the full set of stats from "COMPUTE STATS"
