

What exactly do column and row stats mean in a running Impala query?


Hi Team,


I see that column stats and row stats are unavailable in an Impala query plan. Can I know what exactly this means?

partitions=1/1 files=200 size=8.77GB
   stats-rows=unavailable extrapolated-rows=disabled
   table stats: rows=unavailable size=8.77GB
   column stats: unavailable



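Impala requires statistics about the data in your table so that it can optimise query plans for most efficient execution. "unavailable" in the plan means that no statistics have been computed for this table yet, so the planner does not know the row count or the column-level details and has to plan without them.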



You should generally run "compute stats" on your tables after loading data into them or after the data has changed significantly. For more details, I'd recommend our docs, which have more information than I could fit in this reply.
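As a rough sketch of what that looks like in impala-shell: the table name my_db.my_table below is only a placeholder (it is not from the original question), so substitute your own database and table.

```sql
-- Hypothetical table name; replace my_db.my_table with your own table.

-- Gather table-level (row count) and column-level statistics.
COMPUTE STATS my_db.my_table;

-- Check the result: #Rows should no longer be -1 once stats exist.
SHOW TABLE STATS my_db.my_table;

-- Check column-level stats (distinct values, nulls, sizes).
SHOW COLUMN STATS my_db.my_table;
```

After this, re-running EXPLAIN on the same query should show a row count and column stats in place of "unavailable". For large partitioned tables, COMPUTE INCREMENTAL STATS is an alternative worth reading about in the docs before choosing between the two.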
