Explorer
Posts: 6
Registered: ‎04-22-2014

analyze table ... compute statistics does not provide detailed statistics.

I am trying to run the

analyze table <tbl_name> compute statistics

query in Hive.
The query runs without any issues. However, when it comes to viewing the results using the query

Describe extended <tbl_name>

all I can see is the column names and column types (int, string, etc.), but no detailed statistics.
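
For concreteness, the sequence described above can be sketched as follows. The table name `my_table` and the columns `id` and `name` are hypothetical placeholders. The `FOR COLUMNS` variant and `DESCRIBE FORMATTED` are not part of what was run above; they are shown only as an assumption about where table-level and column-level statistics are usually gathered and displayed, and their behavior may differ by Hive version.

```sql
-- Hypothetical table and column names, used only for illustration.

-- Gathers table-level statistics (for example numFiles, totalSize and,
-- where supported, numRows).
ANALYZE TABLE my_table COMPUTE STATISTICS;

-- Column-level statistics require the FOR COLUMNS clause
-- (assumes Hive 0.10 or later).
ANALYZE TABLE my_table COMPUTE STATISTICS FOR COLUMNS id, name;

-- Table-level statistics, when stored, are listed among the table parameters.
DESCRIBE EXTENDED my_table;
DESCRIBE FORMATTED my_table;
```

If the table parameters in the `DESCRIBE` output contain no statistics entries at all, that would suggest the statistics were never written to the metastore, as opposed to being written but not displayed.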

I have a MySQL database as the metastore, configured as described at the link below: http://www.cloudera.com/content/cloudera-content/cloudera-docs/CM4Ent/latest/Cloudera-Manager-Instal...

I am running CDH 5 beta 2.

It is my understanding that the patch for JIRA HIVE-1326 should have implemented statistics, and that we should be able to see details such as:

  • number of distinct values
  • number of NULL values
  • min/max k values, where k could be given by the user
  • histogram: frequency-balanced and height-balanced
  • average size of the column
  • avg/sum of all values in the column if their type is numerical
  • percentiles of the value

I am not able to see either the basic or the detailed statistics. Any help as to why would be appreciated.

Posts: 1,826
Kudos: 406
Solutions: 292
Registered: ‎07-31-2013

Re: analyze table ... compute statistics does not provide detailed statistics.

Can you recheck the JIRA ID posted in your message, so we can cross-reference it with the CDH version you have installed and see whether the feature is included? The one you referenced (HIVE-1326) does not appear to be related.
