- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Hive Compute Statistics not showing any statistics while describe table
- Labels:
-
Apache Hive
Created ‎10-04-2018 05:44 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Trying to see statistics on a particular column. I executed the ANALYZE command first and then tried to see the stats by DESCRIBE FORMATTED <table_name> <col_name>. I cant see any values in this. Any idea why its not showing any values?
I tried MSCK and analyzed the table again and checked for stats. No luck so far.
Thanks
Anil
Created ‎10-12-2018 08:41 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Below Statement worked for me finally . hive> desc formatted testdb.table col_name partition (data_dt='20180715');
Created ‎10-05-2018 09:40 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I suspect that the table is partitioned because of which the "describe formatted" does not show any stats related information.
Try running "describe extended" for particular partition spec.
Created ‎10-05-2018 03:58 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Sindhu, Thanks for the reply.
I tried the same and it wont return any values. But the compute statistics command worked fine though
hive> analyze table testdb.table partition(data_dt) compute statistics for columns order_dispatch_diff; Query ID = anil_20181004181226_ed6afd54-fed8-4601-8b54-29214a8dbfc3 Total jobs = 1 Launching Job 1 out of 1 Status: Running (Executing on YARN cluster with App id application_1537195062060_1102639) -------------------------------------------------------------------------------- VERTICES STATUS TOTAL COMPLETED RUNNING PENDING FAILED KILLED -------------------------------------------------------------------------------- Map 1 .......... SUCCEEDED 62 62 0 0 0 0 Reducer 2 ...... SUCCEEDED 1 1 0 0 0 0 -------------------------------------------------------------------------------- VERTICES: 02/02 [==========================>>] 100% ELAPSED TIME: 101.08 s -------------------------------------------------------------------------------- OK Time taken: 108.142 seconds hive> desc extended testdb.table order_dispatch_diff; OK order_dispatch_diff int from deserializer Time taken: 0.041 seconds, Fetched: 1 row(s)
Created ‎10-06-2018 10:31 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Yes, second that, observing the same behaviour surprisingly.
desc formatted/extended shows no values for the column stats for the partition even after analyse. Is this a reported bug?
Created ‎10-12-2018 08:41 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Below Statement worked for me finally . hive> desc formatted testdb.table col_name partition (data_dt='20180715');
