kudu table size

Expert Contributor

Hi,

Do we have a command to check the size of a Kudu table? For Hive and HBase tables we can check the size on HDFS.

Is there something similar for Kudu?


Expert Contributor
Hi,

There is not a command to do this. However, if you are using Cloudera
Manager, you can navigate to the "Charts Library" page under the Kudu
service, and then select "Tables" on the left hand side, and then select
the table of interest. This should give various metrics including its size
on disk (post-replication).

Hope that helps
-Todd

Explorer

Is there not a Kudu command which will allow for obtaining table size information? If not, then how does Cloudera Manager perform this? We would like to be able to replicate this behavior so that we can configure e-mail alerts to be sent whenever a table reaches a particular size.

 

Thanks,

Braz

Contributor

CM scrapes the /metrics page of each tablet server instance and aggregates the values per tablet/table.

 

Have you reviewed CM triggers/alerts?[1] You might be able to configure email alerts with a similar trigger rule for table sizes.

Alternatively, you could implement what CM currently does by scraping each tablet server's /metrics page and aggregating the data together per tablet/table.
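
To make the scrape-and-aggregate idea a bit more concrete, here is a minimal sketch in Python. It is not how CM is actually implemented. It assumes that each tablet server serves /metrics as a JSON list of entities, that tablet entities carry a "table_name" attribute, that the per-tablet size metric is named "on_disk_size", and that the web UI listens on port 8050. Please check all of these against the /metrics output of your own Kudu version. The function names and the threshold helper are made up for illustration.

```python
import json
from collections import defaultdict
from urllib.request import urlopen


def table_sizes_from_metrics(entities, metric_name="on_disk_size"):
    """Sum one tablet-level metric per table for a single tablet server.

    `entities` is the parsed JSON from /metrics. The layout used here
    (type / attributes.table_name / metrics[].name and .value) is an
    assumption to verify against your own cluster.
    """
    totals = defaultdict(int)
    for entity in entities:
        if entity.get("type") != "tablet":
            continue  # skip server-level entities
        table = entity.get("attributes", {}).get("table_name", "<unknown>")
        for metric in entity.get("metrics", []):
            if metric.get("name") == metric_name:
                totals[table] += int(metric.get("value", 0))
    return dict(totals)


def fetch_entities(host, port=8050, timeout=10):
    """Fetch and parse /metrics from one tablet server (port is assumed)."""
    with urlopen(f"http://{host}:{port}/metrics", timeout=timeout) as resp:
        return json.load(resp)


def cluster_table_sizes(hosts):
    """Add up per-table sizes over all tablet servers.

    Every replica is counted, so the result is a post-replication size.
    """
    totals = defaultdict(int)
    for host in hosts:
        for table, size in table_sizes_from_metrics(fetch_entities(host)).items():
            totals[table] += size
    return dict(totals)


def tables_over_threshold(sizes, threshold_bytes):
    """Return the sorted names of tables at or above the threshold."""
    return sorted(t for t, s in sizes.items() if s >= threshold_bytes)
```

You could run cluster_table_sizes() from cron with your list of tablet server hostnames and send an e-mail whenever tables_over_threshold() returns a non-empty list.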

 

 

[1] https://www.cloudera.com/documentation/enterprise/latest/topics/cm_dg_triggers_usecases.html

Rising Star

I just wanted to add to Todd's suggestion: if you have CM, you can also create a new chart with this query: "select total_kudu_on_disk_size_across_kudu_replicas where category=KUDU_TABLE". It will plot all your table sizes, and the graph detail will list the current values for all entries.

 

Probably not easily scriptable, but at least a way to quickly copy all sizes in one go, looking like this:

 

7.2T impala::<tablename_redacted> (Kudu)
9.8T impala::<tablename_redacted> (Kudu)
6.5T impala::<tablename_redacted> (Kudu)
4.1G impala::<tablename_redacted> (Kudu)
21.5G impala::<tablename_redacted> (Kudu)
15.2G impala::<tablename_redacted> (Kudu)
6.1T impala::<tablename_redacted> (Kudu)
98G impala::<tablename_redacted> (Kudu)
23.2G impala::<tablename_redacted> (Kudu)
10G impala::<tablename_redacted> (Kudu)
9.1G impala::<tablename_redacted> (Kudu)
1.2T impala::<tablename_redacted> (Kudu)
7.5G impala::<tablename_redacted> (Kudu)
2.6T impala::<tablename_redacted> (Kudu)
35.8T impala::<tablename_redacted> (Kudu)
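
If you want to check a copied listing like this against a size limit, a small Python sketch along the following lines could turn it into byte counts. It assumes that the suffixes are binary units (G = 1024^3, T = 1024^4), which I have not confirmed for CM's display, and the function name is just for illustration.

```python
import re

# Assumption: CM's suffixes are binary multiples; switch to powers of 1000 if not.
_UNITS = {"B": 1, "K": 1024, "M": 1024**2, "G": 1024**3, "T": 1024**4, "P": 1024**5}
_LINE = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*([BKMGTP])\s+(.+?)\s*$")


def parse_size_lines(text):
    """Parse lines like '7.2T impala::name (Kudu)' into (name, bytes) pairs.

    Lines that do not match the pattern are skipped.
    """
    rows = []
    for line in text.splitlines():
        match = _LINE.match(line)
        if not match:
            continue
        number, unit, name = match.groups()
        rows.append((name, int(float(number) * _UNITS[unit])))
    return rows
```

Because the listing only shows rounded values, the results are approximate, so this is only good enough for coarse thresholds.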