Posts: 47
Registered: ‎12-28-2015

Frequently accessed impala table

How do I check which Impala tables are accessed most frequently, so that I can identify my hot data for HDFS caching?

Posts: 642
Topics: 3
Kudos: 121
Solutions: 67
Registered: ‎08-16-2016

Re: Frequently accessed impala table

Set up auditing for Impala and use a tool to analyze the audit logs (or do it yourself). I have fed these audit logs to Cloudera Optimizer and Splunk (the latter requires Splunk and SPL knowledge). Both will give you the answer to this and quite a bit more.

Honestly, to just get your answer you should be able to read the audit files and use basic Unix tools like grep, awk, cut, sort, uniq, etc. to get the table names and how often each one appears.
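
If you would rather script it than chain shell tools, here is a rough Python sketch of the same counting idea. It assumes each line of the audit file is one JSON object keyed by a timestamp, and that each event carries a "catalog_objects" list whose entries have "name" and "object_type" fields. That layout is my assumption about the audit log format, so check it against a few lines of your own files first. The function names are just placeholders I made up.

```python
import json
from collections import Counter


def count_table_accesses(lines):
    """Count how often each table name appears across audit log lines.

    Assumes (not guaranteed for your version) that every line looks like:
    {"<timestamp>": {"catalog_objects": [{"name": "db.tbl",
                                          "object_type": "TABLE", ...}], ...}}
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip partial or corrupt lines
        if not isinstance(record, dict):
            continue
        for event in record.values():
            if not isinstance(event, dict):
                continue
            for obj in event.get("catalog_objects") or []:
                # Only count tables; views, databases, etc. are ignored.
                if isinstance(obj, dict) and obj.get("object_type") == "TABLE":
                    name = obj.get("name")
                    if name:
                        counts[name] += 1
    return counts


def count_from_files(paths):
    """Aggregate counts over several audit log files (paths are yours to supply)."""
    total = Counter()
    for path in paths:
        with open(path, encoding="utf-8", errors="replace") as handle:
            total.update(count_table_accesses(handle))
    return total
```

Call count_from_files() with the list of your audit log files and then use .most_common(20) on the result to see the top candidates for caching. Note this counts one hit per table per audited statement, which is a proxy for access frequency and says nothing about how much data each query actually read.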