Support Questions
Find answers, ask questions, and share your expertise

Frequently accessed impala table

Frequently accessed impala table


How do I check which Impala table is more frequently accessed so that I can gather my hot data for HDFS cacheing.


Re: Frequently accessed impala table

Set up auditing for Impala and use a tool to analyze it (or do it yourself). I have feed these audit logs to Cloudera Optimizer and Splunk (this requires Splunk and SPL knowledge). Both will give you your answers to this and quite a bit more.

Honestly, to just get your answer you should be able to read the audit files and use basic unix tools like grep, awk, cut, sort, uniq, etc. to get tables names and the frequency.