Support Questions
Find answers, ask questions, and share your expertise
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Frequently accessed impala table

Frequently accessed impala table


How do I check which Impala table is more frequently accessed so that I can gather my hot data for HDFS cacheing.


Re: Frequently accessed impala table

Set up auditing for Impala and use a tool to analyze it (or do it yourself). I have feed these audit logs to Cloudera Optimizer and Splunk (this requires Splunk and SPL knowledge). Both will give you your answers to this and quite a bit more.

Honestly, to just get your answer you should be able to read the audit files and use basic unix tools like grep, awk, cut, sort, uniq, etc. to get tables names and the frequency.
Don't have an account?
Coming from Hortonworks? Activate your account here