03-11-2019 06:50 PM - edited 03-11-2019 06:51 PM
I am managing a CDH 5.6.0 cluster consisting of 16 Linux servers (1 eco, 2 namenodes, 13 datanodes).
The cluster itself has had no problems so far, but the volume of analysis work has increased recently. Anyone involved in analysis can access Hive on CDH through a SQL client (e.g., DBeaver). My concern is that the cluster could go down, because these users send so many queries to Hive.
Is there a way to monitor the IP addresses of clients accessing Hive?
03-12-2019 08:04 AM
As @Consult mentioned, you can use the Cloudera Navigator UI to query for Hive audit events, in the Audits tab. Each audit event is associated with the username and the IP address of the client request. This should help you to get an idea of who runs the most queries.
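If you want a quick ranking rather than scrolling through the Audits tab, one option is to export the filtered audit events and count them per user and client IP. Below is a minimal Python sketch, assuming the export is a CSV file with one row per audit event. The column names `username` and `ipAddress`, and the helper `count_queries_by_client`, are assumptions made for illustration; check the header row of your own export and adjust them.

```python
import csv
import io
from collections import Counter


def count_queries_by_client(csv_text, user_col="username", ip_col="ipAddress"):
    """Count audit events per (username, client IP) pair.

    user_col and ip_col are assumed column names; change them to match
    the header row of the exported audit file.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = Counter()
    for row in reader:
        counts[(row[user_col], row[ip_col])] += 1
    # Most active clients first.
    return counts.most_common()


# Tiny made-up sample standing in for an exported audit file.
SAMPLE = """username,ipAddress,operation
alice,10.0.0.5,QUERY
bob,10.0.0.7,QUERY
alice,10.0.0.5,QUERY
"""

top = count_queries_by_client(SAMPLE)
for (user, ip), n in top:
    print(f"{user}\t{ip}\t{n}")
```

For a real export, read the file contents (for example with `open(path).read()`) and pass them in place of `SAMPLE`. The clients at the top of the output are the ones sending the most requests to Hive.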