Created 02-15-2019 10:02 AM
The server and the agents run in LXC containers on Proxmox Virtual Environment 5.3, with the latest version.
I have some issues with Ambari Metrics:
tail /var/log/ambari-metrics-monitor/ambari-metrics-monitor.out
  File "/usr/lib/python2.6/site-packages/resource_monitoring/core/metric_collector.py", line 45, in process_event
    self.process_host_collection_event(event)
  File "/usr/lib/python2.6/site-packages/resource_monitoring/core/metric_collector.py", line 80, in process_host_collection_event
    metrics.update(self.host_info.get_combined_disk_io_counters())
  File "/usr/lib/python2.6/site-packages/resource_monitoring/core/host_info.py", line 276, in get_combined_disk_io_counters
    rawdict = psutil.disk_io_counters(True)
  File "/usr/lib/python2.6/site-packages/resource_monitoring/psutil/build/lib.linux-x86_64-2.7/psutil/__init__.py", line 1726, in disk_io_counters
    raise RuntimeError("couldn't find any physical disk")
RuntimeError: couldn't find any physical disk
Created 02-18-2019 09:30 AM
How can I disable HDD monitoring? All I want is CPU/RAM/network monitoring.
Created 02-18-2019 10:15 AM
Maybe you can try commenting out the following 4 lines in the script shown below on all your cluster nodes, followed by a "metrics-monitor" restart on all the cluster nodes.
# grep -iR 'disk_io_counters' /usr/lib/python2.6/site-packages/resource_monitoring/core/metric_collector.py
metrics.update(self.host_info.get_combined_disk_io_counters())
metrics.update(self.host_info.get_disk_io_counters_per_disk())
metrics.update(self.host_info.get_combined_disk_io_counters())
metrics.update(self.host_info.get_disk_io_counters_per_disk())
After commenting out / removing those lines you will need to restart the Metrics Monitor from the Ambari UI on all hosts (or better, restart the AMS service).
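If there are many nodes, the manual edit could be scripted. The following is only a rough sketch, not part of Ambari: the helper names `comment_out_disk_metrics` and `patch_file` are made up for illustration, and it assumes that every uncommented line mentioning `disk_io_counters` in the file should be disabled, which matches the grep output above. It avoids Python 3-only syntax so that it can run on the older interpreters found on the nodes.

```python
import re
import shutil

# Same pattern as the grep above (case-insensitive match on the call name).
DISK_CALL = re.compile(r"disk_io_counters", re.IGNORECASE)


def comment_out_disk_metrics(source):
    """Return (new_source, changed_count).

    Every line that mentions disk_io_counters and is not already a
    comment gets a "# " prefix; its original indentation is kept.
    """
    out = []
    changed = 0
    for line in source.splitlines(True):  # True keeps the line endings
        stripped = line.lstrip()
        if DISK_CALL.search(line) and not stripped.startswith("#"):
            indent = line[: len(line) - len(stripped)]
            out.append(indent + "# " + stripped)
            changed += 1
        else:
            out.append(line)
    return "".join(out), changed


def patch_file(path):
    """Hypothetical helper: back up the file, then rewrite it in place."""
    shutil.copy2(path, path + ".bak")  # keep a copy of the original
    with open(path) as handle:
        patched, changed = comment_out_disk_metrics(handle.read())
    with open(path, "w") as handle:
        handle.write(patched)
    return changed
```

Calling `patch_file` on the `metric_collector.py` path from the grep command should report 4 changed lines if the file matches the output above; running it a second time changes nothing. Check the patched file before restarting: if the disabled calls were the only statements in a block, Python will refuse to load it and a `pass` has to be added there. The Metrics Monitor restart is still needed afterwards.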
Created 02-18-2019 11:27 AM
works thanks 🙂