Created 05-24-2024 04:12 AM
Hi,
I am in need for some sort of alerting mechanism to monitor long running CDE jobs, basically we use Airflow to call /run Spark DBT jobs I am looking if there is a way to monitor / alert if any jobs cross certain threshold time limit.
Would appreciate if anyone has implemented it / guide in this direction.
Thanks
Wert
Created 06-18-2024 01:00 AM
You can integrate Airflow with a monitoring tool such as Prometheus or Grafana. These tools provide advanced monitoring and alerting capabilities. You can configure thresholds and receive alerts when the job runtime exceeds the specified limit.
Alternatively, you can use a dedicated job monitoring tool like Apache Oozie or Azkaban, which also offer alerting mechanisms to monitor long-running jobs. These tools provide more comprehensive job management features and can be integrated with Airflow.
Created 06-18-2024 01:00 AM
You can integrate Airflow with a monitoring tool such as Prometheus or Grafana. These tools provide advanced monitoring and alerting capabilities. You can configure thresholds and receive alerts when the job runtime exceeds the specified limit.
Alternatively, you can use a dedicated job monitoring tool like Apache Oozie or Azkaban, which also offer alerting mechanisms to monitor long-running jobs. These tools provide more comprehensive job management features and can be integrated with Airflow.