Member since 08-03-2022 | 14 Posts | 0 Kudos Received | 0 Solutions
11-30-2023
07:49 AM
Nice article! Where can we find the templates in Cloudera Manager? You mention "we’re offering you JSON files of custom Cloudera Manager dashboards to import into your CM-managed cluster", but do not say where customers can get the template files. Do they require a separate license? It would be great to add the location of the JSON files so readers can make better use of the dashboards and this article. Thanks!
05-10-2023
08:55 AM
In a series of articles, Cloudera’s Data Analytics team shares how they started their journey towards operational excellence. These articles provide guidelines on how you can leverage Cloudera and supporting tool APIs to harness platform and process data. But technology is just a facilitator: inside these articles you will find a methodology for gaining insights into your platform and the processes running on it, for customizing those insights to your particular ecosystem, and for using them to achieve operational excellence. The modern CDP platform, built on an open source technology stack, provides great flexibility to harvest platform data, and it is up to us to use that data to build meaningful and actionable insights. From one operational data practitioner to another, these articles are a must-read.
Here are the links to the articles in this series.
# | Topic | Link Address
1 | Gaining Control of Your CDP Environment | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Gaining-Control-of-Your-CDP-Environment/ta-p/369657
2 | Visibility and Transparency | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Visibility-and-Transparency/ta-p/369919
2.1 | Deploy your Cloudera Manager Dashboards | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Deploy-your-Cloudera-Manager-Dashboards/ta-p/369923
2.2 | CDSW Monitoring - Operational Data Engineering Process Monitoring | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/CDSW-Monitoring-Operational-Data-Engineering-Process/ta-p/370005
2.3 | Operational Data Engineering Process Monitoring - Transparency and Visibility | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Operational-Data-Engineering-Process-Monitoring-Transparency/ta-p/370013
2.4 | Yarn/Spark jobs Monitoring Implementation Steps - Operational Data Engineering Process Monitoring | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Yarn-Spark-jobs-Monitoring-Implementation-Steps-nbsp/ta-p/370015
2.5 | Airflow jobs Monitoring Implementation Steps - Operational Data Engineering Process Monitoring | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Airflow-jobs-Monitoring-Implementation-Steps-nbsp/ta-p/370017
2.6 | Cron jobs Monitoring Implementation Steps - Operational Data Engineering Process Monitoring | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Cron-jobs-Monitoring-Implementation-Steps-nbsp-Operational/ta-p/370022
2.7 | NiFi Job Monitoring Implementation Steps - Operational Data Engineering Process Monitoring | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/NiFi-Job-Monitoring-Implementation-Steps-Operational-Data/ta-p/370023
2.8 | Handling Impala Critical Query Exceptions/Failures - Transparency and Visibility | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Handling-Impala-Critical-Query-Exceptions-Failures/ta-p/370048
2.9 | Monitoring Impala Usage - Transparency and Visibility | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Monitoring-Impala-Usage-Transparency-and-Visibility/ta-p/370058
2.10 | Leveraging Cloudera Manager Cluster Generated Alerts To Gain Control Of Your CDP Environment - Transparency and Visibility | https://community.cloudera.com/t5/Cloudera-Data-Analytics-CDA/Leveraging-Cloudera-Manager-Cluster-Generated-Alerts-To-Gain/ta-p/370040
05-05-2023
12:42 AM
Thanks Raj.
05-04-2023
01:05 AM
Transparency and Visibility - Operational Data Engineering Process Monitoring - NiFi jobs Monitoring Implementation Steps
Part of the Visibility and Transparency series, this article provides steps for implementing a monitoring and alerting system for NiFi processes and workloads. The framework uses Apache Hive, Apache NiFi, and BI visualization tools. Its features include customized dashboards for insights and alert mechanisms.
The attachment to this article provides the details of the implementation steps.
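As a rough illustration of mining NiFi application logs for failing processors, the sketch below counts ERROR lines per logger. It is not taken from the attachment. It assumes the log lines follow the pattern `date level [thread] logger message` (the layout of a default `nifi-app.log`, which may differ if logback has been customized), and the function name and sample lines are hypothetical.

```python
import re
from collections import Counter

# Assumed line layout: "2023-05-04 01:05:00,123 ERROR [thread] logger message"
LOG_LINE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) "
    r"(?P<level>[A-Z]+) \[(?P<thread>[^\]]+)\] "
    r"(?P<logger>\S+) (?P<msg>.*)$"
)


def errors_by_logger(lines):
    """Count ERROR entries per logger; lines that do not match are skipped."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line.rstrip("\n"))
        if match and match.group("level") == "ERROR":
            counts[match.group("logger")] += 1
    return dict(counts)


# Hypothetical sample lines, including a stack-trace continuation line.
SAMPLE_LINES = [
    "2023-05-04 01:05:00,123 ERROR [Timer-Driven Process Thread-4] "
    "o.a.n.p.standard.InvokeHTTP InvokeHTTP[id=abc] request failed",
    "2023-05-04 01:05:01,001 INFO [main] o.a.nifi.NiFi started",
    "\tat java.lang.Thread.run(Thread.java:748)",
    "2023-05-04 01:06:00,500 ERROR [Timer-Driven Process Thread-2] "
    "o.a.n.p.standard.InvokeHTTP InvokeHTTP[id=abc] request failed again",
    "2023-05-04 01:07:00,000 WARN [Timer-Driven Process Thread-2] "
    "o.a.n.p.standard.PutSQL PutSQL[id=def] retrying",
]

if __name__ == "__main__":
    print(errors_by_logger(SAMPLE_LINES))
```

In practice the counts would be loaded into a table (the article names Apache Hive) so that a BI tool can chart them and an alert can fire when a threshold is crossed.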
05-04-2023
12:59 AM
Part of the Visibility and Transparency series, this article provides steps for implementing an alerting system for Cron processes. The framework uses Apache Hive, Python, and Airflow. Its features include an alert mechanism.
The attachment to this article details the implementation steps.
05-04-2023
12:39 AM
Part of the Visibility and Transparency series, this article provides steps for implementing a monitoring and alerting system for Airflow processes and workloads. The framework uses Apache Hive, Python, and BI visualization tools. Its features include customized dashboards for insights and an alert mechanism.
The attachment to this article details the implementation steps.
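To give a feel for what a Python script reading the Airflow database might look like, here is a minimal sketch that counts failed runs per DAG. It is not the attachment's implementation. It assumes a `dag_run` table with `dag_id`, `state`, and `execution_date` columns (as in Airflow 2.x metadata databases; newer versions may name the date column differently) and uses an in-memory SQLite database as a stand-in for the real metadata store. The function names and sample rows are hypothetical.

```python
import sqlite3


def failed_runs_per_dag(conn):
    """Return {dag_id: failed_run_count} from an Airflow-style dag_run table."""
    cursor = conn.execute(
        "SELECT dag_id, COUNT(*) FROM dag_run "
        "WHERE state = 'failed' GROUP BY dag_id ORDER BY dag_id"
    )
    return dict(cursor.fetchall())


def build_demo_db():
    """Create an in-memory stand-in for the Airflow metadata database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE dag_run (dag_id TEXT, state TEXT, execution_date TEXT)"
    )
    # Hypothetical sample rows for illustration only.
    conn.executemany(
        "INSERT INTO dag_run VALUES (?, ?, ?)",
        [
            ("ingest_daily", "success", "2023-05-01"),
            ("ingest_daily", "failed", "2023-05-02"),
            ("ingest_daily", "failed", "2023-05-03"),
            ("report_weekly", "success", "2023-05-01"),
            ("export_hourly", "failed", "2023-05-03"),
        ],
    )
    return conn


if __name__ == "__main__":
    print(failed_runs_per_dag(build_demo_db()))
```

Against a real deployment, the connection would point at the Airflow metadata database (commonly PostgreSQL or MySQL) through the matching DB-API driver, and the query itself would stay much the same.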
05-04-2023
12:17 AM
Part of the Visibility and Transparency series, this article provides steps for implementing a monitoring and alerting system for Yarn/Spark processes and workloads. The framework uses Apache Hive, Apache NiFi, and BI visualization tools. Its features include customized visualization dashboards for insights and alert mechanisms.
The attachment to this article details the implementation steps.
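As an illustration of checking Yarn job status, the sketch below filters the application list returned by the YARN ResourceManager REST API (`/ws/v1/cluster/apps`) down to applications that failed or were killed. It is not the attachment's implementation, which per the article is built around NiFi. The ResourceManager address passed to `fetch_apps` is an assumption, a Kerberized cluster would additionally need SPNEGO authentication, and the sample payload is hypothetical.

```python
import json
import urllib.request


def fetch_apps(rm_base_url):
    """Fetch the application list from the ResourceManager REST API.

    rm_base_url is an assumption, e.g. "http://rm-host:8088".
    """
    with urllib.request.urlopen(f"{rm_base_url}/ws/v1/cluster/apps") as resp:
        return json.load(resp)


def unsuccessful_apps(payload):
    """Return (id, name, finalStatus) for apps that failed or were killed."""
    # The API returns {"apps": null} when there are no applications.
    apps = (payload.get("apps") or {}).get("app") or []
    return [
        (app["id"], app["name"], app["finalStatus"])
        for app in apps
        if app.get("finalStatus") in ("FAILED", "KILLED")
    ]


# Hypothetical response fragment, trimmed to the fields used above.
SAMPLE = {
    "apps": {
        "app": [
            {"id": "application_1_0001", "name": "etl_load",
             "state": "FINISHED", "finalStatus": "SUCCEEDED"},
            {"id": "application_1_0002", "name": "spark_agg",
             "state": "FINISHED", "finalStatus": "FAILED"},
            {"id": "application_1_0003", "name": "adhoc_query",
             "state": "KILLED", "finalStatus": "KILLED"},
        ]
    }
}

if __name__ == "__main__":
    for app_id, name, status in unsuccessful_apps(SAMPLE):
        print(app_id, name, status)
```

Keeping the parsing separate from the HTTP call makes the filter easy to reuse when the JSON arrives by another route, such as a NiFi flow.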
05-04-2023
12:01 AM
These steps will enable DevOps and Data Management teams to…
Gain near real-time visibility into day-to-day operational data engineering processes.
Automate manual monitoring activities and suppress noise.
Build near real-time visualizations to monitor those processes and trends using the Cloudera Manager (CM) API, NiFi pipelines, and NiFi APIs.
Find hidden patterns in process issues to troubleshoot, diagnose, and quickly act on failing, troublesome, and resource-intensive processes.
Mine for insights in YARN, Cloudera Data Science Workbench (CDSW), Airflow, and NiFi processor logs and databases.
Improve platform optimization and control user behaviors.
Become more proactive with incident resolution.
We will provide you with a framework that runs locally on your cluster and is customizable, in contrast to telemetry tools that require data to be sent outside the cluster and may therefore be restricted by security policy. The framework provides insights into the processes that trigger workloads on the Cloudera Private Cloud Base platform, including:
KPI Monitoring Framework | Scheduling / ETL Tool | Workloads Triggered | API / Data Source Used
Airflow Jobs Monitor | Airflow - Python, Spark | Yarn jobs status | Python script - Airflow database
CDSW Jobs Monitor | CDSW - Python | CDSW jobs status | Python script - CDSW database
NiFi Processor Monitor | NiFi - Python | NiFi processor status | NiFi Flow - NiFi app logs
Spark/Yarn Jobs Monitor | NiFi - Python | Spark/Yarn jobs status | NiFi Flow - Python script
Tools Required
Solution Pattern
These steps will guide you to implement monitoring and alerting mechanisms into your engineering service processes and build the dashboards.
Refer to the following links for articles on the implementation steps for each monitoring process:
Airflow Monitoring
CDSW Monitoring
NiFi Monitoring
Spark and Yarn Monitoring
Cron Jobs Monitoring
Keyword Search
Transparency, Visibility, Log data, Database data, Monitor AirFlow, Monitor CDSW, Monitor SPARK, Monitor NiFi Processor, Manage AirFlow, Manage CDSW, Manage SPARK, Manage NiFi Processor, Actionable insights, Operational processes, Data engineering processes, Yarn, Cloudera Manager (CM) API, NiFi Process logs, Visualize failures, Visualize jobs, Visualize processes.
05-03-2023
11:30 PM
Part of the Visibility and Transparency series, this article provides steps for implementing a monitoring and alerting system for Cloudera Data Science Workbench processes and workloads. The framework uses Apache Hive, Apache Airflow, Bash shell scripts, and BI visualization tools. Its features include a customized visualization dashboard for insights and alert mechanisms.
The attachment to this article details the implementation steps.
05-03-2023
12:28 AM
Summary
Out of the box, Cloudera Manager provides plenty of valuable health checks that highlight platform and service issues needing attention, along with an intuitive interface for troubleshooting them. Nevertheless, the more pipelines and processes the platform supports (especially at peak loads/times), the more likely you are to benefit from additional visibility into overall cluster performance. To that end, we’re offering you JSON files of custom Cloudera Manager dashboards to import into your CM-managed cluster.
The attachment to this article details steps for creating and deploying Cloudera Manager dashboards.
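Cloudera Manager charts are driven by tsquery statements, and the CM REST API exposes a time-series endpoint that accepts the same kind of query. As a minimal sketch of how a metric behind a dashboard could also be pulled programmatically, the helper below builds such a request URL. The host, port, API version, and the tsquery text are assumptions for illustration and are not taken from the attached dashboards. The request itself would also need CM credentials.

```python
from urllib.parse import urlencode


def timeseries_url(base_url, api_version, tsquery, start=None, end=None):
    """Build a Cloudera Manager time-series API URL for a tsquery statement.

    base_url and api_version are deployment-specific assumptions,
    e.g. "https://cm.example.com:7183" and "v41".
    """
    params = {"query": tsquery}
    if start:
        params["from"] = start  # ISO 8601 timestamp
    if end:
        params["to"] = end      # ISO 8601 timestamp
    return f"{base_url.rstrip('/')}/api/{api_version}/timeseries?{urlencode(params)}"


if __name__ == "__main__":
    # Illustrative query; replace with the tsquery behind the chart of interest.
    print(timeseries_url("https://cm.example.com:7183", "v41", "select cpu_percent"))
```

The response is JSON, so it can be landed in the same store as the other monitoring data if a metric needs to be tracked outside the CM console.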
Keyword Search
Deploy Cloudera Manager Dashboards, Free Manager Dashboards, Cloudera Manager Troubleshooting, Monitor CDP Cluster, Proactively resolve Cloudera Manager alerts, Cloudera Manager Monitoring, JSON, Health monitoring, Impala health monitoring, YARN health monitoring, HBase tuning, HDFS caching, Datanode health monitoring, HDFS NN & JN health monitoring, Hive health monitoring, Impala monitor coordinators, Infra-Solr health monitoring, Solr health monitoring, HDFS JVM GC Sizing, YARN JVM GC Sizing, MGMT Cluster JVM GC Sizing, ZK JVM GC Sizing, Oozie JVM GC Sizing, Hive JVM GC Sizing