Support Questions

Find answers, ask questions, and share your expertise

can't get critical alerts from a dashboard

avatar
Explorer

Hello!
I'm using CM api v19
I'm trying to get only critical alerts which I'm looking at right now (RED ones).
image001.png
Tried https://cloudera.github.io/cm_api/apidocs/v19/path__events.html

 

 

/api/v19/events?query=category==HEALTH_EVENT;severity=CRITICAL

 

 

 The output consists of too many not relevant events. What should I do to get only critical events displayed on the dashboard?
Thank you.

2 ACCEPTED SOLUTIONS

avatar
Super Guru

Hi, @hamsterrrrr ,

 

Could you please check (and share) status for each one of the roles in the HDFS service?

The endpoint /api/v40/clusters/cluster/services/hdfs/roles  should give you this data.

I'd like to understand what's the underlying issue in this case.

 

André

 

--
Was your question answered? Please take some time to click on "Accept as Solution" below this post.
If you find a reply useful, say thanks by clicking on the thumbs up button.

View solution in original post

avatar
Explorer

Thank you! Looks like what I wanted.

View solution in original post

5 REPLIES 5

avatar
Super Guru

Hi, @hamsterrrrr ,

 

Those events represent the point-in-time occurrence when the alert was thrown. They don't tell you the current issues with the cluster, which is what you see in the UI.

I don't think the REST API has an endpoint that gives you that list directly, but it's pretty easy to find out.

 

In the results of the /clusters/{cluster}/services endpoint, look for services with the status = BAD or health checks with summary = BAD. For example:

{
  "items": [
    {
      "name": "impala",
      "type": "IMPALA",
      "healthSummary": "BAD",                 <<--- this
      "healthChecks": [
        ...
        {
          "name": "IMPALA_IMPALADS_HEALTHY",
          "summary": "BAD",                   <<--- or this
          "suppressed": false
        },
        ...
      ],
      ...
    }
  ]
}

 

Cheers,

André

--
Was your question answered? Please take some time to click on "Accept as Solution" below this post.
If you find a reply useful, say thanks by clicking on the thumbs up button.

avatar
Explorer

Hello, André
Thanky you for yoir quick reply.
So you are saying that by using api there is no way to find out what exactly is going on with claster? All we can get is just summary like GOOD\BAD of each service? And based on this we should conduct the further investigation by means of GUI?

avatar
Explorer

I'm saying that because the command you are talking about says that overall state of the service HDFS is good, all components of it are GOOD, while in the GUI I still can see one red red Bad health Data DIrectory  Status bad health event, while overall status of HDFS is shown as GOOD(green).
Long story short, I just want to get this event via API call. Is that possible?

overall service state is good, but there is a critical eventoverall service state is good, but there is a critical eventthis critical  eventthis critical event

clusters/cluster/services/hdfs'|jq .

{

  "name": "hdfs",

  "type": "HDFS",

  "clusterRef": {

    "clusterName": "cluster"

  },

  "serviceUrl,

  "roleInstancesUrl":,

  "serviceState": "STARTED",

  "healthSummary": "GOOD",

  "healthChecks": [

    {

      "name": "HDFS_BLOCKS_WITH_CORRUPT_REPLICAS",

      "summary": "GOOD",

      "suppressed": false

    },

    {

      "name": "HDFS_CANARY_HEALTH",

      "summary": "GOOD",

      "suppressed": false

    },

    {

      "name": "HDFS_DATA_NODES_HEALTHY",

      "summary": "GOOD",

      "suppressed": false

    },

    {

      "name": "HDFS_FAILOVER_CONTROLLERS_HEALTHY",

      "summary": "GOOD",

      "suppressed": false

    },

    {

      "name": "HDFS_FREE_SPACE_REMAINING",

      "summary": "GOOD",

      "suppressed": false

    },

    {

      "name": "HDFS_HA_NAMENODE_HEALTH",

      "summary": "GOOD",

      "suppressed": false

    },

    {

      "name": "HDFS_MISSING_BLOCKS",

      "summary": "GOOD",

      "suppressed": false

    },

    {

      "name": "HDFS_UNDER_REPLICATED_BLOCKS",

      "summary": "GOOD",

      "suppressed": false

    }

  ],

  "configStalenessStatus": "FRESH",

  "clientConfigStalenessStatus": "FRESH",

  "maintenanceMode": false,

  "maintenanceOwners": [],

  "displayName": "HDFS",

  "entityStatus": "GOOD_HEALTH"

}


 

avatar
Super Guru

Hi, @hamsterrrrr ,

 

Could you please check (and share) status for each one of the roles in the HDFS service?

The endpoint /api/v40/clusters/cluster/services/hdfs/roles  should give you this data.

I'd like to understand what's the underlying issue in this case.

 

André

 

--
Was your question answered? Please take some time to click on "Accept as Solution" below this post.
If you find a reply useful, say thanks by clicking on the thumbs up button.

avatar
Explorer

Thank you! Looks like what I wanted.