Created on 09-26-2022 12:05 AM - last edited on 09-26-2022 01:17 AM by VidyaSargur
I enabled Cruise Control for our Kafka cluster to perform self-healing.
My expectation is to get a notification whenever Cruise Control performs an action; otherwise I can't tell whether CC is in action or not. Is there any provision for this?
Created 09-28-2022 02:41 AM
- You should check whether the Cruise Control endpoints are working as expected.
- Automatic rebalancing is DISABLED by default, but it can be enabled by adding self.healing.enabled=true to the cruisecontrol.properties advanced configuration snippet in the CM UI. You can also enable self-healing only for specific types of events, such as broker failures, disk failures, metric anomalies, or goal violations.
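For reference, a sketch of what that advanced snippet might contain. The per-anomaly property names below come from upstream (LinkedIn) Cruise Control and should be verified against the version shipped with your distribution:

```properties
# Advanced configuration snippet for cruisecontrol.properties (sketch;
# verify property names against your Cruise Control version).
# Global switch:
self.healing.enabled=true
# Or enable self-healing per anomaly type instead of globally:
self.healing.broker.failure.enabled=true
self.healing.goal.violation.enabled=true
self.healing.disk.failure.enabled=true
self.healing.metric.anomaly.enabled=true
```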
More properties are documented at the link below.
And below are the REST APIs you can use on CC even without enabling self.healing.
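As an illustration of how those REST calls are addressed, here is a minimal sketch that builds the URLs for the commonly used `state` and `rebalance` endpoints. It assumes the upstream default base path `/kafkacruisecontrol`; the host and port (`cc-host:9090`) are placeholders for your deployment:

```python
# Sketch of Cruise Control REST endpoint URLs; "cc-host:9090" is a
# placeholder -- point it at your own Cruise Control instance.
from urllib.parse import urlencode

CC = "http://cc-host:9090/kafkacruisecontrol"

def cc_url(endpoint: str, **params: str) -> str:
    """Build the full URL for a Cruise Control REST endpoint."""
    query = urlencode(params)
    return f"{CC}/{endpoint}" + (f"?{query}" if query else "")

# State of the load monitor, executor, and anomaly detector (GET):
print(cc_url("state"))
# Rebalance proposal only -- no data is moved (POST):
print(cc_url("rebalance", dryrun="true"))
# Execute the rebalance (POST):
print(cc_url("rebalance", dryrun="false"))
```

Running a `dryrun=true` call first and reviewing the proposal before executing is the usual practice.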
Created 09-28-2022 03:17 AM
Self-healing is already enabled. When a broker goes down, CC rebalances its partitions to the healthy nodes, but when that node comes back up, redistribution does not happen automatically. I have to run the rebalance API manually.
Created 09-28-2022 11:56 AM
Please let us know what type of rebalancing you are looking for when a broker comes back online:
- a full disk/partition rebalance from the other brokers onto the returning broker, or
- only a leader rebalance?
Created 09-28-2022 12:45 PM
I expect both leaders and replicas to be rebalanced. In the example below, when broker 1111 went down, CC self-healed (distributed leaders/partitions automatically to the other nodes). When node 1111 came back, redistribution did not happen automatically.
BROKER LEADER(S) REPLICAS OUT-OF-SYNC OFFLINE IS_CONTROLLER
1234 36 118 0 0 false
5678 44 123 0 0 false
1111 0 0 0 0 false
3333 44 126 0 0 false
4444 44 126 0 0 true
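In upstream Cruise Control, self-healing reacts to broker failures, while moving replicas back onto a broker that has rejoined is a separate, on-demand operation via the `add_broker` endpoint. A sketch of how that request is formed; the host/port is a placeholder, and 1111 is the returning broker id from the table above:

```python
# Sketch: build the add_broker request URL to move replicas back onto a
# rejoined broker. "cc-host:9090" is a placeholder for your CC endpoint.
from urllib.parse import urlencode

CC = "http://cc-host:9090/kafkacruisecontrol"

params = urlencode({"brokerid": "1111", "dryrun": "true"})
add_broker_url = f"{CC}/add_broker?{params}"

# Issue this as a POST (e.g. curl -X POST "<url>"). Start with
# dryrun=true to review the proposal, then repeat with dryrun=false.
print(add_broker_url)
```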
Created 10-07-2022 08:37 AM
It looks like you have a case open for the same issue in the Cloudera support system, and we are reproducing and investigating it on priority. We will share a workaround in the case, and here in this community thread as well, so that it can help other users who come across the same issue.