Support Questions

Find answers, ask questions, and share your expertise
Announcements
Check out our newest addition to the community, the Cloudera Data Analytics (CDA) group hub.

Root Cause Analysis using Machine Learning in Spark ML

 
1 ACCEPTED SOLUTION

Cloudera Employee

Do you have a target variable that you can predict? Or do you have logic that will allow you to convert a "low" CPU value into a target variable?

Spark has a wide variety of models that are available for classification modeling: https://spark.apache.org/docs/latest/mllib-classification-regression.html

If you are interested in seeing which factor is contributing to a specific instance, I would recommend starting with a logistic regression model as that will provide more explanatory power -- providing more insight into which factor is contributing to a particular CPU failure

View solution in original post

1 REPLY 1

Cloudera Employee

Do you have a target variable that you can predict? Or do you have logic that will allow you to convert a "low" CPU value into a target variable?

Spark has a wide variety of models that are available for classification modeling: https://spark.apache.org/docs/latest/mllib-classification-regression.html

If you are interested in seeing which factor is contributing to a specific instance, I would recommend starting with a logistic regression model as that will provide more explanatory power -- providing more insight into which factor is contributing to a particular CPU failure

Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.