Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Root Cause Analysis using Machine Learning in Spark ML

Solved Go to solution

Root Cause Analysis using Machine Learning in Spark ML

New Contributor
 
1 ACCEPTED SOLUTION

Accepted Solutions

Re: Root Cause Analysis using Machine Learning in Spark ML

Cloudera Employee

Do you have a target variable that you can predict? Or do you have logic that will allow you to convert a "low" CPU value into a target variable?

Spark has a wide variety of models that are available for classification modeling: https://spark.apache.org/docs/latest/mllib-classification-regression.html

If you are interested in seeing which factor is contributing to a specific instance, I would recommend starting with a logistic regression model as that will provide more explanatory power -- providing more insight into which factor is contributing to a particular CPU failure

1 REPLY 1

Re: Root Cause Analysis using Machine Learning in Spark ML

Cloudera Employee

Do you have a target variable that you can predict? Or do you have logic that will allow you to convert a "low" CPU value into a target variable?

Spark has a wide variety of models that are available for classification modeling: https://spark.apache.org/docs/latest/mllib-classification-regression.html

If you are interested in seeing which factor is contributing to a specific instance, I would recommend starting with a logistic regression model as that will provide more explanatory power -- providing more insight into which factor is contributing to a particular CPU failure

Don't have an account?
Coming from Hortonworks? Activate your account here