Reply
Highlighted
Expert Contributor
Posts: 125
Registered: ‎07-17-2017
Accepted Solution

Why the Apache Mahout is deprecated and what's the alternatives ?!

Hi all,

I have read in the Mahout Installation docs that it was deprecated since CDH 5.5 and it will be removed at CDH 6.0 as I see in the Deprecated items.
Any idea about the why? and the alternatives?

Thanks in advance.

Cloudera Employee
Posts: 4
Registered: ‎07-29-2013

Re: Why the Apache Mahout is deprecated and what's the alternatives ?!

Briefly: Because Apache Spark is a capable - and far more powerful - replacement.

--
Matt Brandwein
@mattbrandwein
Expert Contributor
Posts: 125
Registered: ‎07-17-2017

Re: Why the Apache Mahout is deprecated and what's the alternatives ?!

[ Edited ]

Thanks @Matt Brandwein.

From a computational force point of view, yes (knowing that Mahout also uses Spark),but what about the Mahout algorithms library? is Spark has some libriries too?

Expert Contributor
Posts: 125
Registered: ‎07-17-2017

Re: Why the Apache Mahout is deprecated and what's the alternatives ?!

In fact, the answer is yes, I found that Spark has the MLlib Library for Machine Learning algorithms, and it seems great.
Contributor
Posts: 33
Registered: ‎11-04-2016

Re: Why the Apache Mahout is deprecated and what's the alternatives ?!

As you mentioned correctly Apache Spark is offering MlLib (or ML) which it comes with a set of features for some basic NLP, most popular algorithms for clustering and classifications, etc.

 

But that is not all! You can use many libraries which are released to complete Spark in a domain of Machine Learning and Deep Learning. Basically, these libraries are using Spark APIs and Engine.

 

You can have a look here (or other lists): https://github.com/awesome-spark/awesome-spark#machine-learning-extension

Announcements