Created on 02-27-2018 11:20 AM - edited 09-16-2022 05:54 AM
Hi all,
I have read in the Mahout Installation docs that it was deprecated since CDH 5.5 and it will be removed at CDH 6.0 as I see in the Deprecated items.
Any idea about the why? and the alternatives?
Thanks in advance.
Created 02-27-2018 04:59 PM
Briefly: Because Apache Spark is a capable - and far more powerful - replacement.
Created 02-27-2018 04:59 PM
Briefly: Because Apache Spark is a capable - and far more powerful - replacement.
Created on 02-28-2018 07:54 AM - edited 02-28-2018 07:55 AM
Thanks @Matt Brandwein.
From a computational force point of view, yes (knowing that Mahout also uses Spark),but what about the Mahout algorithms library? is Spark has some libriries too?
Created 03-01-2018 10:22 AM
Created 06-20-2018 01:55 PM
As you mentioned correctly Apache Spark is offering MlLib (or ML) which it comes with a set of features for some basic NLP, most popular algorithms for clustering and classifications, etc.
But that is not all! You can use many libraries which are released to complete Spark in a domain of Machine Learning and Deep Learning. Basically, these libraries are using Spark APIs and Engine.
You can have a look here (or other lists): https://github.com/awesome-spark/awesome-spark#machine-learning-extension