Created 12-16-2015 11:44 AM
Hi,
I have to choose between cloudera, hortonworks and mapR.
And i don't know how can i test the performance between those distributions.
After choosing a distribution i have to work with spark and extract data from social networks . So should i just test algorithms with spark in each distribution?
Any help?
Thanks in advance
Created 12-16-2015 05:38 PM
Iobna,
Every vendor ships Apache Spark, so there isn't a difference there.
But there are other differences you can use to evaluate which vendor you choose
There are many other factors to consider, but this should give you some ideas.
Thanks, Vinay
Created 12-16-2015 11:46 AM
@lobna tonn Hi Lobna, I highly recommend to do more research on the business model and core technology of these vendors. You can start with a POC (prepare a use case) to load your own data or start with https://github.com/hortonworks/hive-testbench once cluster is up. My linkedin address is in my profile. Please feel free to add me and we can talk about it.
Created 12-16-2015 11:54 AM
thank you for your reply, do you think that there is no big performance difference between them ? I know that CDH enable native acceleration for some mathematical operations in Spark MLlib, and that it ships the most recent spark version (1.5). should i test for example wordcount algorithm with spark in each distribution ?
Created 12-16-2015 11:57 AM
@lobna tonn Please see this https://hortonworks.com/press-releases/hortonworks-accelerates-spark-at-scale-for-the-enterprise/
Spark 1.5.2 is part of HDP stack. You can try running wordcount but I highly recommend to look into the core business model too.
Created 12-16-2015 12:59 PM
in the core business model it says that HDP ships Spark 1.3 in HDP 2.3 with a beta preview of 1.5 .And is there a way to pay for support in HDP ?
Created 12-16-2015 01:39 PM
You get Spark 1.4.1 when you install HDP 2.3.2 but if you want to upgrade to 1.5.2 then we can help you on that.
Core business model means 100% open source. Read this
Created 12-16-2015 05:38 PM
Iobna,
Every vendor ships Apache Spark, so there isn't a difference there.
But there are other differences you can use to evaluate which vendor you choose
There are many other factors to consider, but this should give you some ideas.
Thanks, Vinay
Created 02-12-2016 02:19 PM
@lobna tonn how are your tests doing? Did you decide?