<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Pros and cons of using EMR vs Cloudbreak for launching hadoop cluster on AWS in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145141#M44276</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/12076/obaidsalikeen.html" nodeid="12076"&gt;@Obaid Salikeen&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Pros:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Multiple cloud provider support (ypu can deploy clusters using the same interface to different providers)&lt;/LI&gt;&lt;LI&gt;You can use it even on private cloud e.g OpenStack&lt;/LI&gt;&lt;LI&gt;Cloudbreak and HDP is open source&lt;/LI&gt;&lt;LI&gt;Cloudbreak installs Ambari, what you can use to monitor or customise your cluster after deployment (e.g. add new services)&lt;/LI&gt;&lt;LI&gt;It comes with fully configured SaltStack what you can use to manage your VMs e.g apply security patches&lt;/LI&gt;&lt;LI&gt;More flexible since you can create your own &lt;A href="https://cwiki.apache.org/confluence/display/AMBARI/Blueprints"&gt;Blueprint&lt;/A&gt; which can contains only those services what you need&lt;/LI&gt;&lt;LI&gt;Cloudbreak supports autoscaling based on metrics gathered from Ambari (e.g some of those metrics are very general e.g. disk space others are Hadoop specific e.g. pending YARN containers)&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Cons:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;need one more instance where Cloudbreak is running (of course one Cloudbreak can manage multiple clusters)&lt;/LI&gt;&lt;LI&gt;Cloudbreak is a cluster management tool and you cannot submit jobs through it. Something like &lt;A href="https://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/AddingStepstoaJobFlow.html"&gt;steps&lt;/A&gt; in EMR is not supported &lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Disclaimer: I am an engineer working on Cloudbreak&lt;/P&gt;&lt;P&gt;Attila&lt;/P&gt;</description>
    <pubDate>Wed, 26 Oct 2016 04:23:32 GMT</pubDate>
    <dc:creator>akanto</dc:creator>
    <dc:date>2016-10-26T04:23:32Z</dc:date>
    <item>
      <title>Pros and cons of using EMR vs Cloudbreak for launching hadoop cluster on AWS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145138#M44273</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I am a newbie to HDP and cloudbreak. I want to move some of our onsite Hadoop clusters/jobs on AWS. Two solutions that I have came-across are Cloudbreak and EMR, however not sure which one to use.&lt;/P&gt;&lt;P&gt;I wanted to know which technology to use for launching hadoop jobs on AWS? Pros and cons of using either approach would be really helpful (interms of cost, ease of use, monitoring, metrics, latency etc). One apparent cost optimization feature that I am interested in : is to launch the cluster whenever a job or jobs needs to run, and kill the cluster/nodes whenever there are no more jobs to execute.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Obaid&lt;/P&gt;</description>
      <pubDate>Sun, 23 Oct 2016 15:04:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145138#M44273</guid>
      <dc:creator>obaid_salikeen</dc:creator>
      <dc:date>2016-10-23T15:04:38Z</dc:date>
    </item>
    <item>
      <title>Re: Pros and cons of using EMR vs Cloudbreak for launching hadoop cluster on AWS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145139#M44274</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/12076/obaidsalikeen.html" nodeid="12076"&gt;@Obaid Salikeen&lt;/A&gt;, You may also consider using Hortonworks Data Cloud (currently in technical preview stage. See &lt;A href="http://hortonworks.github.io/hdp-aws/" target="_blank"&gt;http://hortonworks.github.io/hdp-aws/&lt;/A&gt;. &lt;/P&gt;</description>
      <pubDate>Tue, 25 Oct 2016 03:19:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145139#M44274</guid>
      <dc:creator>Dominika</dc:creator>
      <dc:date>2016-10-25T03:19:03Z</dc:date>
    </item>
    <item>
      <title>Re: Pros and cons of using EMR vs Cloudbreak for launching hadoop cluster on AWS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145140#M44275</link>
      <description>&lt;P&gt;Thanks @&lt;A href="https://community.hortonworks.com/users/10146/dbialek.html"&gt;Dominika B&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Thanks for sharing the link, seems interesting.&lt;/P&gt;&lt;P&gt;So I have a very basic question: Amazon EMR lets you launch manage Hadoop and Spark clusters, so what would be the benefit of using Hortonworks cloud vs just using EMR?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Obaid&lt;/P&gt;</description>
      <pubDate>Tue, 25 Oct 2016 06:22:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145140#M44275</guid>
      <dc:creator>obaid_salikeen</dc:creator>
      <dc:date>2016-10-25T06:22:04Z</dc:date>
    </item>
    <item>
      <title>Re: Pros and cons of using EMR vs Cloudbreak for launching hadoop cluster on AWS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145141#M44276</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/12076/obaidsalikeen.html" nodeid="12076"&gt;@Obaid Salikeen&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Pros:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Multiple cloud provider support (ypu can deploy clusters using the same interface to different providers)&lt;/LI&gt;&lt;LI&gt;You can use it even on private cloud e.g OpenStack&lt;/LI&gt;&lt;LI&gt;Cloudbreak and HDP is open source&lt;/LI&gt;&lt;LI&gt;Cloudbreak installs Ambari, what you can use to monitor or customise your cluster after deployment (e.g. add new services)&lt;/LI&gt;&lt;LI&gt;It comes with fully configured SaltStack what you can use to manage your VMs e.g apply security patches&lt;/LI&gt;&lt;LI&gt;More flexible since you can create your own &lt;A href="https://cwiki.apache.org/confluence/display/AMBARI/Blueprints"&gt;Blueprint&lt;/A&gt; which can contains only those services what you need&lt;/LI&gt;&lt;LI&gt;Cloudbreak supports autoscaling based on metrics gathered from Ambari (e.g some of those metrics are very general e.g. disk space others are Hadoop specific e.g. pending YARN containers)&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Cons:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;need one more instance where Cloudbreak is running (of course one Cloudbreak can manage multiple clusters)&lt;/LI&gt;&lt;LI&gt;Cloudbreak is a cluster management tool and you cannot submit jobs through it. Something like &lt;A href="https://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/AddingStepstoaJobFlow.html"&gt;steps&lt;/A&gt; in EMR is not supported &lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Disclaimer: I am an engineer working on Cloudbreak&lt;/P&gt;&lt;P&gt;Attila&lt;/P&gt;</description>
      <pubDate>Wed, 26 Oct 2016 04:23:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145141#M44276</guid>
      <dc:creator>akanto</dc:creator>
      <dc:date>2016-10-26T04:23:32Z</dc:date>
    </item>
    <item>
      <title>Re: Pros and cons of using EMR vs Cloudbreak for launching hadoop cluster on AWS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145142#M44277</link>
      <description>&lt;P&gt;Thanks a lot &lt;A rel="user" href="https://community.cloudera.com/users/488/akanto.html" nodeid="488"&gt;@Attila Kanto&lt;/A&gt; for a detailed response,&lt;/P&gt;&lt;P&gt;Let me ask another cost related question, which is an important factor for making a decision on which technology to use: How would you compare EMR vs Cloudbreak (or Hortonworks Data Cloud) in-terms of cost? &lt;/P&gt;&lt;P&gt;Obaid&lt;/P&gt;</description>
      <pubDate>Sun, 30 Oct 2016 01:20:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145142#M44277</guid>
      <dc:creator>obaid_salikeen</dc:creator>
      <dc:date>2016-10-30T01:20:38Z</dc:date>
    </item>
    <item>
      <title>Re: Pros and cons of using EMR vs Cloudbreak for launching hadoop cluster on AWS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145143#M44278</link>
      <description>&lt;P&gt;Sorry, but I do not have such comparison.&lt;/P&gt;&lt;P&gt;Attila&lt;/P&gt;</description>
      <pubDate>Sun, 30 Oct 2016 02:55:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145143#M44278</guid>
      <dc:creator>akanto</dc:creator>
      <dc:date>2016-10-30T02:55:34Z</dc:date>
    </item>
    <item>
      <title>Re: Pros and cons of using EMR vs Cloudbreak for launching hadoop cluster on AWS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145144#M44279</link>
      <description>&lt;P&gt;sure, no problem&lt;/P&gt;</description>
      <pubDate>Thu, 03 Nov 2016 20:35:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pros-and-cons-of-using-EMR-vs-Cloudbreak-for-launching/m-p/145144#M44279</guid>
      <dc:creator>obaid_salikeen</dc:creator>
      <dc:date>2016-11-03T20:35:17Z</dc:date>
    </item>
  </channel>
</rss>

