<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Scaling and Auto-scaling of HDP on AWS and Azure Cloud in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scaling-and-Auto-scaling-of-HDP-on-AWS-and-Azure-Cloud/m-p/105660#M50485</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3333/manikandankannan.html" nodeid="3333"&gt;@learninghuman&lt;/A&gt;&lt;/P&gt;&lt;P&gt;To state it most simply, auto-scaling is currently a capability of Cloudbreak only. With &lt;A href="http://sequenceiq.com/cloudbreak-docs/release-1.6.1/periscope/#auto-scaling"&gt;Cloudbreak Periscope&lt;/A&gt;, you can define a scaling policy and apply it to any alert on any Ambari metric. Scaling granularity is at the Ambari host group level, which gives you the option to scale only specific services or components rather than the whole cluster. Per your line of questioning above, if you use Cloudbreak to provision HDP on either Azure IaaS or AWS IaaS, you can use the auto-scaling capabilities it provides. Both Azure HDInsight (HDI) and Hortonworks Data Cloud for AWS (HDC) make it very easy to manually resize your cluster through their respective consoles, but auto-scaling is not currently a feature of either offering.&lt;/P&gt;&lt;P&gt;With regard to data rebalancing, neither HDI nor HDC needs to be concerned with this, because both are automatically configured to use cloud storage (currently ADLS and S3, respectively) rather than HDFS. For HDP deployed on IaaS with Cloudbreak, auto-scaling may perform an HDFS rebalance, but only after a downscale operation. To keep HDFS healthy during downscale, Cloudbreak always maintains the configured replication factor and makes sure that there is enough space on HDFS to rebalance data. To minimize rebalancing, replication, and HDFS storms during downscale, Cloudbreak checks block locations and computes the least costly operations.&lt;/P&gt;</description>
    <pubDate>Tue, 03 Jan 2017 00:15:35 GMT</pubDate>
    <dc:creator>tmccuch</dc:creator>
    <dc:date>2017-01-03T00:15:35Z</dc:date>
    <item>
      <title>Scaling and Auto-scaling of HDP on AWS and Azure Cloud</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scaling-and-Auto-scaling-of-HDP-on-AWS-and-Azure-Cloud/m-p/105659#M50484</link>
      <description>&lt;P&gt;My understanding, along with my questions, is below:&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;STRONG&gt;AWS-&lt;/STRONG&gt;&lt;B&gt;HDCloud&lt;/B&gt;&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Manual scaling&lt;/EM&gt;&lt;/STRONG&gt; using Ambari or the AWS UI is possible.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Auto Scaling&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;1. Is it possible to auto-scale in this option (while creating the cluster, can I set an auto-scaling group)?&lt;/P&gt;&lt;P&gt;1.1. In that case, how is the data rebalanced? I.e., if a new node is added, then compute may not gain data locality.&lt;/P&gt;&lt;P&gt;--------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;AWS-HDP on IaaS&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Manual scaling&lt;/EM&gt;&lt;/STRONG&gt; using Ambari is possible.&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Auto Scaling-&lt;/STRONG&gt;&lt;STRONG&gt;Without Cloudbreak&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;2. Is it possible to auto-scale in this option (while creating the cluster, can I set an auto-scaling group)?&lt;/P&gt;&lt;P&gt;2.1. In that case, how is the data rebalanced? I.e., if a new node is added, then compute may not gain data locality.&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Auto Scaling-&lt;/STRONG&gt;&lt;STRONG&gt;With Cloudbreak&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;Auto-scaling may be possible, but question 2.1 applies here as well.&lt;/P&gt;&lt;P&gt;--------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Azure-HDInsight&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Manual scaling&lt;/EM&gt;&lt;/STRONG&gt; using Ambari or the Azure UI is possible.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Auto Scaling&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;3. Is it possible to auto-scale in this option (while creating the cluster, can I set an auto-scaling group)?&lt;/P&gt;&lt;P&gt;3.1. In that case, how is the data rebalanced? I.e., if a new node is added, then compute may not gain data locality.&lt;/P&gt;&lt;P&gt;--------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Azure-HDP in Marketplace&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Manual scaling&lt;/EM&gt;&lt;/STRONG&gt; using Ambari or the Azure UI is possible.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Auto Scaling&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;4. Is it possible to auto-scale in this option (while creating the cluster, can I set an auto-scaling group)?&lt;/P&gt;&lt;P&gt;4.1. In that case, how is the data rebalanced? I.e., if a new node is added, then compute may not gain data locality.&lt;/P&gt;&lt;P&gt;--------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Azure-HDP on IaaS&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Same questions as for AWS-HDP on IaaS.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Dec 2016 17:58:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scaling-and-Auto-scaling-of-HDP-on-AWS-and-Azure-Cloud/m-p/105659#M50484</guid>
      <dc:creator>learninghuman</dc:creator>
      <dc:date>2016-12-30T17:58:20Z</dc:date>
    </item>
    <item>
      <title>Re: Scaling and Auto-scaling of HDP on AWS and Azure Cloud</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scaling-and-Auto-scaling-of-HDP-on-AWS-and-Azure-Cloud/m-p/105660#M50485</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3333/manikandankannan.html" nodeid="3333"&gt;@learninghuman&lt;/A&gt;&lt;/P&gt;&lt;P&gt;To state it most simply, auto-scaling is currently a capability of Cloudbreak only. With &lt;A href="http://sequenceiq.com/cloudbreak-docs/release-1.6.1/periscope/#auto-scaling"&gt;Cloudbreak Periscope&lt;/A&gt;, you can define a scaling policy and apply it to any alert on any Ambari metric. Scaling granularity is at the Ambari host group level, which gives you the option to scale only specific services or components rather than the whole cluster. Per your line of questioning above, if you use Cloudbreak to provision HDP on either Azure IaaS or AWS IaaS, you can use the auto-scaling capabilities it provides. Both Azure HDInsight (HDI) and Hortonworks Data Cloud for AWS (HDC) make it very easy to manually resize your cluster through their respective consoles, but auto-scaling is not currently a feature of either offering.&lt;/P&gt;&lt;P&gt;With regard to data rebalancing, neither HDI nor HDC needs to be concerned with this, because both are automatically configured to use cloud storage (currently ADLS and S3, respectively) rather than HDFS. For HDP deployed on IaaS with Cloudbreak, auto-scaling may perform an HDFS rebalance, but only after a downscale operation. To keep HDFS healthy during downscale, Cloudbreak always maintains the configured replication factor and makes sure that there is enough space on HDFS to rebalance data. To minimize rebalancing, replication, and HDFS storms during downscale, Cloudbreak checks block locations and computes the least costly operations.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Jan 2017 00:15:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scaling-and-Auto-scaling-of-HDP-on-AWS-and-Azure-Cloud/m-p/105660#M50485</guid>
      <dc:creator>tmccuch</dc:creator>
      <dc:date>2017-01-03T00:15:35Z</dc:date>
    </item>
    <item>
      <title>Re: Scaling and Auto-scaling of HDP on AWS and Azure Cloud</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scaling-and-Auto-scaling-of-HDP-on-AWS-and-Azure-Cloud/m-p/105661#M50486</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3333/manikandankannan.html" nodeid="3333"&gt;@learninghuman&lt;/A&gt; If this answer helps, please accept it. Otherwise, I'd be happy to answer any remaining questions you have.&lt;/P&gt;&lt;P&gt;Thanks! _Tom&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jan 2017 01:49:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scaling-and-Auto-scaling-of-HDP-on-AWS-and-Azure-Cloud/m-p/105661#M50486</guid>
      <dc:creator>tmccuch</dc:creator>
      <dc:date>2017-01-04T01:49:59Z</dc:date>
    </item>
  </channel>
</rss>

