Community Articles

Find and share helpful community-sourced technical articles.
avatar

Hortonworks Cloud Overview

Overview

Hortonworks Cloud (Hortonworks Connected Data cloud) is system for launching and managing a data lake on AWS. You can easily launch HDP clusters for analyzing and processing data. With Apache Hadoop. Use this document as supplement guide to existing official guide by Hortonworks. The doc will guide you in running HDP platform in your AWS environment.

Prerequisites

Before you launch hortonworks cloud controller make sure

Hortonworks cloud supported in following regions for now.

Us-east-1, us-west-2, eu-central-1, eu-west-1 and ap-northeast-1.

Architecture Overview

Cloud Controller is the primary component of hortonworks cloud. Cloud controller manages one or more HDP clusters on AWS. It runs on EC2 instance and used for launching and managing cluster. The master and worker nodes runs on multiple EC2 instances. Different cluster type templates are provided to accelerate the HDP platform setup on AWS. Currently two HDP versions are available.

  • HDP 2.4
  • HDP 2.5 (Technical Preview).

During cluster creation you can select the HDP version and cluster types.

Cloud Controller

The screenshots in this section covers the detail setup and configuration after launching the cloud controller to create the AWS resources. You can launch the cloud controller at

http://hortonworks.github.io/hdp-aws/launch/

Setup & Configuration Details.

hc1-at-115206-am.png

hc2-at-120519-pm.png

screen-shot-2016-07-26-at-120243-pm.png

screen-shot-2016-07-26-at-120546-pm.png

screen-shot-2016-07-26-at-120743-pm.png

Once the stack creation is complete you can access the cloud controller to create HDP cluster.

Accessing Cloud UI.

Using the CloudURL you can access the cloud controller instance. Check the screenshot below to get your CloudURL.

screen-shot-2016-07-26-at-10434-pm.png

screen-shot-2016-07-26-at-10512-pm.png

screen-shot-2016-07-26-at-10544-pm.png

You have provided with cluster templates to accelerate the HDP cluster configuration. You can select the HDP version and cluster type.

screen-shot-2016-07-26-at-10654-pm.png

screen-shot-2016-07-26-at-10821-pm.png

screen-shot-2016-07-26-at-10907-pm.png

It normally takes less than 30 minutes to setup 3 node HDP cluster.

screen-shot-2016-07-26-at-10956-pm.png

Validate all the steps for any errors and you can launch the Ambari web UI to verify the cluster setup.

screen-shot-2016-07-26-at-30332-pm.png

screen-shot-2016-07-26-at-13405-pm.png

On AWS dashboard it will provide the details of all the running EC2 instances.

screen-shot-2016-07-26-at-13527-pm.png

Post cluster setup Actions.

The cluster action provides you the option to

  • Resize the cluster
  • Clone the cluster.

screen-shot-2016-07-26-at-31618-pm.pngscreen-shot-2016-07-26-at-31339-pm.png

Benefits

Hortonworks cloud provides greater agility through faster HDP cluster deployment in AWS.

It provides elastic scalability on demand

You can launch the enterprise ready HDP platform in less than hour.

Document References

http://hortonworks.github.io/hdp-aws/

If you run into AWS related resource limitation, please validate the following doc. (elastic IP address limit etc.)

http://docs.aws.amazon.com/AmazonVPC/latest/UserGuide/VPC_Appendix_Limits.html

you can submit request to increasing the limit by

submit a request

4,213 Views
Comments
avatar
Guru

Thanks @milind pandit for posting this summary! FWIW: folks can find out some more here too...

https://hortonworks.com/blog/quickly-launch-hortonworks-data-platform-amazon-web-services/

Have fun!

avatar
New Contributor

@milind pandit Does HWX Data Cloud has Atlas, Ranger & Griffin included (like HDP 2.6)? How do I configure them?

avatar

@Sameer Bhatnagar They are not currently included. Currently, we only include the following services.

avatar
New Contributor

@Dominika Bialek, thank you! To include Data Governance, what solution would you recommend?I prefer to using Cloud.

avatar

@jeff Can you answer this?

By the way, you get a better visibility by posting a question as a separate thread rather than commenting below an article.