Community Articles

Find and share helpful community-sourced technical articles.
Announcements
Celebrating as our community reaches 100,000 members! Thank you!
avatar

DCloud for AWS 1.16 (released in June 2017) allows you to register a previously created RDS instance as a Hive or Druid metastore.

In this tutorial, we will be:

  • Launching an RDS instance and creating a database on it.
  • Registering a database running on an RDS instance as a Hive metastore.

Note that you can use these instructions for creating a Druid metastore.

Let's get started!

Prerequisites

As a prerequisite to this tutorial, you need to have an HDCloud cloud controller running on AWS. If you need help with the steps required to meet this prerequisite, refer to the following tutorial: How to set up Hortonworks Data Cloud (HDCloud) for AWS.

Launch an RDS Instance

1. Navigate to the RDS Dashboard at https://console.aws.amazon.com/rds.

2. In the top right corner, select the region in which you want to create your DB instance. Let’s create the RDS instance in the same region in which you've launched the cloud controller.

3. In the RDS Dashboard navigation pane, click Instances, and then click Launch DB instance to launch the Launch DB Instance Wizard.

4. In Step 1: Select Engine, select the PostgreSQL Engine and click Select.

5. In Step 2: Production?, select Dev/Test and click Next Step.

6. In Step 3: Specify DB Details, enter:

  1. For Instance Specifications, you can use values similar to those in the screenshot. Make sure to use DB Engine Version 9.5.4 or later.
  2. For Settings, come up with an identifier, a username, and a password for your instance.
  3. Click Next Step.

7. In Step 4: Configure Advanced Settings:

  1. In the Network & Security section, select the VPC where the RDS instance should be started. Select the same VPC in which your cloud controller is running.
  2. On the right, in the Connection Information, make sure that the Inbound access on the security group is set to “0.0.0.0/0”.
  3. In the Database Options section, enter a Database Name. This field is not required, so it’s easy to miss it. If you miss it, you will have to create the database manually.

8. Click Launch DB Instance.

9. Click on View Your DB Instances to get redirected to the RDS Dashboard. Keep this page open, as you will need to copy the RDS information and provide it in the Hive metastore registration form.

10. When your RDS instance is ready, proceed to the next step.

Congratulations! You've just launched an RDS instance and created a database on it. Let's register this database as a Hive metastore in your cloud controller.

Register a Hive Metastore

1. Log in to the cloud controller UI.

2. From the navigation menu, select SHARED SERVICES:

3. The list of registered Hive Metastores is displayed.

4. Click REGISTER METASTORE and the registration form is displayed:

5. Enter the following parameters:

  • Name: Enter the name to use when registering this Metastore to the cloud controller. This is not the database name.
  • HDP Version: Select the version of HDP that this Metastore can be used with.
  • JDBC Connection: Select the database type (PostgreSQL) and enter the RDS Endpoint (HOST:PORT/DB_NAME).
  • Authentication: Enter the RDS connection username and password.

You can obtain these parameters from the RDS dashboard:

6. Click Test connection to validate and test the RDS connection information.

7. Once your settings are validated and working, click REGISTER HIVE METASTORE to save the metastore. The metastore will now show up in the list of available metastores when creating a cluster.

Congratulations! You've just registered your RDS as a Hive metastore.

Feedback

Let us know if this was useful and how we can help you with HDCloud for AWS in the future. Free to leave a comment below with a suggestion for an HDCloud for AWS tutorial that you would like to see. Thanks!

2,551 Views
Comments

Updated for HDCloud for AWS version 1.14.1. Check it out!

Updated for HDCloud for AWS version 1.14.4. Check it out!

Updated for the latest HDCloud version 1.16. No major changes, just updated screenshots and links. Check it out!