Created on 07-28-2016 03:41 PM - edited 08-17-2019 11:09 AM
HDF Overview
Overview
Hortonworks DataFlow (HDF) powered by Apache NiFi, Kafka and Storm, collects, curates, analyzes and delivers real-time data from the IoAT to data stores both on-premises and in the cloud. This is the quick installation guide to install Apache NiFi on AWS EC2 instance. Please refer this document as supplement guide to official Hortonworks HDF documentation.
Prerequisites
Before you install Apache NiFi on AWS, make sure
Installation Steps.
The screenshots in this section detail the setup and configuration of Apache NiFi on EC2 instance.
Refer the NiFi Admin Guide for the System requirements. This document covers installation on a Redhat linux (64 bit) EC2 instance.
Login to AWS and launch the EC2 instance and OS of your choice. (Please make sure the selected OS is supported by NiFi). The current exercise uses the Red Hat Enterprise Linux 7.2 image (HDF EC2 Instance).
Make sure you keep the security private key safe. Under Network and Security configuration, open the Http ports (e.g. 8081 & 8082 shown below) to access the NiFi web interface and for the site to site protocol to exchange data between multiple NiFi instances.
e.g. scp -i HDF.pem HDF-1.2.0.1-1.zip ec2-user@<public-dns-hostname>:/home/ec2-user
where HDF.pem is private key.
sudo yum install unzip
sudo yum install java
e.g. update the site-to-site properties to include the following
nifi.remote.input.socket.host=<public_dns_hostname>
nifi.remote.input.socket.port=8082
nifi.remote.input.secure=false
Benefits
Document References
NiFi System Admin Guide:
Created on 09-08-2016 10:21 PM
Hi Milind - great overview.
Any recommendations around the type of instance we should use?
Created on 07-10-2017 05:11 PM
Is this article still valid for HDF version 3.0 which was released recently? Are there easier ways of deploying to Amazon?