Created on 12-02-202004:23 PM - edited 12-03-202005:32 AM
Welcome to Part 3 of my article series on how to harness the hybrid cloud series (see parent article here). In this tutorial you will learn to use NiFi to:
Pull information from public APIs
Push this raw data to secure S3 bucket using SDX integration
Create Hive tables on top of this data by connecting to CDW
This corresponds to step 3 of the series, as explained below:
This tutorial requires the creation of two NiFi flow (one to map zip codes to attitude and longitude, and one to get the latest covid cases numbers):
In this tutorial, I will only detail the configuration of one of the two flows as they are almost identical except for file/table/field names. The full example code is in this gist. Here is the overview of the US Geocode flow:
Get data from API to secure S3 using SDX
For this part, we first use a simple configuration of an invoke http: