Created on 11-18-2016 09:57 AM - edited 08-17-2019 08:00 AM
* You may need to increase the heap size used by NiFi. You will know this is necessary if you start seeing the error java.lang.OutOfMemoryError.
Create the HBase table. You can easily do this via Apache Zeppelin.
Create the Phoenix view. You can easily do this via Apache Zeppelin.
Configure the ListHDFS processor
Configure the FetchHDFS processor
Use the SplitText processor to split the HDFS files into individual JSON records.
I used the processor as is, without any changes other than renaming it to 'Split_Lines_In_File'.
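To make the effect of this step concrete, here is a minimal Python sketch of the same idea outside NiFi. It assumes each line of an HDFS file holds exactly one JSON record and that the split happens one line at a time. The function name `split_json_lines` and the sample content are hypothetical and are not taken from the original flow or data.

```python
import json


def split_json_lines(file_text):
    """Yield one parsed JSON record per non-empty line of the input text."""
    for line in file_text.splitlines():
        line = line.strip()
        if line:  # skip blank lines between records
            yield json.loads(line)


# Hypothetical file content: two JSON records, one per line.
sample = (
    '{"id": 1, "user": {"name": "a"}}\n'
    '{"id": 2, "user": {"name": "b"}}\n'
)

records = list(split_json_lines(sample))
for record in records:
    print(record)
```

In the NiFi flow, each resulting record travels downstream as its own FlowFile, which is why the later processors can treat every FlowFile as a single JSON document.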
Configure the JoltTransformJSON processor to extract only those fields needed for the dashboard.
We only need the following fields from the JSON data:
Some of the above fields are child elements of a top-level parent. The 'Jolt Specification' property needs to be set so that those fields are extracted properly:
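As a rough illustration of what such a specification achieves, the Python sketch below pulls selected fields, including nested ones, out of a record and writes them to a flat structure. This is not Jolt and not the specification used in this article. The field names, the `FIELD_MAPPING` table, and both helper functions are hypothetical and chosen only for the example.

```python
def get_path(record, dotted_path):
    """Follow a dotted path such as 'user.name' into nested dicts; None if absent."""
    current = record
    for key in dotted_path.split("."):
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


def extract_fields(record, mapping):
    """Copy only the mapped source paths into a flat dict keyed by the target names."""
    flat = {}
    for source_path, target_name in mapping.items():
        value = get_path(record, source_path)
        if value is not None:
            flat[target_name] = value
    return flat


# Hypothetical mapping: source path in the nested record -> flat output field.
FIELD_MAPPING = {
    "id": "id",
    "user.name": "user_name",
    "place.country": "country",
}

nested = {
    "id": 7,
    "user": {"name": "alice", "lang": "en"},
    "place": {"country": "US"},
    "extra": "not needed for the dashboard",
}

flat = extract_fields(nested, FIELD_MAPPING)
print(flat)
```

Flattening at this stage matters because the next processor expects simple top-level fields that it can store as individual columns.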
Configure the PutHBaseJSON processor to store the record.
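Conceptually, PutHBaseJSON takes a single flat JSON document, uses one field as the row identifier, and stores the remaining fields as column qualifiers and values under one column family. The Python sketch below mimics that mapping for illustration only. It does not talk to HBase, it simplifies the processor's behavior, and the function name, field names, and column family `cf` are assumptions rather than values from this article.

```python
def to_hbase_row(record, row_id_field, column_family):
    """Map a flat record to (row_key, cells), with cells keyed as 'family:qualifier'."""
    row_key = str(record[row_id_field])
    cells = {
        f"{column_family}:{name}": str(value)
        for name, value in record.items()
        # the row id becomes the key, and null-valued fields are skipped
        if name != row_id_field and value is not None
    }
    return row_key, cells


# Hypothetical flat record, as it might look after the Jolt transform.
flat_record = {"id": 7, "user_name": "alice", "country": "US", "note": None}

row_key, cells = to_hbase_row(flat_record, row_id_field="id", column_family="cf")
print(row_key, cells)
```

The column family and qualifiers written here should line up with the columns declared in the Phoenix view, since the view is what exposes the HBase data for querying.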
Created on 11-18-2016 04:07 PM
awesome @Binu Mathew
Created on 12-31-2016 03:25 AM
@Binu Mathew: Thanks for sharing this awesome article. Would you mind sharing the sample data?