* You may need to increase the heap size used by NiFi (the -Xms and -Xmx JVM arguments in conf/bootstrap.conf). You will know this is necessary if you start getting the error: java.lang.OutOfMemoryError
Create the HBase table. You can easily do this via Apache Zeppelin.
Create the Phoenix view. You can easily do this via Apache Zeppelin.
Configure the ListHDFS processor
Configure the FetchHDFS processor
Use the SplitText processor to split the HDFS files into individual JSON records, one per line
I used the processor as is, without any changes other than renaming it to 'Split_Lines_In_File'.
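To make the splitting step concrete, here is a minimal Python sketch of what it amounts to, assuming each line of a fetched file holds one complete JSON record. The function name `split_lines`, the parameter `lines_per_split`, and the sample records are hypothetical and for illustration only; this is not NiFi's implementation, and skipping blank lines is a choice of the sketch rather than documented processor behavior.

```python
import json


def split_lines(file_text, lines_per_split=1):
    """Yield chunks of `lines_per_split` lines from the file content.

    Blank lines are skipped (an assumption of this sketch).
    """
    lines = [ln for ln in file_text.splitlines() if ln.strip()]
    for i in range(0, len(lines), lines_per_split):
        yield "\n".join(lines[i:i + lines_per_split])


# Hypothetical file content: one JSON record per line.
sample = (
    '{"id": 1, "user": {"name": "alice"}}\n'
    '{"id": 2, "user": {"name": "bob"}}\n'
)

# Each chunk becomes its own record, analogous to one flow file per line.
records = [json.loads(chunk) for chunk in split_lines(sample)]
```

With one line per split, every downstream processor sees exactly one JSON document at a time, which is what the JSON-based processors in the following steps expect.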
Configure the JoltTransformJSON processor to extract only those fields needed for the dashboard.
We only need the following fields from the JSON data:
Some of the above fields are child elements of a top-level parent. The 'Jolt Specification' property needs to be set so that those fields are extracted properly:
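As an illustration of how a Jolt shift specification pulls nested fields up into a flat record, the sketch below emulates a very small subset of the shift operation in Python. The field names (`id`, `text`, `user.name`, `user.followers`) and the sample record are hypothetical, not the fields used in this article, and the `shift` helper only handles literal keys with flat output names (no wildcards or dotted output paths).

```python
# A Jolt-style specification: the spec mirrors the input structure, and each
# leaf value names the output field. All field names here are hypothetical.
jolt_spec = [
    {
        "operation": "shift",
        "spec": {
            "id": "id",
            "text": "text",
            "user": {
                "name": "user_name",
                "followers": "user_followers",
            },
        },
    }
]


def shift(data, spec, out=None):
    """Copy the fields named in `spec` from `data` into a flat dict."""
    out = {} if out is None else out
    if not isinstance(data, dict):
        return out
    for key, target in spec.items():
        if key not in data:
            continue  # fields missing from the input are simply skipped
        if isinstance(target, dict):
            shift(data[key], target, out)  # descend into the child element
        else:
            out[target] = data[key]
    return out


record = {
    "id": 1,
    "text": "hello",
    "user": {"name": "alice", "followers": 10, "lang": "en"},
    "source": "web",
}

flat = shift(record, jolt_spec[0]["spec"])
```

Fields not mentioned in the spec (`source` and `user.lang` in this example) are dropped, which is how the transform keeps only what the dashboard needs.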
Configure the PutHBaseJSON processor to store the record.
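Conceptually, this step turns each flat JSON document into one HBase row: one field supplies the row key and the remaining fields become columns in a single column family. The Python sketch below shows that mapping under stated assumptions: the helper `json_to_put`, the column family `cf`, and the sample record are hypothetical, values are stored as UTF-8 strings, and the row key field is left out of the columns. It does not talk to HBase or reproduce the processor's code.

```python
import json


def json_to_put(flowfile_json, row_id_field, column_family):
    """Map a flat JSON document to (row_key, {"family:qualifier": bytes})."""
    doc = json.loads(flowfile_json)
    row_key = str(doc[row_id_field])
    cells = {
        f"{column_family}:{name}": str(value).encode("utf-8")
        for name, value in doc.items()
        if name != row_id_field  # the row key field is not stored as a column
    }
    return row_key, cells


# Hypothetical flattened record, e.g. the output of the Jolt transform step.
flat_json = '{"id": 1, "text": "hello", "user_name": "alice"}'

row_key, cells = json_to_put(flat_json, row_id_field="id", column_family="cf")
```

Because every field lands in a named column, a Phoenix view over the same table can expose those columns to SQL queries for the dashboard.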
awesome @Binu Mathew
@Binu Mathew: Thanks for sharing the awesome article. Would you mind sharing the sample data?