Created on 12-30-2019 08:44 AM - edited on 12-30-2019 06:51 PM by ask_bill_brooks
In this article, I am going to explain how you can work with the Schema Registry directly in your NiFi Data Flow. In my previous article Using the Schema Registry API I talk about the work required to expose the API methods needed to Create a Schema Registry Entity and update that Entity with an Avro Schema. In this article, I am going to take it one step further and complete both operations directly in my NiFi Data Flow. I will also include the Use Case I am working on. This flow accepts a CSV Parameter List which contains the columns and data types found within the fields object of the Avro Schema, processes the contents of the file, builds the required Avro Schema and saves it to the Schema Registry Entity I created.
At the time of writing this article I have not handled data types, so in these examples, the schema data types are all strings. In future updates to my flow, I will be mapping different types to appropriate data types for hive queries. For now, this article and the NiFi template will include two main parts:
https://github.com/steven-dfheinz/NiFi-Templates/raw/master/Schema_Registry_Demo.xml
GenerateFlowFile→UpdateAttribute→AttributesToJSON→InvokeHttp
**Note: notice my use of an output port to route failures during testing. While working on my flows I route all relationships like this until a time I decide it is appropriate to auto-terminate or route to error handling process group.
**Note: to see specific configuration, please download the template, add to your workspace, and inspect procs.
HandleHttpRequest→HandleHttpResponse→UpdateAttribute→ConvertCSVToAvro→ConvertAvroToJSON→ExtractText→AttributesToJSON→ReplaceText→UpdateAttribute→AttributesToJSON→InvokeHTTP
**Note: notice my use of an output port to route failures during testing. While working on my flows I route all relationships like this until a time I decide it is appropriate to auto-terminate or route to error handling process group.
**Note: to see specific configuration, please download the template, add to your workspace, and inspect procs.
As always, if you have any questions, or comments feel free to add them below or send me a message. If you need more help getting this template to work, I would be more than happy to help out.
Created on 08-18-2020 05:24 AM
I suppose nowadays you would use ConvertRecord instead of ConvertCSVToAvro and then ConvertAvroToJSON.