Member since: 09-19-2017
Posts: 10
Kudos Received: 5
Solutions: 1
My Accepted Solutions
Title | Views | Posted |
---|---|---|
| 5763 | 09-27-2017 12:01 PM |
06-05-2018
04:54 PM
Abdelkrim Hadjidj, yes, you can do it if you know what you want to extract. This code helps when a user wants to load attributes from a JSON file, so that the attribute values are not hardcoded in the flow.xml. Often some values are specific to an environment (e.g., Dev, Test, Prod), and these can be separated out into a JSON file that does not change with updates to the flow.xml. With the latest version of NiFi (variable registry), this is no longer required; my intention is just to show the need for it.
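For illustration (hypothetical file name and keys), one such per-environment file might look like this, with a dev.json, test.json, and prod.json variant kept outside the flow.xml:

{
  "env": "dev",
  "hdfs_input_dir": "/data/dev/in",
  "ftp_host": "ftp.dev.example.com"
}

Each environment gets its own copy with the same keys but different values, and the flow references the resulting attributes (e.g., ${ftp_host}) instead of hardcoded strings.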
06-04-2018
05:43 PM
Load the JSON into the flowfile content, then feed it to an ExecuteScript processor with the code below. Note that this code assumes the JSON has no nested elements. Hope this helps.

import org.apache.commons.io.IOUtils
import org.apache.nifi.processor.io.InputStreamCallback
import org.apache.nifi.processor.io.OutputStreamCallback
import java.nio.charset.StandardCharsets

def flowFile = session.get()
if (flowFile == null) {
    return
}

def slurper = new groovy.json.JsonSlurper()
def attrs = [:] as Map<String,String>

// Read the flowfile content and collect every top-level key/value pair
session.read(flowFile, { inputStream ->
    def text = IOUtils.toString(inputStream, StandardCharsets.UTF_8)
    def obj = slurper.parseText(text)
    obj.each { k, v ->
        attrs[k] = v.toString()
    }
} as InputStreamCallback)

// Replace the content with an empty string; the values now live in attributes
flowFile = session.write(flowFile, { outputStream ->
    outputStream.write(''.getBytes(StandardCharsets.UTF_8))
} as OutputStreamCallback)

flowFile = session.putAllAttributes(flowFile, attrs)
session.transfer(flowFile, REL_SUCCESS)
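To illustrate the effect, assume an incoming flowfile whose content is (hypothetical values): {"batch_id": "42", "target_dir": "/tmp/out"}. The script would emit a flowfile with empty content and the attributes batch_id=42 and target_dir=/tmp/out, which downstream processors can reference via expression language such as ${target_dir}.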
06-04-2018
05:37 PM
Many people have asked me this question on how to load attributes from JSON.
Labels:
- Apache NiFi
01-29-2018
04:48 AM
Step 1: Check the service status. This should use a GET request: curl -u admin:admin -H "X-Requested-By: ambari" -i -X GET http://sandbox.hortonworks.com:8080/api/v1/clusters/Sandbox/services/NIFI
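If you only need the service state rather than the full JSON payload, the Ambari REST API also supports partial responses via a fields query parameter (a minimal sketch, same host and credentials as above): curl -s -u admin:admin -H "X-Requested-By: ambari" -X GET "http://sandbox.hortonworks.com:8080/api/v1/clusters/Sandbox/services/NIFI?fields=ServiceInfo/state"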
09-27-2017
04:01 PM
One more point: use ListHDFS and FetchHDFS rather than GetHDFS. If you are using TDE, there is a bug in GetHDFS that can surface in rare scenarios.
09-27-2017
03:59 PM
1 Kudo
I would love to make this flow more complicated. > What if one PutFTP succeeds and another one fails? 🙂 Do you want one to succeed and the other to fail, or do you want to handle it like a retry?
09-27-2017
03:13 PM
Try:

                           /--- UpdateAttribute(filename) --- PutFTP
ListHDFS --- FetchHDFS ---+
                           \--- MergeContent --- PutFTP
09-27-2017
12:01 PM
2 Kudos
Here I used a validation_table attribute to carry the table name in the flowfile. Create your own logic to count the rows from Oracle and Hive, then merge the two flows using a MergeContent processor. I created one process group to count the Oracle table and another to count the Hive table; they add an oracle_cnt and a hive_cnt attribute with the result. The results are merged into a single flowfile by correlating on the correlation attribute name. Also set the attribute strategy to "Keep All Unique Attributes".
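As a rough sketch (property names from the standard MergeContent processor; validation_table is the correlation attribute described above), the merge step would be configured like this:

Merge Strategy: Bin-Packing Algorithm
Correlation Attribute Name: validation_table
Minimum Number of Entries: 2
Maximum Number of Entries: 2
Attribute Strategy: Keep All Unique Attributes

With these settings, each bin waits for exactly two flowfiles sharing the same validation_table value, so the merged flowfile ends up carrying both oracle_cnt and hive_cnt for that table.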
09-27-2017
11:42 AM
Sure, you can do that with the MergeContent processor. If you are merging only a source and a target, you can set the processor properties Minimum Number of Entries and Maximum Number of Entries to 2, and also specify a correlation attribute to drive the merge.
09-27-2017
11:33 AM
2 Kudos
@Mohammed If your application can tolerate near-real-time processing (i.e., 500 ms to 2 s of latency), then you can use Spark Streaming. If your application needs complex processing (like joining a few extracted values with another stream of data to conclude some result) and also needs true real-time processing, then you should go with Storm. If you just want to ingest the data and do some simple transformation, then you can go with NiFi. Irrespective of the above, if you want to handle reliability, congestion control, and back pressure, it is good to use Kafka, because the stream data will first be put into Kafka and then NiFi/Spark/Storm can pull from Kafka for processing. I hope I could explain each component's usage 🙂