Member since
07-19-2018
613
Posts
101
Kudos Received
117
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 5685 | 01-11-2021 05:54 AM |
| | 3811 | 01-11-2021 05:52 AM |
| | 9485 | 01-08-2021 05:23 AM |
| | 9280 | 01-04-2021 04:08 AM |
| | 38595 | 12-18-2020 05:42 AM |
12-30-2019
06:43 AM
2 Kudos
The alert is just indicating that you need to enter a value where one is missing. Scroll down within the form (not the whole page) and you should be able to find the missing value. Another fast way to get there is to search/filter for nifi.toolkit.tls.token, which will take you directly to it. Once the alert/error is satisfied, the NEXT button at the bottom of the page will work again. If this reply answers your question, please mark it as a solution.
12-30-2019
06:30 AM
A ReplaceText processor can convert from comma-separated to tab-separated values; this is the easiest option. Configure it with , as the Search Value and a literal tab as the Replacement Value.
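The ReplaceText behavior described above can be sketched in plain Python. This is a minimal illustration, not the processor itself; note that a naive comma replacement like this would also split quoted fields that happen to contain commas.

```python
import re

# With "," as the Search Value and a literal tab as the Replacement
# Value, ReplaceText turns every comma in the line into a tab.
def csv_line_to_tsv(line: str) -> str:
    return re.sub(",", "\t", line)

print(csv_line_to_tsv("sku-1,25.0,in-stock"))
```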
12-30-2019
06:23 AM
1 Kudo
Since that is not working in your use case (I tried to show as much of mine as I could, and I think you understand the concept), the next trick is just getting the regex to match your string. Try a tool such as https://regex101.com. In this tool I quickly matched a regex for the 4th column:
.*?,.*?,.*?,(.*?),.*
against:
test,test,test,25.0,test,test,test
For your example, the above should work. It is not necessary to parse past the column you need; the .* at the end picks up the entire rest of the line.
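The regex from the reply can be checked directly in Python as well as on regex101. A quick sketch applying it to the sample line:

```python
import re

# The lazy .*? groups walk past the first three comma-separated
# columns, the group captures the 4th, and the trailing .* consumes
# the rest of the line.
pattern = re.compile(r".*?,.*?,.*?,(.*?),.*")
line = "test,test,test,25.0,test,test,test"

match = pattern.match(line)
print(match.group(1))  # 25.0
```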
12-29-2019
06:56 AM
1 Kudo
Regarding:
"file_attachment" : "[\"attachment1.docx\",\"attachment2.docx\",\"attachment3.docx\",\"attachment4.docx\"]"
The processor translates the object as a string, so you will need to take some action on it to prepare it for use downstream. Assuming you are working with the content of the FlowFile, the processor you need is ReplaceText and the syntax is:
${'$1':unescapeJson()}
To take it even further so it is not a string at all:
${'$1':unescapeJson():replace('"[','['):replace(']"',']')}
The same expression goes in the ReplaceText processor properties. If you are operating on an attribute, you can do the similar action with UpdateAttribute:
${attributeName:unescapeJson()}
If this reply answers your question, please mark it as a solution.
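The reason unescapeJson is needed can be sketched in Python: the array arrives as an escaped JSON string, so it has to be parsed twice before it is a real list. The FlowFile content below is a trimmed, hypothetical example.

```python
import json

# Hypothetical FlowFile content: the array is an escaped JSON string.
content = '{"file_attachment": "[\\"attachment1.docx\\",\\"attachment2.docx\\"]"}'

record = json.loads(content)
# Still a plain string here: '["attachment1.docx","attachment2.docx"]'
attachments = json.loads(record["file_attachment"])
print(attachments)
```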
12-29-2019
06:42 AM
1 Kudo
@CJoe36 There are a couple of things you need to do here:
1. Get the value of the column into an attribute.
2. Create routes based on the value of that attribute.
For #1 there are a few ways. First, you can create a flow that uses a CSV Reader and a known schema; this lets you translate and parse the columns, but it requires multiple processors and a CSVReader Controller Service. Second, you can just use ExtractText with regex (one processor). Here is an example of ExtractText pulling a quantity and SKU from an inventory CSV. Notice the commas in the regex and the parentheses marking which field maps to the defined attribute (sku or qty):
qty: .*?,,.*?,,(\d+)$
sku: (.*?),,.*?,,.*?
If your CSV has 10 columns and column 5 holds your value, you probably want something like this: ,,,,(.*?),,,,,
For #2, use the RouteOnAttribute processor with routes defined using NiFi Expression Language. Once a route is defined, you can choose it downstream of RouteOnAttribute; anything that does not match goes to unmatched. If this reply helps answer your question, please mark it as a solution.
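The extract-then-route steps above can be sketched outside NiFi too. This is a rough Python sketch, assuming a standard single-comma CSV of 10 columns with the value in column 5 (the reply's sample data uses double commas, so adjust the pattern to your delimiter); the low_stock threshold is a made-up routing condition, not something from the original question.

```python
import re

# Capture column 5 of a 10-column, comma-delimited line
# (skip four fields, capture the fifth).
COLUMN5 = re.compile(r"^(?:[^,]*,){4}([^,]*),")

def route(line: str) -> str:
    """Mimic ExtractText + RouteOnAttribute: extract, then route."""
    m = COLUMN5.match(line)
    if not m:
        return "unmatched"
    value = m.group(1)
    # Example routing condition on the extracted attribute value.
    return "low_stock" if value.isdigit() and int(value) < 10 else "matched"

print(route("a,b,c,d,5,f,g,h,i,j"))
print(route("a,b,c,d,500,f,g,h,i,j"))
```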
12-29-2019
06:26 AM
@KUnew Sounds like you have NiFi and Elasticsearch installed. For a proof of concept, the single-node simple installs should work fine; there are no further requirements, and a Cloudera or Hortonworks cluster is not required. However, if you intend to run multiple instances of NiFi and one or more Elasticsearch master and data nodes, I would highly recommend using Hortonworks HDF with Ambari to manage the NiFi and Elasticsearch installation and configuration. I have created the ELK Management Pack for HDF 3.x, which allows installation of Elasticsearch 6.3.2 or 7.4.2 along with Logstash, Kibana, Filebeat, and Metricbeat. You can find it on my GitHub: https://github.com/steven-dfheinz/dfhz_elk_mpack I also have some articles here specifically about how to install the ELK Management Pack; just search for elasticsearch in the community Articles section. It is very easy to install Ambari and the management pack, and then install NiFi and ELK. If this reply answers your question, please mark it as a solution.
12-27-2019
07:57 AM
I have a use case coming up next week where I will need to process some Parquet to CSV. I created a few demos with NiFi 1.10, but unfortunately it is not possible to use that version of NiFi in the customer's environment. I know I can still accomplish what I need with 1.9 without the new 1.10 Parquet features, but those new features would be more efficient.
Is it possible to drop the required artifacts into 1.9 and achieve Parquet Reader functionality?
Labels:
- Apache NiFi
12-24-2019
04:43 AM
1 Kudo
This is a very basic use case for NiFi. I would recommend that once you get the file into NiFi, you split it line by line. Once you have the log file split, do the match logic on each single line, route the lines you want downstream, and handle them accordingly. There are many ways to do this, and the fun part of NiFi is discovering what works best for you. Here is a NiFi template I have that checks log files: https://github.com/steven-dfheinz/NiFi-Templates/blob/master/Get_File_Demo.xml If this answer helps solve your issue, please mark it as the accepted solution.
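The split-then-match flow described above can be sketched in a few lines of Python: split the log into lines (SplitText), test each line, and keep the matches for downstream handling. The "ERROR" keyword and the sample log are placeholders; substitute your own match logic.

```python
# Hypothetical log content standing in for the FlowFile.
log = """INFO  startup complete
ERROR disk full
INFO  heartbeat
ERROR connection refused"""

# Split line by line, then apply the match logic to each line.
matched = [line for line in log.splitlines() if "ERROR" in line]
print(matched)
```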
12-24-2019
04:31 AM
A couple of things I notice. First, per the documentation, your hostname should be an FQDN mapped to the correct IP. Not mapping a proper hostname to the main server IP will cause confusion between the short hostname, long hostname, localhost, the localhost IP, etc. Second, after fixing that, you need to create permissions from your NiFi FQDN hostname to your MySQL server for a user with the proper privileges. A simple test of network connectivity from NiFi to the MySQL host is to telnet to port 3306. If the telnet connects, then you know for sure you need to create the user@'host' permission grants as follows:
CREATE USER 'user'@'%' IDENTIFIED BY 'password';
GRANT ALL PRIVILEGES ON *.* TO 'user'@'%' WITH GRANT OPTION;
FLUSH PRIVILEGES;
Depending on your setup, the % wildcard may or may not work; if required, replace it with the NiFi FQDN hostname. Last suggestion: tail -f /var/log/nifi/nifi-app.log while testing. You will see much more information about errors than the NiFi UI shows, and you can also set each processor's error level to get more information in the UI.
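The telnet check above can also be scripted. This is a small sketch of the same connectivity probe in Python; the host and port are placeholders for your environment.

```python
import socket

# Equivalent of "telnet <mysql-host> 3306": try to open a TCP
# connection and report whether it succeeded.
def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

If this returns True but MySQL still rejects NiFi, the problem is almost certainly the user@'host' grants rather than the network.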
12-23-2019
09:45 AM
You will need to send the raw data to the NiFi HandleHTTPRequest processor over HTTPS. This requires that NiFi be secured (per the documentation). If your source is also secured, HandleHTTPRequest should be configured with an SSLContextService whose keystore and truststore contain the certificates for the source.