Member since
02-01-2022
270
Posts
96
Kudos Received
59
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
2172 | 06-12-2024 06:43 AM | |
3279 | 04-12-2024 06:05 AM | |
2206 | 12-07-2023 04:50 AM | |
1334 | 12-05-2023 06:22 AM | |
2258 | 11-28-2023 10:54 AM |
06-26-2023
09:18 AM
@Ghilani You should know these 3 articles intimately if you want to use Execute Script https://community.cloudera.com/t5/Community-Articles/ExecuteScript-Cookbook-part-1/ta-p/248922 https://community.cloudera.com/t5/Community-Articles/ExecuteScript-Cookbook-part-2/ta-p/249018 https://community.cloudera.com/t5/Community-Articles/ExecuteScript-Cookbook-part-3/ta-p/249148 That said, it's sometimes helpful for me to see a working example and modify from there. As such, here is a github with a sample flow definition file (01_Fraud_Detection_Demo_Params_ExecuteScript.json) and script file(Fraud Demo ExecuteScript.py) that should work out of the box: https://github.com/cldr-steven-matison/Fraud-Prevention-With-Cloudera-SSB/tree/main/Templates Pay attention to imports and, then line 160 is what you want to get the flowfile. My flow ignores the content, but you should be able to find references in the Part 1 cookbook for anything you want to do w/ flowfile content.
... View more
06-26-2023
09:04 AM
@madhs @joseomjr A great example of the configuration is above. Here is a full flow definition file/example showing how to get values from JSON and prepare ahead of an InvokeHTTP processor: Flow Def File: https://raw.githubusercontent.com/cldr-steven-matison/NiFi-Templates/main/InvokeHttp_Demo_2.json
... View more
06-23-2023
09:48 AM
@Wpq To enable nifi to use the mysql driver jar, you should copy that jar file into the nifi node file system. Additionally, the file will need to be owned by the same user running nifi. For example: usually the "nifi" user. If you have a multi node cluster, the file should exist in all nodes in the same location. Once you get into nifi UI. Configure a Database Connection Pool Controller Service referencing your driver. When the path and permissions are right, you will be able to enable that Controller Service. Once enabled you can use it with many processors such as ExecuteSQL.
... View more
06-23-2023
06:02 AM
@bhadraka Check out this flow definition example for how to prepare and test JSON ahead of an invokeHttp post. Flow Def File: https://raw.githubusercontent.com/cldr-steven-matison/NiFi-Templates/main/InvokeHttp_Demo_2.json As far as the post itself goes. I highly recommend that you test API calls with something like Postman so that you can clearly test what headers and such are required so that the API call works. Only then try to make it work with InvokeHttp duplicating all headers. Figuring out how to get the call to work and how to configure InvokeHTTP at same time is not fun. TIP: in Invoke HTTP click the + to add key value pairs corresponding to your headers or values you need to pass.
... View more
06-22-2023
02:32 PM
@Juanes I believe you just need to resolve the missing dependency. Check out this solution: > pip3 install python-setuptools
> yum install impala-shell https://community.cloudera.com/t5/Support-Questions/How-to-install-impala-shell-on-RHEL-8-3-to-communicate-with/m-p/313665
... View more
06-22-2023
02:25 PM
@Manimaran Wait/Notify should be able to do what you are describing in the deeper dialouge. Based on your current flow complexity, I would recommend that you build a new sample to learn how wait/notify behavior works in nifi. This is very difficult setup, so having a learning flow w/o context of the existing functionality is highly optimal. Then you can refactor your sql flow accordingly, without having to figure out how to get wait/notify working during that refactor.
... View more
06-21-2023
06:40 AM
@DTM In that case you would need to use the DatabaseConnectionPool and jdbc to aws postgres. This will require permissions to allow nifi network to speak to RDS endpoint. If you cant use DBCP, you will have to put something between the RDS and nifi. For example nifi could use invokeHTTP to send/post data to an ec2 instance with some kind of API that can do the connectivity.
... View more
06-21-2023
06:27 AM
@DTM You are correct, to use an AWS credentials in nifi, you need to use the Controller Service. This controller service is then referenced by processors as a drop down menu for AWS Credentials Provider Service. If one does not exist in the drop down you can chose to create it. https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-aws-nar/1.21.0/org.apache.nifi.processors.aws.credentials.provider.service.AWSCredentialsProviderControllerService/index.html An example of one such processor is GetDynamoDB but there are many. Just search AWS in the processor search box to find all aws related processors.
... View more
06-21-2023
06:21 AM
2 Kudos
@drewski7 Your actual bottleneck with the initial cluster size is the limitation of cores and ram. No matter what you do with concurrency, run schedule, the processor config, or other processors is limited by total cores and jvm on 2 machines. The total number of nodes itself is a limitation too. Ideally you want a master node at the top of flow, and its pushing down flowfiles to 2-3-4-5+ nodes to distribute the workload. That division of the workload is where nifi shines and you see massive throughput.
... View more
06-20-2023
06:47 AM
1 Kudo
@wert_1311 Check out this article: https://community.cloudera.com/t5/Community-Articles/Support-Video-Enabling-kubectl-for-CDE/ta-p/314200 Enjoy Breakstone's amazing radio voice! 😉
... View more