Member since
02-01-2022
285
Posts
103
Kudos Received
60
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 1182 | 05-15-2025 05:45 AM | |
| 5124 | 06-12-2024 06:43 AM | |
| 8120 | 04-12-2024 06:05 AM | |
| 5995 | 12-07-2023 04:50 AM | |
| 3299 | 12-05-2023 06:22 AM |
06-23-2023
09:48 AM
@Wpq To enable nifi to use the mysql driver jar, you should copy that jar file into the nifi node file system. Additionally, the file will need to be owned by the same user running nifi. For example: usually the "nifi" user. If you have a multi node cluster, the file should exist in all nodes in the same location. Once you get into nifi UI. Configure a Database Connection Pool Controller Service referencing your driver. When the path and permissions are right, you will be able to enable that Controller Service. Once enabled you can use it with many processors such as ExecuteSQL.
... View more
06-23-2023
06:02 AM
@bhadraka Check out this flow definition example for how to prepare and test JSON ahead of an invokeHttp post. Flow Def File: https://raw.githubusercontent.com/cldr-steven-matison/NiFi-Templates/main/InvokeHttp_Demo_2.json As far as the post itself goes. I highly recommend that you test API calls with something like Postman so that you can clearly test what headers and such are required so that the API call works. Only then try to make it work with InvokeHttp duplicating all headers. Figuring out how to get the call to work and how to configure InvokeHTTP at same time is not fun. TIP: in Invoke HTTP click the + to add key value pairs corresponding to your headers or values you need to pass.
... View more
06-22-2023
02:32 PM
@Juanes I believe you just need to resolve the missing dependency. Check out this solution: > pip3 install python-setuptools
> yum install impala-shell https://community.cloudera.com/t5/Support-Questions/How-to-install-impala-shell-on-RHEL-8-3-to-communicate-with/m-p/313665
... View more
06-22-2023
02:25 PM
@Manimaran Wait/Notify should be able to do what you are describing in the deeper dialouge. Based on your current flow complexity, I would recommend that you build a new sample to learn how wait/notify behavior works in nifi. This is very difficult setup, so having a learning flow w/o context of the existing functionality is highly optimal. Then you can refactor your sql flow accordingly, without having to figure out how to get wait/notify working during that refactor.
... View more
06-21-2023
06:40 AM
@DTM In that case you would need to use the DatabaseConnectionPool and jdbc to aws postgres. This will require permissions to allow nifi network to speak to RDS endpoint. If you cant use DBCP, you will have to put something between the RDS and nifi. For example nifi could use invokeHTTP to send/post data to an ec2 instance with some kind of API that can do the connectivity.
... View more
06-21-2023
06:27 AM
@DTM You are correct, to use an AWS credentials in nifi, you need to use the Controller Service. This controller service is then referenced by processors as a drop down menu for AWS Credentials Provider Service. If one does not exist in the drop down you can chose to create it. https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-aws-nar/1.21.0/org.apache.nifi.processors.aws.credentials.provider.service.AWSCredentialsProviderControllerService/index.html An example of one such processor is GetDynamoDB but there are many. Just search AWS in the processor search box to find all aws related processors.
... View more
06-21-2023
06:21 AM
2 Kudos
@drewski7 Your actual bottleneck with the initial cluster size is the limitation of cores and ram. No matter what you do with concurrency, run schedule, the processor config, or other processors is limited by total cores and jvm on 2 machines. The total number of nodes itself is a limitation too. Ideally you want a master node at the top of flow, and its pushing down flowfiles to 2-3-4-5+ nodes to distribute the workload. That division of the workload is where nifi shines and you see massive throughput.
... View more
06-20-2023
06:47 AM
1 Kudo
@wert_1311 Check out this article: https://community.cloudera.com/t5/Community-Articles/Support-Video-Enabling-kubectl-for-CDE/ta-p/314200 Enjoy Breakstone's amazing radio voice! 😉
... View more
06-20-2023
06:42 AM
3 Kudos
@drewski7 This blog is a great place to start: https://blog.cloudera.com/benchmarking-nifi-performance-and-scalability/ That said, some recommendations: Recommend 3 nodes. Use 32 or 64gb ram. Set min ram 16, max 32, let nifi/operating system leverage other 32gb. Add more cores and tune Active Thread Count accordingly Be careful which processors are Primary Only and which processors are not. Do not over loadbalance queues, load balance at top of flow, let nifi distribute work load naturally after that. Tune Processor Concurrency and Run Schedule. Be sure to understand how each work. With a good setup tuned as above, have a plan to identify when time is appropriate to scale horizontally (add more nodes). Here are some more docs that get specific into sizing: https://docs.cloudera.com/cfm/2.1.1/nifi-sizing/topics/cfm-sizing-recommendations.html https://docs.cloudera.com/HDPDocuments/HDF3/HDF-3.5.2/nifi-configuration-best-practices/content/configuration-best-practices.html
... View more
06-15-2023
09:36 AM
@Ray82 Yes, you can achieve this with UpdateRecord. You will need to provide record reader/writer with schema of your upstream and downstream. Then in UpdateRecord you explicitly add properties (+) for each record value you want to update versus using a SQL statement like QueryRecord. Here are some useful community posts on this topic: https://community.cloudera.com/t5/Community-Articles/Update-the-Contents-of-FlowFile-by-using-UpdateRecord/ta-p/248267 https://community.cloudera.com/t5/Support-Questions/NiFi-UpdateRecord-processor-is-not-updating-JSON-path/m-p/186256
... View more