About DianaTorres

DianaTorres · ‎06-13-2024

@omeraran Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future. Thanks.

AkshayAmdocs · ‎06-12-2024

Hi @rki_ , @ChethanYM , @paras , Hope you are doing well! Could you please help us with above issue? Thanks, Akshay

MattWho · ‎06-11-2024

Yes, i believe this to be a legitimate NiFi bug.

DianaTorres · ‎06-10-2024

@Deejay Welcome to the Cloudera Community! To help you get the best possible solution, I have tagged our NiFi expert @steven-matison who may be able to assist you further. Please keep us updated on your post, and we hope you find a satisfactory solution to your query.

DianaTorres · ‎06-07-2024

@dankh Welcome to the Cloudera Community! This information has been sent to be updated by the team in charge. Thank you so much for your contribution!

MattWho · ‎06-06-2024

@G_B NiFi cluster deployments expect that all nodes in the cluster have same hardware specifications. There is no option in NiFi's Load Balanced connections to customize load-balancing based on current CPU load average of some other node. Even doing so would require NiFi nodes to continuously ping all other nodes to get the current load average before sending FlowFiles which would impact performance. The only thing that would result in any form of variation in distribution would be a node receive rate being diminished, but that is out of NiFi's control. Round Robin will skip a node in rotation if the node is unable to receive FlowFiles as fast as another node. Also keep in mind that a NiFi Cluster elects a node the roles "cluster coordinator" and "primary node". Sometimes both roles get assigned to same node. The assignment of these roles can change at. anytime. The primary node is only node that will schedule "primary node" only processors to execute. So your one node lighter on CPU could also end up assigned this role adding to its CPU load average. Often CPU load average is not only impacted by volume, but also content size of the FlowFiles. The LB connections also do not take in to account FlowFile content size when distributing FlowFiles. While your best option here performance wise is to make sure all nodes have same hardware specifications, there are a few less performant options you could try to distribute your data differently. 1. Use Remote Process Group (RPG) which uses Site-To-SIte (S2S) to distribute FlowFiles across your NiFi nodes. Always recommend using RPG to push to a Remote Input port rather then pull from an Remote output port to achieve better load distribution. Issue here is you need to add RPGs and Remote ports everywhere you were previously using LB configured connections. 2. Build a smart data distribution reusable dataflow. You could build a data flow that sorts FlowFiles by their content size ranges, merges bundles via mergeContent using FlowFile Stream, v3 merge format, send bundles based on size ranges to your various nodes via invokeHTTP to listenHTTP, and then unpackContent once received to extract the FlowFile bundle. This mergeContent is going to add addition cpu load. 3. Consider using DistributeLoad (can be configured with weighted distribution allowing you to create three distribution relationships with maybe like 5 FlowFile per relationship 1 and 2, and relationship with only 1 per iteration. This allows you to send 1 to you lower core node for every 5 sent to other two nodes. You would still need to use updateAttribute (set custom target node URL), mergeContent, invokeHttp, ListenHTTP, and unpackContent in this flow. So if addressing your hardware differences is not option, Number 1 is probably your next best choice. Please help our community thrive. If you found any of the suggestions/solutions provided helped you with solving your issue or answering your question, please take a moment to login and click "Accept as Solution" on one or more of them that helped. Thank you, Matt

JOLTEnjoyer · ‎06-06-2024

@Thar11027 yes, but it will be more complex: you can put a shift operation and using the "*" wildcard. So if for example you do this: [ { "operation": "shift", "spec": { "*": { "*/*": "[&1].&", //with a '/' in the middle somewhere "/*":"[&1].&", //with a '/' at the start followed by something "*/":"[&1].&" //with a '/' at the end following something } } } ] You will obtain all the desired fields, so it will be more easy to do the manual substitution(try it on the jolt demo site). Know that if you want to make it full automated, it will be a little more difficult, because then you would have to manipulate the string of the field. If you are interested in that I really suggest you to look at the last example on the guide I already sent you (My Guide). (I don't know if maybe it will be more appropriate to open another question about this other problem, because the topic changed and maybe if someone with the same problem is searching for it, it can be found)

DianaTorres · ‎06-03-2024

@sibin Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future. Thanks.

DianaTorres · ‎05-31-2024

@adsejnf Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future. Thanks.

shrikantverma · ‎05-27-2024

Online	Offline
Last Visited	‎07-14-2026 07:25 AM

Member Since	‎11-17-2021 08:08 AM
Last Visited	‎07-14-2026 07:25 AM
Posts	1,163
Kudos received	241

Cloudera Community

Re: How to change the company in the profile

Re: Keycloak SSO Hive REST Catalog

Re: Error connecting to NiFi Registry from NiFi UI...

Re: How to change my Account Email Address?

Re: Cannot erase old /opt/cloudera/parcels

Re: Need Help About Apache NiFi

Re: Kafka Machine not releasing memory from buffrt

Re: Nifi: Flowfile stuck in front of a processor g...

Re: Nifi Dynamic Parameter Context / ListAzureBlob...

Re: CDP Runtimes 7.1.9 CHF7 wrong URL

Re: Load balancing in NiFi - Heterogenous Nodes in...

Re: JOLT TRANSFORMATION returns unexpected result

Re: Cloudera after setting up custom kerberos for ...

Re: Mapreduce doesn't successfully do INSERT / CRE...

Re: Un-optimize queries are running on metastore_d...