Created 03-22-2024 02:54 AM
Hi Community,
I have a system which reads data from source and gets enriched to further use it for business.
I need 3 clusters, one with ETL capabilities(NiFi), the other with storage alone and third one where I can run my business using spark.
We have almost 10 Billion Load and abount 15,000 process in which we have about 10% of custom process as NiFi is not able to do custom lookup, custom sink, custom filter, custom mapper and so on which utilizes many threads.
Please also recommend me AZURE machine series for these 3 cluster too.
Thank You
Regards
Chetan K C
Created 03-22-2024 04:29 AM
@Chetankc, Welcome to our community! To help you get the best possible answer, I have tagged in our experts @MattWho @steven-matison @cotopaul @SAMSAL who may be able to assist you further.
Please feel free to provide any additional information or details about your query, and we hope that you will find a satisfactory solution to your question.
Regards,
Vidya Sargur,Created 03-22-2024 06:21 AM
@Chetankc
From a NiFi perspective there is not much guidance that can be given with such little information.
What kind of performance and throughput are you achieving now? and onn what type of setup (how many nodes in your NiFi cluster, number of CPU cores, JVM Heap settings, type of disk, etc) currently?
Thank you,
Matt