Member since
07-21-2020
29
Posts
5
Kudos Received
3
Solutions
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 1210 | 03-21-2022 05:11 AM
 | 4390 | 04-22-2021 12:21 PM
 | 1887 | 02-08-2021 11:44 PM
05-13-2021
09:17 AM
Hello, I am struggling with a dependency issue in Spark. Being new to Spark, I hope there is a simple remedy. The question is: is there any mechanism to separate the dependencies of the Spark engine from the dependencies of an application? Example: the latest version of spark-core_2.12 (3.1.1, March 2021) depends on hadoop-client (3.3.0, March 2020), which itself depends on hadoop-common (3.3.0, July 2020), which finally depends on an ancient version of gson (2.2.4, May 2013). You can easily find many other examples, e.g. commons-codec, protobuf-java ... So what if your application, basically a library developed outside Spark, depends on the latest (no longer compatible) version of gson, 2.8.6? My obviously naive approach to starting a Spark application ends in runtime incompatibility clashes (e.g. with gson). Best regards Jaro
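For a clash like the gson one described above, two remedies are usually discussed. One is relocating (shading) the conflicting packages inside the application jar. The other is asking Spark to prefer the application's classes through the `spark.driver.userClassPathFirst` and `spark.executor.userClassPathFirst` settings, which the Spark documentation marks as experimental. The sketch below only assembles a `spark-submit` command line with the second option. The jar name and main class are placeholders, and whether this resolves a particular clash is not guaranteed.

```python
from typing import List


def build_spark_submit_args(app_jar: str, main_class: str, user_first: bool = True) -> List[str]:
    """Assemble a spark-submit command line that prefers the application's own jars."""
    args = ["spark-submit", "--class", main_class]
    if user_first:
        # Both properties exist in Spark's configuration and are documented as experimental.
        args += [
            "--conf", "spark.driver.userClassPathFirst=true",
            "--conf", "spark.executor.userClassPathFirst=true",
        ]
    args.append(app_jar)
    return args


if __name__ == "__main__":
    # "my-app-assembly.jar" and "com.example.Main" are hypothetical placeholders.
    print(" ".join(build_spark_submit_args("my-app-assembly.jar", "com.example.Main")))
```

If class-loading order alone does not help, relocating gson under a private package name in the application's fat jar avoids the clash without depending on that order.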
Labels: Apache Hadoop, Apache Spark
04-22-2021
10:21 AM
Hello, while attempting to start ListenHTTP (port=10000) on a fresh cluster, I get the following error. Do you know what to change to get it to work?

    ListenHTTP[id=6222ab5d-0177-1000-ffff-ffffaeaba266] Failed to properly initialize Processor. If still scheduled to run, NiFi will attempt to initialize and run the Processor again after the 'Administrative Yield Duration' has elapsed. Failure is due to java.io.IOException: Failed to bind to 0.0.0.0/0.0.0.0:10000: java.io.IOException: Failed to bind to 0.0.0.0/0.0.0.0:10000

Cloudera Flow Management (CFM) 2.0.4.0
1.11.4.2.0.4.0-80 built 09/27/2020 09:52:49 CEST
Tagged nifi-1.11.4-RC1

Best regards Jaro
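The error itself does not say why the bind failed. One common cause is that another process on the same host already listens on the port (HiveServer2, for example, uses 10000 by default if Hive happens to be installed there). A minimal probe, assuming it is run on the NiFi node while the ListenHTTP processor is stopped, could look like this:

```python
import socket


def can_bind(port: int, host: str = "0.0.0.0") -> bool:
    """Return True if a TCP socket can currently be bound to host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        try:
            probe.bind((host, port))
        except OSError:
            # Typically "address already in use" or a permission problem.
            return False
        return True


if __name__ == "__main__":
    port = 10000  # the port from the error message above
    state = "free" if can_bind(port) else "not bindable (in use or not permitted)"
    print(f"port {port} is {state}")
```

If the probe reports the port as not bindable, choosing a different Listening Port for the processor is the simplest thing to try.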
Labels: Apache NiFi
04-12-2021
01:44 AM
Hello, we see the following error on an UpdateAttribute processor:

    Failed to persist list of Peers due to java.io.IOException: All Partitions have been blacklisted due to failures when attempting to update. If the Write-Ahead Log is able to perform a checkpoint, this issue may resolve itself. Otherwise, manual intervention will be required.; if restarted and the nodes specified at the RPG are down, may be unable to transfer data until communications with those nodes are restored

A similar issue was reported some years ago: https://community.cloudera.com/t5/Support-Questions/Caused-by-java-io-IOException-All-Partitions-have-been/td-p/230571 but in our case the processor only updates a sequence number, so the pattern does not apply. How can we recover from this situation?

Cloudera Flow Management (CFM) 2.0.4.0
1.11.4.2.0.4.0-80 built 09/27/2020 09:52:49 CEST

Best regards Jaro
Labels: Apache NiFi
02-10-2021
08:29 AM
1 Kudo
Hello, I am trying to work out an automated deployment scenario in NiFi (no UI usage, just the CLI toolkit or alternatives). Let us assume you have a NiFi flow under load which needs to be updated. There is some data in the in-between connection queues. What happens when you try to update the flow? Will some data be lost, or will the update be refused? Is there any solution for a procedure like this?

1) stop the data-receiving processor
2) check whether the success connection queue is empty
3) apply steps 1 and 2 to the next processor, until the last processor is stopped
4) update the flow, ignoring any data in failure or error connection queues

Do you think this is a reasonable approach? Is there something ready to use?

Best regards Jaro
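The four steps above can be sketched as a small loop. This is only an illustration of the proposed procedure, not something the NiFi toolkit ships: `FlowClient`, `stop_processor` and `queued_count` are hypothetical names standing in for whatever CLI, REST or nipyapi calls would actually stop a processor and read a queue size.

```python
import time
from typing import Callable, Optional, Protocol, Sequence, Tuple


class FlowClient(Protocol):
    """Hypothetical interface; not part of NiFi, the CLI toolkit, or nipyapi."""

    def stop_processor(self, processor_id: str) -> None: ...

    def queued_count(self, connection_id: str) -> int: ...


def drain_and_stop(
    client: FlowClient,
    stages: Sequence[Tuple[str, Optional[str]]],
    poll_seconds: float = 1.0,
    timeout_seconds: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Stop processors front to back, waiting for each success queue to empty.

    stages lists (processor_id, success_connection_id) in flow order; the last
    processor may have no outgoing success connection (None).
    """
    for processor_id, connection_id in stages:
        client.stop_processor(processor_id)  # steps 1 and 3
        if connection_id is None:
            continue
        deadline = time.monotonic() + timeout_seconds
        while client.queued_count(connection_id) > 0:  # step 2
            if time.monotonic() >= deadline:
                raise TimeoutError(f"queue {connection_id} did not drain")
            sleep(poll_seconds)
    # Step 4, the flow update itself, is left to the caller.
```

Raising on timeout rather than waiting forever keeps a stuck downstream processor from blocking the deployment silently.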
Labels: Apache NiFi
02-08-2021
11:44 PM
1 Kudo
OK, I got it. I missed a parameter (-pv). It should be:

    nifi set-param -u http://localhost:8080 -pcid 828f0882-0177-1000-aabd-d680a070c5c5 -pn sample.URL -pv newvalue
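When the command is generated from a script, building it from named arguments makes it hard to drop the value flag again. A minimal sketch follows. The flags are the ones used above, while the launcher path `./bin/cli.sh` is an assumption about where the toolkit is installed.

```python
from typing import List, Sequence


def set_param_command(
    base_url: str,
    context_id: str,
    name: str,
    value: str,
    launcher: Sequence[str] = ("./bin/cli.sh",),  # assumed toolkit launcher path
) -> List[str]:
    """Build the argv for `nifi set-param`, always passing the value via -pv."""
    if not name:
        raise ValueError("parameter name must not be empty")
    return [
        *launcher,
        "nifi", "set-param",
        "-u", base_url,
        "-pcid", context_id,
        "-pn", name,
        "-pv", value,
    ]


if __name__ == "__main__":
    cmd = set_param_command(
        "http://localhost:8080",
        "828f0882-0177-1000-aabd-d680a070c5c5",
        "sample.URL",
        "newvalue",
    )
    # Pass cmd to subprocess.run(cmd, check=True) to execute it.
    print(" ".join(cmd))
```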
02-08-2021
12:27 PM
Hello, I have tried to use the NiFi CLI toolkit (NiFi 1.11.4) to set a parameter as shown here. However, I can't see any effect. What am I doing wrong?

    #> nifi get-param-context -u http://localhost:8080 -pcid 828f0882-0177-1000-aabd-d680a070c5c5

    #   Name            Value           Sensitive   Description
    -   -------------   -------------   ---------   -----------
    1   sample.Passwd   ********        true
    2   sample.Schema   SAMPLE001       false
    3   sample.URL      jdbc://sample   false
    4   sample.User     dbsample        false

    #> nifi set-param -u http://localhost:8080 -pcid 828f0882-0177-1000-aabd-d680a070c5c5 -pn sample.URL newvalue
    Waiting for update request to complete...

    #> nifi get-param-context -u http://localhost:8080 -pcid 828f0882-0177-1000-aabd-d680a070c5c5

    #   Name            Value           Sensitive   Description
    -   -------------   -------------   ---------   -----------
    1   sample.Passwd   ********        true
    2   sample.Schema   SAMPLE001       false
    3   sample.URL      jdbc://sample   false
    4   sample.User     dbsample        false

Best regards Jaro
Labels: Apache NiFi
02-07-2021
11:55 AM
Hello, I have a question regarding the concept of parameter contexts. The admin guide says: "A Process Group can only be assigned one Parameter Context, while a given Parameter Context can be assigned to multiple Process Groups." OK, let's assume you have an internal Kafka or database, and some external system you connect to. You might want to have one parameter "group" for the internal Kafka connection, and another parameter context for each external system connection. However, your flow, clearly part of a process group, reads data from the external system and stores data internally. Thus you actually need 2 connections in the flow, which means 2 parameter contexts, which is not possible. To solve that, you assign the connections (controller services) and their parameters to the parent group. Finally, for connections which are common to many flows, like your Kafka, databases, and HttpSSL, you land in the parent NiFi Flow, with a parameter context which contains a bunch of pretty unrelated parameters (for Kafka, databases, ..). As I see it, there is hardly any possibility to reuse parameter contexts, as they cannot be combined, and thus you have a 1:1 relationship between parameter context and group, just like for variables assigned to a process group. Did I miss something? What is actually the big advantage of parameters over variables? Best regards Jaro
Labels: Apache NiFi
12-04-2020
05:09 AM
Hello, does the nipyapi package work with Windows 10 (64-bit)? I have tried to install nipyapi on two machines, one with WinPython (manual installation of downloaded packages) and another with Anaconda (installation with pip). Both failed, obviously due to a number of missing obscure dependencies like libxml2. I have only a little knowledge, but the installation looked like a tough job under Windows 10. Is nipyapi recommended at all, or are there better alternatives? Best regards Jaro
Labels: Apache NiFi
09-24-2020
12:59 PM
Hi @MattWho, we have actually already implemented prototypes of such custom processors, and the basic principles are clear. By processor logic, I did not mean the structure of the processor, like relationships. The structure is static in our case. By processor logic, I mean the code which is called in "onTrigger" in "StreamCallback.process". In my case, the logic can be parametrized by some configuration data (this configuration data is provided by a service or database tables). The configuration data is static during the whole processor run and must be provided during instantiation of the business logic object. You might see it as a lookup which must not happen during the processing in "onTrigger", but in "onScheduled" (sorry, I wrote "init" previously, that was not right).
Best regards Jaro
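The pattern described above (look the configuration up once when the processor is scheduled, then reuse the resulting logic object on every trigger) can be sketched independently of the NiFi API. Real NiFi processors are written in Java. The Python classes below are hypothetical stand-ins that only mimic the two lifecycle phases.

```python
from typing import Callable, Dict, Optional


class ConfiguredTransformer:
    """Hypothetical stand-in for the business logic object."""

    def __init__(self, config: Dict[str, str]) -> None:
        # The configuration is read once here and never consulted again.
        self._suffix = config.get("suffix", "")

    def apply(self, content: str) -> str:
        return content + self._suffix


class ProcessorSketch:
    """Mimics the scheduled and trigger phases; this is not the NiFi API."""

    def __init__(self, load_config: Callable[[], Dict[str, str]]) -> None:
        self._load_config = load_config
        self._logic: Optional[ConfiguredTransformer] = None

    def on_scheduled(self) -> None:
        # One lookup per start; the result stays fixed for the whole run.
        self._logic = ConfiguredTransformer(self._load_config())

    def on_trigger(self, content: str) -> str:
        # No lookup here, only the logic object built at schedule time.
        if self._logic is None:
            raise RuntimeError("on_scheduled must run before on_trigger")
        return self._logic.apply(content)
```

Stopping and restarting the processor would run the scheduled phase again, which is the point at which changed configuration data would be picked up.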