Member since
09-29-2015
871
Posts
723
Kudos Received
255
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
4058 | 12-03-2018 02:26 PM | |
3040 | 10-16-2018 01:37 PM | |
4176 | 10-03-2018 06:34 PM | |
3021 | 09-05-2018 07:44 PM | |
2288 | 09-05-2018 07:31 PM |
06-13-2018
02:25 PM
1 Kudo
ListSFTP extends ListFileTransfer and that class has performListing() which does: final FileTransfer transfer = getFileTransfer(context); final List<FileInfo> listing; try { listing = transfer.getListing(); } finally { IOUtils.closeQuietly(transfer); } The FileTransfer here is the SFTPTransfer, so it is being closed in a finally block every time.
... View more
06-12-2018
05:56 PM
Can you open Chrome Dev tools and see what is shown ? any javascript errors in the console? any interesting response codes for any requests on the network tab?
... View more
06-07-2018
05:09 PM
I don't know that much about the internals of NiFi's metrics, but when you use the REST API it should be making a federated request to all the nodes, which is different from a reporting task which should only have the metrics for the given node it is running on.
... View more
06-05-2018
07:05 PM
Your getLogger.info(...) statement is inside a for loop over "statuses" and I don't see a variable called statuses so it is hard to say what is in the variable, but if that is an empty collection then you might not be executing your for loop. I would try putting a logging statement right at the beginning of onTrigger to see if the reporting task is even executed.
... View more
06-05-2018
12:39 PM
1 Kudo
Yes, to be completely safe you should stop NiFi Registry, copy the database directory and flow storage directory somewhere, then start back up.
... View more
06-01-2018
02:28 PM
1 Kudo
You can change the Run Schedule on the scheduling tab of the processor config.
... View more
05-31-2018
01:52 PM
The recommended approach would be to use NiFi Registry to promote your flow between Dev and Prod. When using NiFi Registry you should not need to use a variable for sensitive properties, you enter the value as normal in Dev, then save to registry which removes the values, then import to prod and enter the prod password one time. After that, the value in each environment will be retained across upgrades, for example if you go back to dev and make a change and save a new version to registry, then upgrade the flow in prod, you won't have to enter the prod password again.
... View more
05-30-2018
08:01 PM
1 Kudo
The Jetty server is running separately and creating flow files from the POST requests and then placing the flow files in an internal list/queue, and then when the processor executes it takes a flow file out of the internal queue and transfers it to the success relationship. So the number of concurrent tasks is the number of threads processing the internal queue, but there is only ever one Jetty server per processor.
... View more
05-30-2018
07:51 PM
3 Kudos
ListenHTTP creates an embedded Jetty server with a thread pool of 200 threads, so presumably a single ListenHttp could handle whatever you could do with a single Jetty instance with 200 threads. To scale out you would have a NiFi cluster with a ListenHTTP on each node and a load balancer in front and you'd have your clients POST to the load balancer URL.
... View more
05-30-2018
07:45 PM
1 Kudo
HBase is actually not a JSON store... the row id, column family, column qualifiers, and values are all stored as byte[] so it can be whatever you want. In NiFi there is PutHBaseJson which is one way of using NiFi to get data into HBase because it is a common way to represents the columns of a row as key/value pairs in JSON, but it is NiFi that is choosing to use JSON here, not HBase. There is another processor PutHBaseCell which writes the byte contents of a flow file to a cell value in HBase, so this would make sense if you had a GetFile pick up a PDF from a directory and then wanted to store the PDF as the value of a cell in HBase. Solr and ES are both text-indexing systems so they aren't really made to store binary data, although I believe they do have a a binary field type. Most likely you would use them to index the text content of the PDF which would be extracted with something like Tika, an example of that would be this: https://community.hortonworks.com/articles/42210/using-solrs-extracting-request-handler-with-apache.html There is a pull request up for an issue that might be related to the ArrayIndexOutOfBounds exceptions... https://github.com/apache/nifi/pull/2718
... View more