1973 Posts
1225 Kudos Received
124 Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1913 | 04-03-2024 06:39 AM |
| | 3010 | 01-12-2024 08:19 AM |
| | 1642 | 12-07-2023 01:49 PM |
| | 2419 | 08-02-2023 07:30 AM |
| | 3357 | 03-29-2023 01:22 PM |
03-13-2018
02:36 PM
That may lock down more things. Maybe you need a stronger cert instead of defaults.
03-12-2018
04:03 PM
Sounds good. Hopefully someone from engineering will help you run this down. I am trying to think of what other logs might be useful to post so we can spot something odd.
03-12-2018
01:37 PM
For now, you can use this NiFi flow to do schema registry stuff: https://community.hortonworks.com/articles/177349/big-data-devops-apache-nifi-hwx-schema-registry-sc.html
03-12-2018
12:27 PM
Building from the latest source is usually a good way to rule out odd problems, but this setup is pretty standard and should work. Engineering will post some more information. Have you tested your certificates? https://www.cyberciti.biz/faq/test-ssl-certificates-diagnosis-ssl-certificate/ Was your MiNiFi C++ built with SSL enabled? Run curl -V to see whether your curl has SSL support.
03-12-2018
02:30 AM
Is there a firewall? Is something else running on that port? Is SSL fully installed? What user is running MiNiFi? Can you run it with sudo or logged in as root? Check here: https://lists.apache.org/thread.html/%3CCAEzjzLm4foJ4BGJpiignL8VLiu5Gw-OXoKTyR6EpVnoohT6tZQ@mail.gmail.com%3E Check this one too: https://community.hortonworks.com/questions/54993/remote-instance-of-nifi-is-not-configured-to-allow.html This seems resolved: https://github.com/apache/nifi-minifi-cpp/pull/263 Have you read this free book: http://discover.attunity.com/apache-nifi-for-dummies-en-report-go-c-lp8558.html?utm_source=google&utm_medium=cpc&utm_term=apache%20nifi&utm_campaign=us-reach&utm_content=pg2178&gclid=CjwKCAiAxJPVBRB4EiwAsCA4aVYkFt07MvcgENor1Fovb_vpsd489dAGhhl-CtHgj_GYjpQuTDzQxhoC08gQAvD_BwE
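To answer the firewall and port questions quickly, here is a minimal sketch, assuming Python 3 is available on the machine running MiNiFi. The function name port_is_open and the host and port in the usage note are hypothetical placeholders, not values from this thread. It only shows whether a TCP connection can be opened from where it runs; it does not tell you why a connection fails.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    A False result can mean a firewall, nothing listening on the port,
    a DNS failure, or a timeout; this sketch does not distinguish them.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution errors.
        return False
```

For example, port_is_open("nifi.example.com", 10000) would check a hypothetical NiFi host and site-to-site port; substitute the values from your own configuration.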
03-12-2018
02:19 AM
The update: https://community.hortonworks.com/articles/177370/extracting-html-from-pdf-excel-and-word-documents.html
03-12-2018
02:19 AM
https://community.hortonworks.com/articles/177370/extracting-html-from-pdf-excel-and-word-documents.html
03-12-2018
02:19 AM
Try the new one: https://community.hortonworks.com/articles/177370/extracting-html-from-pdf-excel-and-word-documents.html Make sure you download the NAR from GitHub, put it in the NiFi lib directory, and then restart. Make sure you are running NiFi with JDK 8.
03-11-2018
03:02 PM
2 Kudos
Extracting Text or HTML from PDF, Excel and Word Documents via Apache NiFi

This version has been tested with HDF 3.1 and Apache NiFi 1.5. The processor uses Apache Tika 1.17 and is an unsupported open source community processor that I wrote. A user posted asking about HTML output; I took a look, it was easy, and so I added an option for it.

Apache NiFi Flow

You must download or build the nifi-extracttextprocessor NAR and put it in your lib directory; then you can add the processor.

Select html or text.

Here is the autogenerated documentation:

You can see we set the output mime.type to text/html.

Apache NiFi Example Flow to Read a File and Convert to HTML

Source and JUnit in Eclipse

Example Output HTML

<html xmlns="http://www.w3.org/1999/xhtml">
<head><meta name="pdf:PDFVersion" content="1.3"/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.pdf.PDFParser"/>
<meta name="xmp:CreatorTool" content="Rave (http://www.nevrona.com/rave)"/>
<meta name="access_permission:modify_annotations" content="true"/>
<meta name="access_permission:can_print_degraded" content="true"/>
<meta name="meta:creation-date" content="2006-03-01T07:28:26Z"/>
<meta name="created" content="Wed Mar 01 02:28:26 EST 2006"/>
<meta name="access_permission:extract_for_accessibility" content="true"/><meta name="access_permission:assemble_document" content="true"/><meta name="xmpTPg:NPages" content="2"/><meta name="Creation-Date" content="2006-03-01T07:28:26Z"/><meta name="dcterms:created" content="2006-03-01T07:28:26Z"/><meta name="dc:format" content="application/pdf; version=1.3"/><meta name="access_permission:extract_content" content="true"/><meta name="access_permission:can_print" content="true"/><meta name="pdf:docinfo:creator_tool" content="Rave (http://www.nevrona.com/rave)"/><meta name="access_permission:fill_in_form" content="true"/><meta name="pdf:encrypted" content="false"/><meta name="producer" content="Nevrona Designs"/><meta name="access_permission:can_modify" content="true"/><meta name="pdf:docinfo:producer" content="Nevrona Designs"/><meta name="pdf:docinfo:created" content="2006-03-01T07:28:26Z"/>
<meta name="Content-Type" content="application/pdf"/>
<title></title></head>
<body>
<div class="page"><p/><p>
A Simple PDF File
This is a small demonstration .pdf file -</p><p> just for use in the Virtual Mechanics tutorials. More text. And moretext. And more text. And more text. And more text.
</p><p> And more text. And more text. And more text. And more text. And moretext. And more text. Boring, zzzzz. And more text. And more text. Andmore text. And more text. And more text. And more text. And more text.And more text. And more text.</p><p> And more text. And more text. And more text. And more text. And moretext. And more text. And more text. Even more. Continued on page 2 ...</p><p/></div>
<div class="page"><p/><p>
Simple PDF File 2...continued from page 1. Yet more text. And more text. And more text.And more text. And more text. And more text. And more text. And moretext. Oh, how boring typing this stuff. But not as boring as watching paint dry. And more text. And more text. And more text. And more text.Boring. More, a little more text. The end, and just as well.
</p><p/></div></body></html>

Source Code: https://github.com/tspannhw/nifi-extracttext-processor

NAR Release: https://github.com/tspannhw/nifi-extracttext-processor/releases/tag/html

Resources:
See Part 1: https://community.hortonworks.com/articles/81694/extracttext-nifi-custom-processor-powered-by-apach.html
https://community.hortonworks.com/articles/76924/data-processing-pipeline-parsing-pdfs-and-identify.html
https://community.hortonworks.com/articles/163776/parsing-any-document-with-apache-nifi-15-with-apac.html
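If you want to work with the HTML output downstream, here is a minimal sketch in Python that reads output shaped like the example above. It is not part of the processor; the class name TikaHtmlSummary and the function summarize are hypothetical names made up for this illustration. It assumes the output keeps its metadata in meta name/content tags and wraps each page in a div with class "page", as the sample does.

```python
from html.parser import HTMLParser


class TikaHtmlSummary(HTMLParser):
    """Collect <meta> name/content pairs and per-page text from Tika-style XHTML."""

    def __init__(self):
        super().__init__()
        self.meta = {}     # meta name -> list of content values (names can repeat)
        self.pages = []    # one whitespace-normalized string per page div
        self._depth = 0    # greater than 0 while inside a page div
        self._chunks = []  # raw text pieces of the page being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and "name" in attrs:
            self.meta.setdefault(attrs["name"], []).append(attrs.get("content", ""))
        elif tag == "div":
            # Start counting at a page div; keep counting for nested divs.
            if self._depth or attrs.get("class") == "page":
                self._depth += 1

    def handle_endtag(self, tag):
        if tag == "div" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                # Page finished: collapse runs of whitespace and store the text.
                self.pages.append(" ".join("".join(self._chunks).split()))
                self._chunks = []

    def handle_data(self, data):
        if self._depth:
            self._chunks.append(data)


def summarize(html_text):
    """Return (meta, pages) for a string of Tika-style XHTML."""
    parser = TikaHtmlSummary()
    parser.feed(html_text)
    parser.close()
    return parser.meta, parser.pages
```

Calling summarize() on the flow file content returns the metadata as a dictionary of lists (a name such as X-Parsed-By can appear more than once) and the page text as a list, so you can compare the number of pages found against the xmpTPg:NPages value.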
03-11-2018
12:43 PM
See: https://community.hortonworks.com/articles/85549/how-to-secure-minifi-c-site-to-site.html