Member since
07-16-2023
6
Posts
0
Kudos Received
0
Solutions
10-24-2024
12:26 AM
In PutSql Processor, there is a property "Support Fragmented Transactions". If set its value to be false, then it is proceeding and not penalising the flow files.
... View more
01-22-2024
12:30 PM
Do you have a sample? I'm not sure NiFi can do this natively, but I have recently done some PDF parsing inside NiFi with a custom Groovy processor.
... View more
11-13-2023
10:39 PM
We did this at our end and ended up re-cycling the provenance repository much faster than usual. The huge amount of data that an output of a tailfile generates can fill up both your content and provenance repositories.
... View more
10-30-2023
10:52 AM
@Madhav_VD Apache NiFi is data agnostic which makes it possible to ingest data of any format. It does this by wrapping the bytes of data in to a NiFi FlowFile. The only thing time the content (data bytes) are read from the FlowFile would be by a NiFi processor component that would need to do so. And only a processor specifically designed to handle the specific content type of the data would be able to do anything with the FlowFile content. That being said, NiFi had no Content conversion processor capable of reading FFmpeg content format and writing out TIFF content format. So the question here is how woudl you accomplish the format conversion outside of NiFi? Perhaps this thread would help you there: https://superuser.com/questions/881783/convert-from-avi-to-uncompressed-tiff-using-ffmpeg NiFi in this case could still be used to automate the ingestion and conversion of your FFmpeg files by utilizing perhaps one the NiFi Scripting processors like ExecuteGroovyScript or ExecuteScript (where you can create a custom script that has some dependency on client libraries capable of doing the conversion) or maybe the ExecuteStreamCommand or ExecuteProcess processor that could call a local command line command that you pass the FlowFile content to and returns a content stream with the new Tiff format? If you found any of the suggestions/solutions provided helped you with your issue, please take a moment to login and click "Accept as Solution" on one or more of them that helped. Thank you, Matt
... View more
08-11-2023
08:18 AM
1 Kudo
@Madhav_VD Apache NiFi contains no native processors that utilize Apache Tika other than IdentifyMimeType (this processor does not do any extraction), but you can find others in the Apache that have created custom processors that utilize Apache Tika. Adding custom nars to Apache NiFi is as easy as adding the custom nar to the auto-load directory: https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html#autoloading-processors While I have no experience with any of these custom nars, you can give them a try to see if they meet your needs. If not they may provide you with a stepping stone for creating your own custom variant. https://github.com/tspannhw/nifi-extracttext-processor/releases/tag/html https://community.cloudera.com/t5/Community-Articles/ExtractText-NiFi-Custom-Processor-Powered-by-Apache-Tika/ta-p/249392 https://community.cloudera.com/t5/Community-Articles/Creating-HTML-from-PDF-Excel-and-Word-Documents-using-Apache/ta-p/247968 https://github.com/tspannhw/nifi-extracttext-processor If you found that the provided solution(s) assisted you with your query, please take a moment to login and click Accept as Solution below each response that helped. Thank you, Matt
... View more