What is the best way to create a data ingestion pipeline for image/PDF processing using NiFi?

Hi Experts,

I am new to NiFi and would like to use it to build a data ingestion pipeline that moves image and PDF files from a source system into HDFS.

What is the best approach for achieving this with NiFi?

1) I want to ingest image/PDF files from the source system into HDFS.

2) I also want to maintain the metadata of those image/PDF files in a database such as Hive.

3) What if an image/PDF file is smaller than the HDFS block size (128 MB)? What is the best practice in that case?
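
To make the concern in point 3 concrete, here is a rough sketch of the arithmetic as I understand it. The workload (10,000 files of 2 MB each) is a hypothetical example, not my real data, and the function names are my own. Packing small files into a larger container file (for example with NiFi's MergeContent processor) is only one option I have read about, not necessarily the recommended one.

```python
import math

BLOCK_SIZE = 128 * 1024 * 1024  # HDFS block size from the question (128 MB)

def block_count(file_sizes, block_size=BLOCK_SIZE):
    """Blocks tracked by HDFS when every file is stored separately.

    Each non-empty file needs at least one block, however small it is.
    """
    return sum(max(1, math.ceil(size / block_size)) for size in file_sizes)

def merged_block_count(file_sizes, block_size=BLOCK_SIZE):
    """Blocks needed if the same bytes were packed into one container file."""
    return max(1, math.ceil(sum(file_sizes) / block_size))

# Hypothetical workload: 10,000 scanned PDFs of 2 MB each.
sizes = [2 * 1024 * 1024] * 10_000
print(block_count(sizes))         # one block per small file
print(merged_block_count(sizes))  # far fewer blocks after packing
```

As far as I understand, a small file does not occupy a full 128 MB on disk, so my worry is mainly the number of files and blocks the NameNode has to track.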

Any quick reference would be appreciated.

Regards,

Bhupesh