Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

what is the best possible way of creating data ingestion pipeline for image/pdf processing using NiFi

Highlighted

what is the best possible way of creating data ingestion pipeline for image/pdf processing using NiFi

New Contributor

Hi Experts,

I am new to NiFi, I wanted to explore NiFi for creating a data ingestion pipeline from source system to HDFS for images/pdf files present in the source system.

What is the best possible approach for achieving this using Nifi?

1) I want to ingest images/pdf from source system to HDFS

2) I also want to maintain metadata of those images/pdf in any database like hive.

3) What if image/pdf size is less than block size 128MB? What is the best practice in that case?

Any quick reference would be appreciated.

Regards,

Bhupesh