Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

How to ingest HDFS data into Kafka

avatar
Expert Contributor

Is there any better way to pull HDFS data into Kafka? I could use a flume exec source and kafka sink, but any alternative?

1 ACCEPTED SOLUTION

avatar
Master Guru

You could use Apache NiFi with a ListHDFS + FetchHDFS processor followed by PublishKafka.

View solution in original post

2 REPLIES 2

avatar
Master Guru

You could use Apache NiFi with a ListHDFS + FetchHDFS processor followed by PublishKafka.

avatar
Contributor

Can you help to provide more insight on how we can able to achieve this?

 

Thank you

Srinu