Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

How to load .csv file and read it in Flume?

avatar

Hi,

I would like to know, How to load .csv file from spool dir into HDFS using Flume and read its contains?

Can anyone help me on this?

1 ACCEPTED SOLUTION

avatar
Master Mentor

here's an example, file type doesn't matter as everything is bytes. You can the ingest csv with Hive, pig or spark. http://www.lampdev.org/programming/hadoop/apache-flume-spooldir-sink-tutorial.html

View solution in original post

1 REPLY 1

avatar
Master Mentor

here's an example, file type doesn't matter as everything is bytes. You can the ingest csv with Hive, pig or spark. http://www.lampdev.org/programming/hadoop/apache-flume-spooldir-sink-tutorial.html