I would like to know, How to load .csv file from spool dir into HDFS using Flume and read its contains?
Can anyone help me on this?
here's an example, file type doesn't matter as everything is bytes. You can the ingest csv with Hive, pig or spark. http://www.lampdev.org/programming/hadoop/apache-flume-spooldir-sink-tutorial.html
View solution in original post