I want to load data(~300 GB) from local filesystem to HDFS. And i will be doing similar activity once every month.
What would be the feasible way to get this done. I am looking at Flume & HDFS Put options.
These are some files (XML) and not log data. I dont need any conversion, its a straight copy to HDFS.