28732
DISCUSSIONS
101745
MEMBERS
3157
ARTICLES
Created 12-02-2014 12:39 PM
We got the data ingestion of raw apache access logs through Flume to HDFS. I'm looking for ways to parse the logs for various fields like timestamp, ip, query params, etc. and load the data into appropriate Hive/Impala tables.
Can all this be done as part of Flume?
Thanks!