Welcome to the Cloudera Community

Log parsing and loading to Hive/Impala tables

We have raw Apache access logs being ingested into HDFS through Flume. I'm looking for ways to parse the logs into fields such as timestamp, IP address, and query parameters, and then load the data into appropriate Hive/Impala tables.
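
A minimal sketch of the kind of field extraction described here, assuming the logs use the Apache combined log format (the post does not say which LogFormat is configured). The names `LOG_PATTERN` and `parse_line` and the sample line are illustrative, not from the post, and the sketch runs as a standalone step rather than inside Flume.

```python
import re
from urllib.parse import urlsplit, parse_qs

# Assumption: Apache "combined" log format. A custom LogFormat needs a different pattern.
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<timestamp>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) (?P<protocol>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\S+)'
    r'(?: "(?P<referer>[^"]*)" "(?P<agent>[^"]*)")?'
)


def parse_line(line):
    """Return a dict of fields for one access log line, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    fields = match.groupdict()
    # Split the query string of the requested URL into a dict of parameter lists.
    fields["query_params"] = parse_qs(urlsplit(fields["url"]).query)
    return fields


# Hypothetical sample line in combined log format.
sample = (
    '127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
    '"GET /search?q=flume&page=2 HTTP/1.0" 200 2326 '
    '"http://www.example.com/start.html" "Mozilla/4.08"'
)

if __name__ == "__main__":
    print(parse_line(sample))
```

Each returned dict could then be written out as a delimited row for a table defined over the same columns; the column layout itself is left open here.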

Can all this be done as part of Flume?

Thanks!
