Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Flume Tutorials

avatar
Contributor

Are there any good walkthrough tutorials for Flume? I've seen the two listed here. However, after skimming through the second one "Analyzing Social Media and Customer Sentiment," I fail to see any use or reference of Flume within it. I would specifically like something that walks through performance of the two Flume objectives documented in the HDP Certified Developer Exam Objectives sheet:

  1. Given a Flume configuration file, start a Flume agent
  2. Given a configured sink and source, configure a Flume memory channel with a specified capacity https

The 1st tutorial from the link above starts a Flume agent via Ambari, but I assume the Exam will require this to be done via the Terminal.

1 ACCEPTED SOLUTION

avatar
Master Mentor
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login
8 REPLIES 8

avatar
Master Mentor

avatar
Contributor

@Neeraj Sabharwal Thanks! I hadn't ran across this one yet.

avatar
Rising Star

Hi Daniel,

Were you able to run this flume example? Because I am trying. What would be the values if the sink is hdfs and not elasticsearch? any idea

Thank you.

avatar
Master Mentor

avatar
Master Mentor
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login

avatar
Contributor

@Artem Ervits Thanks, this is very helpful.

avatar
Guru

Hi @Daniel Hendrix

You are correct - you should be able to start a Flume agent from the command line. The docs show how to do this:

https://flume.apache.org/FlumeUserGuide.html#starting-an-agent

You also need to know how to configure a memory channel, which is also demonstrated in the docs:

https://flume.apache.org/FlumeUserGuide.html#memory-channel

I would recommend going through the same tutorial that you found here:

http://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-server-log-data/

Within that tutorial, configure a memory channel and try starting it from the command line. Let me know if you have any issues along the way and I'll be glad to assist.

Thanks,

Rich Raposa

Certification Manager

avatar
Master Mentor

@rich This is perfect for an article..Thanks , Rich! @Daniel Hendrix