<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to run Flume in HA ? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95447#M8796</link>
    <description>&lt;P&gt;I don't think there is HA in Flume. If you are worried about losing events because of Flume Agent going down you can use the &lt;A href="https://flume.apache.org/FlumeUserGuide.html#file-channel"&gt;File Channel&lt;/A&gt; which uses checkpointing. This makes sure that no events are lost while the Flume Agent is down and can begin to send event to sink from where it left off.&lt;/P&gt;&lt;P&gt;In case you are worried about the destination sink your agent is writing to going down then you can use the &lt;A href="https://flume.apache.org/FlumeUserGuide.html#failover-sink-processor"&gt;Failover Sink Processor&lt;/A&gt;.&lt;/P&gt;</description>
    <pubDate>Thu, 15 Oct 2015 03:08:41 GMT</pubDate>
    <dc:creator>deepesh1</dc:creator>
    <dc:date>2015-10-15T03:08:41Z</dc:date>
    <item>
      <title>How to run Flume in HA ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95446#M8795</link>
      <description>&lt;P&gt;Would like know if there is a way to  run flume in HA mode.&lt;/P&gt;</description>
      <pubDate>Thu, 15 Oct 2015 03:04:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95446#M8795</guid>
      <dc:creator>Jagatheeshr</dc:creator>
      <dc:date>2015-10-15T03:04:41Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Flume in HA ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95447#M8796</link>
      <description>&lt;P&gt;I don't think there is HA in Flume. If you are worried about losing events because of Flume Agent going down you can use the &lt;A href="https://flume.apache.org/FlumeUserGuide.html#file-channel"&gt;File Channel&lt;/A&gt; which uses checkpointing. This makes sure that no events are lost while the Flume Agent is down and can begin to send event to sink from where it left off.&lt;/P&gt;&lt;P&gt;In case you are worried about the destination sink your agent is writing to going down then you can use the &lt;A href="https://flume.apache.org/FlumeUserGuide.html#failover-sink-processor"&gt;Failover Sink Processor&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Thu, 15 Oct 2015 03:08:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95447#M8796</guid>
      <dc:creator>deepesh1</dc:creator>
      <dc:date>2015-10-15T03:08:41Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Flume in HA ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95448#M8797</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/222/deepesh.html" nodeid="222"&gt;@Deepesh&lt;/A&gt;. File Channel would solve the Data Loss problem and failover sink processer address the issue with the sink failure rather than the flume failure.&lt;/P&gt;&lt;P&gt;What if the flume agent on a node gets killed and as a result there is no message is passed to the sink. Wouldn't it be a good idea to have another Flume agent registered in a Zookeeper to periodically check if the other flume agent is alive,if ever it  dies then this can start piping the data to the sink.&lt;/P&gt;</description>
      <pubDate>Thu, 15 Oct 2015 05:01:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95448#M8797</guid>
      <dc:creator>Jagatheeshr</dc:creator>
      <dc:date>2015-10-15T05:01:18Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Flume in HA ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95449#M8798</link>
      <description>&lt;P&gt;Its hard to give a generic answer on how to achieve high availability without knowing the topology the data and form of ingestion and where and how it is written in destination. In many cases if the data at source is available even if the agent gets killed, upon restarting the agent the checkpointing on the file channel will let the agent recover from the point where it failed. Sometimes topology has multiple Flume agents started for availability, ofcourse there will be issue with data redundancy but thats fine in some cases.&lt;/P&gt;</description>
      <pubDate>Thu, 15 Oct 2015 07:53:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95449#M8798</guid>
      <dc:creator>deepesh1</dc:creator>
      <dc:date>2015-10-15T07:53:45Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Flume in HA ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95450#M8799</link>
      <description>&lt;P&gt;Here is some of the key points to use Flume in "HA"&lt;/P&gt;&lt;P&gt;1. Setup File Channels instead of Memory Channels (using a RAID array is very paranoid but possible) on any Flume agent in use&lt;/P&gt;&lt;P&gt;2. Create a nanny process/script to watch for flume agent failures and restart immediately&lt;/P&gt;&lt;P&gt;3. Put the Flume agent collector/aggregation/2nd tier behind a network load balancer and use a VIP. This also has the benefit for balancing load for high ingest&lt;/P&gt;&lt;P&gt;4. Optionally have a sink that dumps to cycling files (separate from the drive the File Channel operates on) on the local drives in addition to a sink that forwards it on the next flume node or directly to HDFS. At least then you have the time it takes to fill a drive to correct any major issues and recover lost ingest streams.&lt;/P&gt;&lt;P&gt;5. Use the built in JMX counters in Flume to setup alerts in your favorite Operations Center application&lt;/P&gt;</description>
      <pubDate>Wed, 28 Oct 2015 23:58:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95450#M8799</guid>
      <dc:creator>orenault</dc:creator>
      <dc:date>2015-10-28T23:58:48Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Flume in HA ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95451#M8800</link>
      <description>&lt;P&gt;Using ambari for high availability setup for flume ,is there any complete  step by step documentation installation instructions somewhere i can read . Please let me know the link . Thanks once again&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jun 2016 11:42:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Flume-in-HA/m-p/95451#M8800</guid>
      <dc:creator>samkt99</dc:creator>
      <dc:date>2016-06-20T11:42:02Z</dc:date>
    </item>
  </channel>
</rss>

