<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: what are the different sources used in real-time to import log files through Apache Flume ? how is the real-time data injection on a daily basis ? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227431#M189291</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/29170/moorej.html" nodeid="29170"&gt;@Jordan Moore&lt;/A&gt; Thanks for the suggestion.&lt;/P&gt;&lt;P&gt;Can you please let me know how logs from different servers are collected in real-time projects? If you know of any links, please share them.&lt;/P&gt;</description>
    <pubDate>Thu, 14 Dec 2017 20:46:11 GMT</pubDate>
    <dc:creator>rakesh_an1992</dc:creator>
    <dc:date>2017-12-14T20:46:11Z</dc:date>
    <item>
      <title>what are the different sources used in real-time to import log files through Apache Flume ? how is the real-time data injection on a daily basis ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227429#M189289</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I want to understand how Flume is used to stream log files in real time. I have practiced importing files through the 'exec' source, but I want to know which other sources are used for Flume streaming in real-time projects.&lt;/P&gt;&lt;P&gt;Please help me clear up this doubt.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Tue, 12 Dec 2017 18:50:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227429#M189289</guid>
      <dc:creator>rakesh_an1992</dc:creator>
      <dc:date>2017-12-12T18:50:53Z</dc:date>
    </item>
    <item>
      <title>Re: what are the different sources used in real-time to import log files through Apache Flume ? how is the real-time data injection on a daily basis ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227430#M189290</link>
      <description>&lt;P&gt;You should read the warning in the ExecSource docs &lt;EM&gt;against&lt;/EM&gt; using tail -f.&lt;/P&gt;&lt;P&gt;&lt;A href="https://flume.apache.org/FlumeUserGuide.html#exec-source" target="_blank"&gt;https://flume.apache.org/FlumeUserGuide.html#exec-source&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It even lists the other sources to consider using instead: "Spooling Directory Source, Taildir Source or direct integration with Flume via the SDK."&lt;/P&gt;&lt;P&gt;Personally, I like tools such as Filebeat or Fluentd for real-time collection of logs, sending them to either Elasticsearch or Solr, since those provide better tooling for log inspection.&lt;/P&gt;</description>
      <pubDate>Wed, 13 Dec 2017 03:26:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227430#M189290</guid>
      <dc:creator>JordanMoore</dc:creator>
      <dc:date>2017-12-13T03:26:54Z</dc:date>
    </item>
    <item>
      <title>Re: what are the different sources used in real-time to import log files through Apache Flume ? how is the real-time data injection on a daily basis ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227431#M189291</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/29170/moorej.html" nodeid="29170"&gt;@Jordan Moore&lt;/A&gt; Thanks for the suggestion.&lt;/P&gt;&lt;P&gt;Can you please let me know how logs from different servers are collected in real-time projects? If you know of any links, please share them.&lt;/P&gt;</description>
      <pubDate>Thu, 14 Dec 2017 20:46:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227431#M189291</guid>
      <dc:creator>rakesh_an1992</dc:creator>
      <dc:date>2017-12-14T20:46:11Z</dc:date>
    </item>
    <item>
      <title>Re: what are the different sources used in real-time to import log files through Apache Flume ? how is the real-time data injection on a daily basis ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227432#M189292</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/47553/rakeshan1992.html" nodeid="47553"&gt;@Rakesh AN&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;I have not used Flume in a distributed fashion, but whichever agent you choose, it tails the logs on the server where it runs, then ships them to the configured sink destinations. Running one agent per server is how logs get collected from different servers. Flume is &lt;STRONG&gt;near&lt;/STRONG&gt; real-time, since it is configured with a batch size.&lt;/P&gt;&lt;P&gt;It's not clear what doubt you have. Can you please explain how you've configured your Flume agents and what issues you are experiencing?&lt;/P&gt;&lt;P&gt;The Flume documentation is fairly straightforward.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Dec 2017 00:32:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/what-are-the-different-sources-used-in-real-time-to-import/m-p/227432#M189292</guid>
      <dc:creator>JordanMoore</dc:creator>
      <dc:date>2017-12-15T00:32:45Z</dc:date>
    </item>
  </channel>
</rss>

