<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: HDF be used to feed Logstash? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120871#M26455</link>
    <description>&lt;P&gt;Thanks people. That is very helpful. It sounds like I have some learning to do &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 28 Apr 2016 21:26:04 GMT</pubDate>
    <dc:creator>alexmc</dc:creator>
    <dc:date>2016-04-28T21:26:04Z</dc:date>
    <item>
      <title>HDF be used to feed Logstash?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120868#M26452</link>
      <description>&lt;P&gt;I am thinking about setting up a Logstash infrastructure to monitor my system. (It happens to be Hortonworks HDP Hadoop cluster, but assume it isn't). So I have various things which generate logs and I want to transfer these logs outside my system to a new system - such as ElasticSearch inside Logstash. And I want to do this securely. &lt;/P&gt;&lt;P&gt;I don't really want Flume for this as there are better tools. &lt;/P&gt;&lt;P&gt;Now I might use Logstash forwarders - which most recently seems to be a new system called "Beats" - in particular FileBeat. However I would prefer to use Apache NiFi because of its security reputation. I would like to use HDF as I am a Hortonworks Partner and we are already using HDP. &lt;/P&gt;&lt;P&gt;Can anyone say:&lt;/P&gt;&lt;P&gt;"Yes this makes sense", "Yes, I have done it", "You need to read URL blah blah blah"?&lt;/P&gt;&lt;P&gt;Or have I got the wrong end of the stick?&lt;/P&gt;&lt;P&gt;PS I know that Ambari Metrics moves operational logs from the Hadoop cluster into the HDFS system - this is separate from that.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 12:31:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120868#M26452</guid>
      <dc:creator>alexmc</dc:creator>
      <dc:date>2026-04-21T12:31:52Z</dc:date>
    </item>
    <item>
      <title>Re: HDF be used to feed Logstash?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120869#M26453</link>
      <description>&lt;P&gt;Advantage of using HDF here is that you can do any preprocessing/filtering on your logs before you put into ElasticSearch. This is one of the common usecase where logs are preprocessed before putting into a system like Splunk/Logstash. &lt;/P&gt;</description>
      <pubDate>Thu, 28 Apr 2016 19:40:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120869#M26453</guid>
      <dc:creator>ravi1</dc:creator>
      <dc:date>2016-04-28T19:40:17Z</dc:date>
    </item>
    <item>
      <title>Re: HDF be used to feed Logstash?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120870#M26454</link>
      <description>&lt;P&gt;Alex – This makes sense to me.  If you’re tailing files or listening to
syslog, NiFi has great easy processors that can easily forward this information
to a downstream search tool (SOLR, ES, etc) or even persist the information in
long term storage (HDFS).  You can
encrypt and compress the data as you capture it, send it over a secure wire and
do the simple event processing you need in order to route the information to
the appropriate end point.  There are
also some processors such as ScanContent or RouteContent that can allow you to
route message based on patterns (think Regex and Whitelists) that you find in the
message payload (i.e. route errors here, info there) or create priorities for
those messages.&lt;/P&gt;&lt;P&gt;The other place where NiFi helps tremendously is around data
conversion.  For example,  convert AvroToJSON or CSVtoAvro or
AttributesToJSON.  These help you get
messages into the proper files streams to be indexed by your search tool.&lt;/P&gt;&lt;P&gt;The one place I would look at closely is the amount of log
parsing you need to do.  For unique
formats, you may need to create a custom processor in NiFi to assist you with
extracting log attributes.  There are
processors such as EvaluateXPath or EvaluateXQuery that allow you to use XPath
to find attribute value pair information in XML and JSON which is extremely helpful
and may be all you need.  Otherwise, it’s
really easy to get started and play around with your use case to see if there’s
a fit.&lt;/P&gt;</description>
      <pubDate>Thu, 28 Apr 2016 19:54:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120870#M26454</guid>
      <dc:creator>ccasano</dc:creator>
      <dc:date>2016-04-28T19:54:01Z</dc:date>
    </item>
    <item>
      <title>Re: HDF be used to feed Logstash?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120871#M26455</link>
      <description>&lt;P&gt;Thanks people. That is very helpful. It sounds like I have some learning to do &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 28 Apr 2016 21:26:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120871#M26455</guid>
      <dc:creator>alexmc</dc:creator>
      <dc:date>2016-04-28T21:26:04Z</dc:date>
    </item>
    <item>
      <title>Re: HDF be used to feed Logstash?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120872#M26456</link>
      <description>&lt;P&gt;Thanks. That is helpful&lt;/P&gt;</description>
      <pubDate>Thu, 28 Apr 2016 21:26:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDF-be-used-to-feed-Logstash/m-p/120872#M26456</guid>
      <dc:creator>alexmc</dc:creator>
      <dc:date>2016-04-28T21:26:43Z</dc:date>
    </item>
  </channel>
</rss>

