<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: need help with flume - morphline - solr pipeline, CDH5 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11620#M1674</link>
    <description>Interceptors are executed prior to Sinks.&lt;BR /&gt;&lt;BR /&gt;If the UUIDInterceptor does nothing it's probably misconfigured or attached to the wrong channel in flume.conf, or similar.&lt;BR /&gt;&lt;BR /&gt;Alternatively, consider replacing the UUIDInterceptor with a MorphlineInterceptor that uses the generateUUID command, or move the generateUUID command into the morphline config of the MorphlineSolrSink.&lt;BR /&gt;&lt;BR /&gt;Also see &lt;A target="_blank" href="http://kitesdk.org/docs/current/kite-morphlines/morphlinesReferenceGuide.html#/generateUUID"&gt;http://kitesdk.org/docs/current/kite-morphlines/morphlinesReferenceGuide.html#/generateUUID&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Wolfgang.&lt;BR /&gt;&lt;BR /&gt;</description>
    <pubDate>Thu, 01 May 2014 16:37:09 GMT</pubDate>
    <dc:creator>whosch</dc:creator>
    <dc:date>2014-05-01T16:37:09Z</dc:date>
    <item>
      <title>need help with flume - morphline - solr pipeline, CDH5 &amp; CM</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11560#M1666</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I wanted to build a pipeline of ingesting log data via flume into solr, sounds like nothing special....&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;But I get stuck at starting the flume agent (with exec source 'tail -f...'), its log tells me that it stops doing anything after "INFO org.kitesdk.morphline.api.MorphlineContext: Importing commands", because this is the last log entry before it repeatedly got restarted (every 30sec.).&lt;/P&gt;&lt;P&gt;If I remove the SolrSink from my flume config, the expected files are written to the HDFS sink, thereby the base workflow is fine.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For my testing I tried to use the Syslog-example provided in the SearchUserGuide (&lt;A target="_blank" href="http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-User-Guide/csug_morphline_example.html)."&gt;http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-User-Guide/csug_morphline_example.html).&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;One strange thing is, how to configure the grok-dictionaries in the morphlines.conf while using ClouderaManager for configuring the stuff?&lt;/P&gt;&lt;P&gt;The configuration itself is clear, the text area in "Flume-NG Solr Sink", but how to reference the grok-dictionaries? just "dictionaryFiles : [grok-dictionaries]" or some path prefixes ?!?!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;=========================&lt;/P&gt;&lt;P&gt;this is the log of the flume agent (while I am writing entries to the watched file, but nothing will be processed):&lt;/P&gt;&lt;P&gt;""&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;2014-04-30 15:42:37,285 INFO org.apache.flume.sink.hdfs.HDFSDataStream: Serializer = TEXT, UseRawLocalFileSystem = false&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,448 INFO org.apache.flume.node.PollingPropertiesFileConfigurationProvider: Configuration provider starting&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,493 INFO org.apache.flume.node.PollingPropertiesFileConfigurationProvider: Reloading configuration file:/var/run/cloudera-scm-agent/process/1027-flume-AGENT/flume.conf&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,506 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,507 INFO org.apache.flume.conf.FlumeConfiguration: Processing:solrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,507 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,508 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,508 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,508 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,509 INFO org.apache.flume.conf.FlumeConfiguration: Processing:solrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,509 INFO org.apache.flume.conf.FlumeConfiguration: Processing:solrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,510 INFO org.apache.flume.conf.FlumeConfiguration: Processing:solrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,510 INFO org.apache.flume.conf.FlumeConfiguration: Processing:solrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,510 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,511 INFO org.apache.flume.conf.FlumeConfiguration: Added sinks: HDFS solrSink Agent: agent&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,511 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,512 INFO org.apache.flume.conf.FlumeConfiguration: Processing:solrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,512 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,513 INFO org.apache.flume.conf.FlumeConfiguration: Processing:HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,561 INFO org.apache.flume.conf.FlumeConfiguration: Post-validation flume configuration contains configuration for agents: [agent]&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,562 INFO org.apache.flume.node.AbstractConfigurationProvider: Creating channels&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,580 INFO org.apache.flume.channel.DefaultChannelFactory: Creating instance of channel memoryChannel type memory&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,592 INFO org.apache.flume.node.AbstractConfigurationProvider: Created channel memoryChannel&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,594 INFO org.apache.flume.source.DefaultSourceFactory: Creating instance of source execSrc, type exec&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,609 INFO org.apache.flume.sink.DefaultSinkFactory: Creating instance of sink: solrSink, type: org.apache.flume.sink.solr.morphline.MorphlineSolrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:16,616 INFO org.apache.flume.sink.DefaultSinkFactory: Creating instance of sink: HDFS, type: hdfs&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,477 INFO org.apache.flume.sink.hdfs.HDFSEventSink: Hadoop Security enabled: false&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,481 INFO org.apache.flume.node.AbstractConfigurationProvider: Channel memoryChannel connected to [execSrc, solrSink, HDFS]&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,509 INFO org.apache.flume.node.Application: Starting new configuration:{ sourceRunners:{execsrc=EventDrivenSourceRunner: { source:org.apache.flume.source.ExecSource{name:execSrc,state:IDLE} }} sinkRunners:{HDFS=SinkRunner: { policy:org.apache.flume.sink.DefaultSinkProcessor@9a87fad counterGroup:{ name:null counters:{} } }, solrSink=SinkRunner: { policy:org.apache.flume.sink.DefaultSinkProcessor@15563bcf counterGroup:{ name:null counters:{} } }} channels:{memoryChannel=org.apache.flume.channel.MemoryChannel{name: memoryChannel}} }&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,521 INFO org.apache.flume.node.Application: Starting Channel memoryChannel&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,623 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Monitored counter group for type: CHANNEL, name: memoryChannel: Successfully registered new MBean.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,623 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Component type: CHANNEL, name: memoryChannel started&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,630 INFO org.apache.flume.node.Application: Starting Sink HDFS&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,632 INFO org.apache.flume.node.Application: Starting Sink solrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,632 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Monitored counter group for type: SINK, name: HDFS: Successfully registered new MBean.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,633 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Component type: SINK, name: HDFS started&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,633 INFO org.apache.flume.sink.solr.morphline.MorphlineSink: Starting Morphline Sink solrSink (MorphlineSolrSink) ...&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,633 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Monitored counter group for type: SINK, name: solrSink: Successfully registered new MBean.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,633 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Component type: SINK, name: solrSink started&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,634 INFO org.apache.flume.node.Application: Starting Source execSrc&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,637 INFO org.apache.flume.source.ExecSource: Exec source starting with command:tail -F /tmp/spooldir/test.txt&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,650 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Monitored counter group for type: SOURCE, name: execSrc: Successfully registered new MBean.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,650 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Component type: SOURCE, name: execSrc started&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,687 INFO org.mortbay.log: Logging to org.slf4j.impl.Log4jLoggerAdapter(org.mortbay.log) via org.mortbay.log.Slf4jLog&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,877 INFO org.mortbay.log: jetty-6.1.26&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:17,956 INFO org.mortbay.log: Started SelectChannelConnector@0.0.0.0:41414&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:44:18,134 INFO org.kitesdk.morphline.api.MorphlineContext: Importing commands&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-04-30 15:45:00,994 INFO org.apache.flume.sink.hdfs.HDFSDataStream: Serializer = TEXT, UseRawLocalFileSystem = false&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;""&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This log lines are written every ~30sec.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;=====flume config====&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;agent.sources = execSrc&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.channels = memoryChannel&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks = HDFS solrSink&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;agent.sources.execSrc.type = exec&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sources.execSrc.command = tail -F /tmp/spooldir/test.txt&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sources.execSrc.interceptors.uuidinterceptor.type = org.apache.flume.sink.solr.morphline.UUIDInterceptor$Builder&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sources.execSrc.interceptors.uuidinterceptor.headerName = id&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sources.execSrc.interceptors.uuidinterceptor.preserveExisting = false&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sources.execSrc.interceptors.uuidinterceptor.prefix = myhostname&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sources.execSrc.channels = memoryChannel&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;agent.channels.memoryChannel.type = memory&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.channels.memoryChannel.capacity = 10000&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.channels.memoryChannel.transactionCapacity = 1000&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;agent.sinks.solrSink.type = org.apache.flume.sink.solr.morphline.MorphlineSolrSink&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.solrSink.channel = memoryChannel&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.solrSink.batchSize = 1000&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.solrSink.batchDurationMillis = 1000&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.solrSink.morphlineFile = morphlines.conf&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.solrSink.morphlineId = morphline1&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.channel = memoryChannel&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.type = hdfs&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.hdfs.path = hdfs://hadoop-pg-6.cluster:8020/tmp/test4solr&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.hdfs.fileType = DataStream&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.hdfs.writeFormat = Text&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.hdfs.batchSize = 2000&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.hdfs.rollSize = 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.hdfs.rollCount = 2000&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;agent.sinks.HDFS.hdfs.rollInterval = 30&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;======morphline config=======&lt;/P&gt;&lt;P&gt;&lt;SPAN style="line-height: normal; font-size: x-small;"&gt;# Specify server locations in a SOLR_LOCATOR variable; used later in variable substitutions:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;SOLR_LOCATOR : {&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;collection : workshop&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# ZooKeeper ensemble&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;zkHost : "$ZK_HOST"&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# The maximum number of documents to send to Solr per network batch (throughput knob)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# batchSize : 100&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;morphlines : [&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;{&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;id : morphline1 &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;importCommands : ["org.kitesdk.**", "org.apache.solr.**"]&lt;/FONT&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;commands : [ &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;{ readLine { charset : UTF-8 } }&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;{ addCurrentTime {&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;field : manual_timestamp&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;preserveExisting : false&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;} &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;{&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;grok {&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;dictionaryFiles : [grok-dictionaries]&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;expressions : {&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;message : """&amp;lt;%{POSINT:syslog_pri}&amp;gt;%{SYSLOGTIMESTAMP:syslog_timestamp} %{SYSLOGHOST:syslog_hostname} %{DATA&amp;amp;colon;syslog_program}(?:\[%{POSINT:syslog_pid}\])?: %{GREEDYDATA&amp;amp;colon;syslog_message}"""&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;# convert timestamp field to native Solr timestamp format&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# e.g. 2012-09-06T07:14:34Z to 2012-09-06T07:14:34.000Z&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;{&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;convertTimestamp {&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;field : created_at&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;inputFormats : ["yyyy-MM-dd'T'HH:mm:ss'Z'", "yyyy-MM-dd"]&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;inputTimezone : America/Los_Angeles&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;outputFormat : "yyyy-MM-dd'T'HH:mm:ss.SSSZ" &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;outputTimezone : UTC&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# Recall that Solr throws an exception on any attempt to load a document that contains a &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# field that isn't specified in schema.xml.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;{&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;sanitizeUnknownSolrFields {&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# Location from which to fetch Solr schema&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;solrLocator : ${SOLR_LOCATOR}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;} &lt;/FONT&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# log the record at DEBUG level to SLF4J&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;{ logDebug { format : "output record: {}", args : ["@{}"] } }&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;# load the record into a SolrServer&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;{ &lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;loadSolr {&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;solrLocator : ${SOLR_LOCATOR}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;]&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;]&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Additionally I wanted to ask where the logDebug output from the morphline will be written to?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What do I need to modify to be able to ingest data into Solr?!?!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;any help appreciated....&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 08:58:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11560#M1666</guid>
      <dc:creator>geko</dc:creator>
      <dc:date>2022-09-16T08:58:14Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11582#M1667</link>
      <description>Try to call it dictionaryFiles : [grok-dictionary.conf] perhttps://&lt;A target="_blank" href="http://www.cloudera.com/content/cloudera-content/cloudera-docs/CM4Ent/4.8.1/Cloudera-Manager-Managing-Clusters/cmmc_adding_search_solr.html"&gt;www.cloudera.com/content/cloudera-content/cloudera-docs/CM4Ent/4.8.1/Cloudera-Manager-Managing-Clusters/cmmc_adding_search_solr.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Wolfgang.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 30 Apr 2014 20:14:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11582#M1667</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-04-30T20:14:09Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11598#M1668</link>
      <description>&lt;P&gt;Hi Wolfgang,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;thanks for answering. I modified that entry accordingly, but nothing has changed in the workflow (no data processing happens, no output in HDFS and solr).&lt;/P&gt;&lt;P&gt;The flume.log remains at (same output as in the original post):&lt;/P&gt;&lt;P&gt;...&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;2014-05-01 12:10:37,893 INFO org.mortbay.log: jetty-6.1.26&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 12:10:37,990 INFO org.mortbay.log: Started SelectChannelConnector@0.0.0.0:41414&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 12:10:38,179 INFO org.kitesdk.morphline.api.MorphlineContext: Importing commands&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;and restarts periodically, about every 90sec.&lt;/P&gt;&lt;P&gt;Additionally the status of this agent in CM is "bad", because of "&lt;BR /&gt;&lt;SPAN&gt;This role encountered 3 unexpected exit(s) in the previous 5 minute(s). Critical threshold: any."&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Where do I have some more log output of what is going on in the morphline pipeline? There is no log/debug output in the flume log, where does the morphline debug message go to?!?!&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;getting confused &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt; , any help appreciated, Gerd&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;PS: if I disable the solr-morphline-sink I receive data on HDFS, thereby the error must be somewhere in the morphline processing (assumingly)&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 May 2014 10:17:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11598#M1668</guid>
      <dc:creator>geko</dc:creator>
      <dc:date>2014-05-01T10:17:50Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11600#M1669</link>
      <description>Weird, what (Solr, CDH, Cloudera Manager) version is this with?&lt;BR /&gt;&lt;BR /&gt;To automatically print diagnostic information such as the content of records as they pass through the morphline commands, consider enabling TRACE log level, for example by adding the following line to your log4j.properties file, e.g. via Cloudera Manager, per &lt;A target="_blank" href="http://kitesdk.org/docs/current/kite-morphlines/morphlinesReferenceGuide.html#/logTrace:"&gt;http://kitesdk.org/docs/current/kite-morphlines/morphlinesReferenceGuide.html#/logTrace:&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;log4j.logger.org.kitesdk.morphline=TRACE&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 01 May 2014 10:33:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11600#M1669</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-05-01T10:33:09Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11602#M1670</link>
      <description>&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="2"&gt;I am using:&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="2"&gt;ClouderaManager:&amp;nbsp;&lt;STRONG&gt;Version&lt;/STRONG&gt;&lt;SPAN&gt;: Cloudera Express 5.0.0 (#215 built by jenkins on 20140331-1424 git: 50c701f3e920b1fcf524bf5fa061d65902cde804)&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="2"&gt;&lt;SPAN&gt;Hadoop stack:&amp;nbsp;&lt;/SPAN&gt;&lt;/FONT&gt;&lt;SPAN style="line-height: normal; font-family: arial, helvetica, sans-serif; font-size: small; font-weight: bold;"&gt;CDH&lt;/SPAN&gt;&lt;SPAN style="line-height: normal; font-family: arial, helvetica, sans-serif; font-size: small; font-weight: bold;"&gt;5.0.0-1.cdh5.0.0.p0.47&lt;/SPAN&gt; (with the included Solr)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I configured the TRACE in the corresponding "Advanced" section of the flume instance, but nothing has changed. After restarting the agent I cannot see any additional output in the&amp;nbsp;/var/log/flume-ng/flume-cmf-flume-AGENT-hadoop-pg-7.cluster.log, and the data I insert into the file that is being watched (via tail -f ...) isn't being processed anyhow.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The only thing what happens is that the the agent seems to reset/restart periodically and CM shows that agent in state 'BAD' due to a lot of unexpected exits.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;hmmm?!?!&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="2"&gt;&lt;SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 May 2014 12:54:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11602#M1670</guid>
      <dc:creator>geko</dc:creator>
      <dc:date>2014-05-01T12:54:18Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11606#M1671</link>
      <description>&lt;P&gt;Hi Wolfgang,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I tried a different approach, starting with a blank flume on a different cluster node. I configured the exact same settings for flume-/morphline-configuration. The behaviour is almost the same, data isn't being processed, but now I have an error message in the flume.log. Most probably you can interpret it much better &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; o.k., out of memory sounds obvious, but which setting does this affect ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:11,724 DEBUG org.apache.flume.node.PollingPropertiesFileConfigurationProvider: Checking file:/var/run/cloudera-scm-agent/process/1089-flume-AGENT/flume.conf for changes&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:37,382 ERROR org.apache.flume.sink.hdfs.HDFSEventSink: process failed&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,224 DEBUG org.apache.flume.node.PollingPropertiesFileConfigurationProvider: Checking file:/var/run/cloudera-scm-agent/process/1089-flume-AGENT/flume.conf for changes&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,977 ERROR org.apache.flume.lifecycle.LifecycleSupervisor: Unable to start SinkRunner: { policy:org.apache.flume.sink.DefaultSinkProcessor@2ed7c530 counterGroup:{ name:null counters:{} } } - Exception follows.&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;java.lang.OutOfMemoryError: Java heap space&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.zip.ZipCoder.toString(ZipCoder.java:49)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.zip.ZipFile.getZipEntry(ZipFile.java:531)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.zip.ZipFile.access$900(ZipFile.java:56)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.zip.ZipFile$1.nextElement(ZipFile.java:513)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.zip.ZipFile$1.nextElement(ZipFile.java:483)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.jar.JarFile$1.nextElement(JarFile.java:243)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.jar.JarFile$1.nextElement(JarFile.java:238)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.shaded.com.google.common.reflect.ClassPath$Scanner.scanJar(ClassPath.java:343)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.shaded.com.google.common.reflect.ClassPath$Scanner.scanFrom(ClassPath.java:288)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.shaded.com.google.common.reflect.ClassPath$Scanner.scan(ClassPath.java:276)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.shaded.com.google.common.reflect.ClassPath.from(ClassPath.java:84)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.api.MorphlineContext.getTopLevelClasses(MorphlineContext.java:134)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.api.MorphlineContext.importCommandBuilders(MorphlineContext.java:77)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.stdlib.Pipe.&amp;lt;init&amp;gt;(Pipe.java:43)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.stdlib.PipeBuilder.build(PipeBuilder.java:40)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.base.Compiler.compile(Compiler.java:126)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.kitesdk.morphline.base.Compiler.compile(Compiler.java:55)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.apache.flume.sink.solr.morphline.MorphlineHandlerImpl.configure(MorphlineHandlerImpl.java:101)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.apache.flume.sink.solr.morphline.MorphlineSink.start(MorphlineSink.java:97)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.apache.flume.sink.DefaultSinkProcessor.start(DefaultSinkProcessor.java:46)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.apache.flume.SinkRunner.start(SinkRunner.java:79)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at org.apache.flume.lifecycle.LifecycleSupervisor$MonitorRunnable.run(LifecycleSupervisor.java:251)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;at java.lang.Thread.run(Thread.java:744)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.sink.solr.morphline.MorphlineSink: Morphline Sink solrSink stopping...&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Component type: SINK, name: solrSink stopped&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.start.time == 1398949449905&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.stop.time == 1398949544982&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.batch.complete == 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.batch.empty == 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.batch.underflow == 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.connection.closed.count == 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,982 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.connection.creation.count == 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,983 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.connection.failed.count == 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,983 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.event.drain.attempt == 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,983 INFO org.apache.flume.instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SINK, name: solrSink. sink.event.drain.sucess == 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,983 INFO org.apache.flume.sink.solr.morphline.MorphlineSink: Morphline Sink solrSink stopped. Metrics: SINK:solrSink{sink.connection.closed.count=0, sink.event.drain.attempt=0, sink.batch.underflow=0, sink.connection.failed.count=0, sink.connection.creation.count=0, sink.event.drain.sucess=0, sink.batch.empty=0, sink.batch.complete=0}, {}&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:05:44,983 WARN org.apache.flume.lifecycle.LifecycleSupervisor: Component SinkRunner: { policy:org.apache.flume.sink.DefaultSinkProcessor@2ed7c530 counterGroup:{ name:null counters:{} } } stopped, since it could not besuccessfully started due to missing dependencies&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;2014-05-01 15:06:14,979 DEBUG org.apache.flume.node.PollingPropertiesFileConfigurationProvider: Checking file:/var/run/cloudera-scm-agent/process/1089-flume-AGENT/flume.conf for changes&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;br, Gerd&lt;/P&gt;</description>
      <pubDate>Thu, 01 May 2014 13:09:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11606#M1671</guid>
      <dc:creator>geko</dc:creator>
      <dc:date>2014-05-01T13:09:15Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11610#M1672</link>
      <description>OOM would explain it. The flume default setting for jvm memory is very low. Try something like -Xmx512m -XX:MaxPermSize=256m&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 01 May 2014 13:41:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11610#M1672</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-05-01T13:41:09Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11618#M1673</link>
      <description>&lt;P&gt;Hi Wolfgang, again me &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;things are getting better. Seems like the increased memory setting saves my day.&lt;/P&gt;&lt;P&gt;Now I am getting the expected DEBUG output from the morphline, but the document doesn't fit into Solr because the required field "id" is missing. Yes, of course Solr wouldn't accept such a document, but I configured flume to insert a header field called "id" filled by the UUIDInterceptor.&lt;/P&gt;&lt;P&gt;The DEBUG output is:&lt;/P&gt;&lt;P&gt;""&lt;/P&gt;&lt;P&gt;2014-05-01 17:04:46,280 DEBUG org.kitesdk.morphline.stdlib.LogDebugBuilder$LogDebug: 1 : [{message=[&amp;lt;168&amp;gt;Feb 4 10:46:14 syslog sshd[607]: listening on 0.0.0.0 port 22.]}]&lt;/P&gt;&lt;P&gt;""&lt;/P&gt;&lt;P&gt;This is the whole line of text read by ReadLine. Also the following Grok regex is parsing correctly but I have no idea how to access the header field "id" from the UUIDInterceptor.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;How do I have to handle that additional field to be put into the corresponding document-part being inserted into Solr ?!?!&lt;/P&gt;&lt;P&gt;How can I check the header fields of the incoming event at all?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;best, Gerd&lt;/P&gt;</description>
      <pubDate>Thu, 01 May 2014 16:05:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11618#M1673</guid>
      <dc:creator>geko</dc:creator>
      <dc:date>2014-05-01T16:05:46Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11620#M1674</link>
      <description>Interceptors are executed prior to Sinks.&lt;BR /&gt;&lt;BR /&gt;If the UUIDInterceptor does nothing it's probably misconfigured or attached to the wrong channel in flume.conf, or similar.&lt;BR /&gt;&lt;BR /&gt;Alternatively, consider replacing the UUIDInterceptor with a MorphlineInterceptor that uses the generateUUID command, or move the generateUUID command into the morphline config of the MorphlineSolrSink.&lt;BR /&gt;&lt;BR /&gt;Also see &lt;A target="_blank" href="http://kitesdk.org/docs/current/kite-morphlines/morphlinesReferenceGuide.html#/generateUUID"&gt;http://kitesdk.org/docs/current/kite-morphlines/morphlinesReferenceGuide.html#/generateUUID&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Wolfgang.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 01 May 2014 16:37:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11620#M1674</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-05-01T16:37:09Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11636#M1675</link>
      <description>&lt;P&gt;Hi Wolfgang,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;finally I got it (almost) to work, just some schema.xml settings to adjust.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Many thanks for your great support, let's celebrate that at BerlinBuzz &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;br, Gerd&lt;/P&gt;</description>
      <pubDate>Thu, 01 May 2014 20:19:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11636#M1675</guid>
      <dc:creator>geko</dc:creator>
      <dc:date>2014-05-01T20:19:58Z</dc:date>
    </item>
    <item>
      <title>Re: need help with flume - morphline - solr pipeline, CDH5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11644#M1676</link>
      <description>Great! Looking forward to meet up @ Berlin Buzzwords.&lt;BR /&gt;&lt;BR /&gt;Wolfgang.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 01 May 2014 21:28:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/need-help-with-flume-morphline-solr-pipeline-CDH5-amp-CM/m-p/11644#M1676</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-05-01T21:28:09Z</dc:date>
    </item>
  </channel>
</rss>

