<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: routeoncontent is slow in processing in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200058#M65577</link>
    <description>&lt;A rel="user" href="https://community.cloudera.com/users/23208/hadoopuserhadoop.html" nodeid="23208"&gt;@Hadoop User&lt;/A&gt;&lt;P&gt;Do you see the data queueing up after the RouteOnContent processor in the flow?&lt;/P&gt;</description>
    <pubDate>Wed, 02 Aug 2017 04:25:37 GMT</pubDate>
    <dc:creator>Wynner</dc:creator>
    <dc:date>2017-08-02T04:25:37Z</dc:date>
    <item>
      <title>routeoncontent is slow in processing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200052#M65571</link>
      <description>&lt;P&gt;1. I am getting a stream of messages which have a delimeter ';' &lt;/P&gt;&lt;P&gt;2. I am splitting those messages on ';'&lt;/P&gt;&lt;P&gt;3. those messages are then send to routeonconect, based on some part of text (eg. text containing "one" will be sent to 1st puthdfs. text containing "two" will be sent to 2nd puthdfs processor etc)&lt;/P&gt;&lt;P&gt;4. routed messages are then merged in to single file, using mergecontent.&lt;/P&gt;&lt;P&gt;5. merged file is put in hdfs.&lt;/P&gt;&lt;P&gt;Routeoncontent processor is taking toomuch time to route after stream messages arrive from splitcontent.&lt;/P&gt;&lt;P&gt;I am using match requirement as "content must contain match"&lt;/P&gt;&lt;P&gt;Can anybody help in this case&lt;/P&gt;</description>
      <pubDate>Thu, 27 Jul 2017 03:35:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200052#M65571</guid>
      <dc:creator>mark_hadoop</dc:creator>
      <dc:date>2017-07-27T03:35:15Z</dc:date>
    </item>
    <item>
      <title>Re: routeoncontent is slow in processing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200053#M65572</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/115917/routeoncontent-is-slow-in-processing.html#"&gt;@Hadoop User&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;How many and what are the size of typical files from the SplitContent processor?&lt;/P&gt;&lt;P&gt;How many Concurrent tasks does the RouteOnContent processor have configured?&lt;/P&gt;&lt;P&gt;Try increasing the Run Duration and/or Concurrent Tasks for better throughput.&lt;/P&gt;</description>
      <pubDate>Thu, 27 Jul 2017 06:44:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200053#M65572</guid>
      <dc:creator>Wynner</dc:creator>
      <dc:date>2017-07-27T06:44:47Z</dc:date>
    </item>
    <item>
      <title>Re: routeoncontent is slow in processing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200054#M65573</link>
      <description>&lt;P&gt;typically each message from split content processor is &amp;lt;=3KB&lt;/P&gt;&lt;P&gt;concurrent processor are 1.&lt;/P&gt;&lt;P&gt;Also, every second &amp;gt;50000 messages will be received and splitted and sent to route on content processor. I tested it with 50k messages, till route on content it just takes 2-3 second, but after that it is taking almost 3hours!!&lt;/P&gt;&lt;P&gt;I will increase the number of concurrent processors and see, it this helps me to improve the performance&lt;/P&gt;</description>
      <pubDate>Thu, 27 Jul 2017 21:32:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200054#M65573</guid>
      <dc:creator>mark_hadoop</dc:creator>
      <dc:date>2017-07-27T21:32:57Z</dc:date>
    </item>
    <item>
      <title>Re: routeoncontent is slow in processing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200055#M65574</link>
      <description>&lt;P&gt;I tried with changing the concurrent processes with 100(for testing), tested with 1k messages, it took 11 minutes to complete.&lt;/P&gt;&lt;P&gt;Any suggestions, please!!&lt;/P&gt;</description>
      <pubDate>Thu, 27 Jul 2017 21:36:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200055#M65574</guid>
      <dc:creator>mark_hadoop</dc:creator>
      <dc:date>2017-07-27T21:36:23Z</dc:date>
    </item>
    <item>
      <title>Re: routeoncontent is slow in processing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200056#M65575</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/23208/hadoopuserhadoop.html" nodeid="23208"&gt;@Hadoop User&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;100 concurrent tasks, unless you have a large number of CPUs available, is too many.&lt;/P&gt;&lt;P&gt;Try using 4 concurrent tasks and a run duration of 2 seconds. How long does that take to process the 50k messages?&lt;/P&gt;&lt;P&gt;What does the RouteOnContent processor configuration look like?&lt;/P&gt;</description>
      <pubDate>Fri, 28 Jul 2017 01:07:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200056#M65575</guid>
      <dc:creator>Wynner</dc:creator>
      <dc:date>2017-07-28T01:07:24Z</dc:date>
    </item>
    <item>
      <title>Re: routeoncontent is slow in processing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200057#M65576</link>
      <description>&lt;P&gt;I have changed it to 4 concurrent tasks, and run duration of 2s.&lt;/P&gt;&lt;P&gt;for 50k messages it took almost 3 hours (never expected case).&lt;/P&gt;&lt;P&gt;eg: a message will be like below&lt;/P&gt;&lt;P&gt;this_is_an_example_message &amp;lt;1&amp;gt; [some_"text_and_digits_here"_number="121212"] [some_text_here] --similarly 50k messages&lt;/P&gt;&lt;P&gt;routeoncontent configuration:&lt;/P&gt;&lt;P&gt;Scheduling: concurrent tasks: 4&lt;/P&gt;&lt;P&gt;Run Schedule: 2s&lt;/P&gt;&lt;P&gt;Properties: matchrequirement: content must contain match&lt;/P&gt;&lt;P&gt;character set: UTF-8&lt;/P&gt;&lt;P&gt;Content Buffer Size :1MB&lt;/P&gt;&lt;P&gt;txt: number="121212"&lt;/P&gt;&lt;P&gt;update attribute: filename updated here&lt;/P&gt;&lt;P&gt;puthdfs: configurations and path updated here&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
      <pubDate>Mon, 31 Jul 2017 11:56:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200057#M65576</guid>
      <dc:creator>mark_hadoop</dc:creator>
      <dc:date>2017-07-31T11:56:10Z</dc:date>
    </item>
    <item>
      <title>Re: routeoncontent is slow in processing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200058#M65577</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/23208/hadoopuserhadoop.html" nodeid="23208"&gt;@Hadoop User&lt;/A&gt;&lt;P&gt;Do you see the data queueing up after the RouteOnContent processor in the flow?&lt;/P&gt;</description>
      <pubDate>Wed, 02 Aug 2017 04:25:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200058#M65577</guid>
      <dc:creator>Wynner</dc:creator>
      <dc:date>2017-08-02T04:25:37Z</dc:date>
    </item>
    <item>
      <title>Re: routeoncontent is slow in processing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200059#M65578</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/286/dwynne.html" nodeid="286"&gt;@Wynner&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I have replaced RouteOnContent processor, but kept parameters same.&lt;/P&gt;&lt;P&gt;Surprisingly, it works pretty fast(seconds). not sure why the old one was not working.&lt;/P&gt;&lt;P&gt;Thanks for your extended support.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Aug 2017 16:21:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/routeoncontent-is-slow-in-processing/m-p/200059#M65578</guid>
      <dc:creator>mark_hadoop</dc:creator>
      <dc:date>2017-08-02T16:21:28Z</dc:date>
    </item>
  </channel>
</rss>

