<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Accelerate working processor PutHiveStreaming in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119414#M55211</link>
    <description>&lt;P&gt;PutHiveStreaming relies on the &lt;A href="https://cwiki.apache.org/confluence/display/Hive/Streaming+Data+Ingest"&gt;Streaming API&lt;/A&gt;, which has two relevant concepts: the number of events per transaction and the number of transactions per batch. Generally, the more events you write per transaction, the faster the ingest. I don't see the first of these properties in the NiFi doc referenced above; perhaps some NiFi-specific property controls it.&lt;/P&gt;</description>
    <pubDate>Fri, 24 Feb 2017 03:33:47 GMT</pubDate>
    <dc:creator>ekoifman</dc:creator>
    <dc:date>2017-02-24T03:33:47Z</dc:date>
    <item>
      <title>Accelerate working processor PutHiveStreaming</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119412#M55209</link>
      <description>&lt;P&gt;Is it possible to speed up the PutHiveStreaming processor, for example by running it in parallel?&lt;/P&gt;&lt;P&gt;Under Scheduling --&amp;gt; Concurrent Tasks, only a single task is possible.&lt;/P&gt;</description>
      <pubDate>Wed, 22 Feb 2017 18:12:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119412#M55209</guid>
      <dc:creator>Kyivstar</dc:creator>
      <dc:date>2017-02-22T18:12:03Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerate working processor PutHiveStreaming</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119413#M55210</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/14716/dbca.html" nodeid="14716"&gt;@Dmitro Vasilenko&lt;/A&gt;&lt;P&gt;I'm going to attempt to answer this, although there are far better experts here. You can improve performance in a few ways: parallelize by running the PutHiveStreaming processor on a couple of nodes, or tweak some of the processor's parameters.&lt;/P&gt;&lt;P&gt;&lt;A href="https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.hive.PutHiveStreaming/" target="_blank"&gt;https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.hive.PutHiveStreaming/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The first property I'd probably tweak is Transactions per batch. Check whether yours is set too low.&lt;/P&gt;</description>
      <pubDate>Fri, 24 Feb 2017 00:46:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119413#M55210</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2017-02-24T00:46:22Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerate working processor PutHiveStreaming</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119414#M55211</link>
      <description>&lt;P&gt;PutHiveStreaming relies on the &lt;A href="https://cwiki.apache.org/confluence/display/Hive/Streaming+Data+Ingest"&gt;Streaming API&lt;/A&gt;, which has two relevant concepts: the number of events per transaction and the number of transactions per batch. Generally, the more events you write per transaction, the faster the ingest. I don't see the first of these properties in the NiFi doc referenced above; perhaps some NiFi-specific property controls it.&lt;/P&gt;</description>
      <pubDate>Fri, 24 Feb 2017 03:33:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119414#M55211</guid>
      <dc:creator>ekoifman</dc:creator>
      <dc:date>2017-02-24T03:33:47Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerate working processor PutHiveStreaming</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119415#M55212</link>
      <description>&lt;P&gt;As of &lt;A target="_blank" href="https://issues.apache.org/jira/browse/NIFI-3418"&gt;NIFI-3418&lt;/A&gt;, NiFi will allow the user to set both of the aforementioned properties.&lt;/P&gt;</description>
      <pubDate>Fri, 24 Feb 2017 03:41:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119415#M55212</guid>
      <dc:creator>mburgess</dc:creator>
      <dc:date>2017-02-24T03:41:46Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerate working processor PutHiveStreaming</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119416#M55213</link>
      <description>&lt;P&gt;Even after setting both of the suggested properties, the output rate of the Hive Streaming processor still seems slow: we are getting a mere 2 tps from the processor. The input queue contains more than 10k messages, and the processor properties are: transactions per batch = 1k, records per transaction = 100k.&lt;/P&gt;</description>
      <pubDate>Thu, 07 Sep 2017 19:56:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119416#M55213</guid>
      <dc:creator>mananstar2001</dc:creator>
      <dc:date>2017-09-07T19:56:26Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerate working processor PutHiveStreaming</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119417#M55214</link>
      <description>&lt;P&gt;I have set my Transactions per batch to 10. Is that too low? I am really looking for ways to process my queues faster. They always seem to pile up slowly over time.&lt;/P&gt;</description>
      <pubDate>Tue, 15 May 2018 11:54:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Accelerate-working-processor-PutHiveStreaming/m-p/119417#M55214</guid>
      <dc:creator>abhinav_joshi</dc:creator>
      <dc:date>2018-05-15T11:54:43Z</dc:date>
    </item>
  </channel>
</rss>

