<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: PutHiveQL and putHiveStreaming processors in Apache Nifi are very slow in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213052#M63253</link>
    <description>&lt;P&gt;What is the most straightforward way to load data into Hive tables using Nifi? We use Hive 1.1 and have ingested the data and put it into HDFS as Avro files. &lt;A href="https://community.hortonworks.com/questions/108049/puthiveql-and-puthivestreaming-processors-in-apach.html#"&gt;@Matt Burgess&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 11 Feb 2019 06:04:48 GMT</pubDate>
    <dc:creator>manekshaw</dc:creator>
    <dc:date>2019-02-11T06:04:48Z</dc:date>
    <item>
      <title>PutHiveQL and putHiveStreaming processors in Apache Nifi are very slow</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213050#M63251</link>
      <description>&lt;P&gt;I am using the PutHiveSql to insert data into hive table. But it is very slow. It is approximately inserting each row in 2 to 3 secs.&lt;/P&gt;&lt;P&gt;Is there a way to increase the speed of insertion ? &lt;/P&gt;&lt;P&gt;It took around 3 days to insert 15000 rows !&lt;/P&gt;&lt;P&gt;please find below the puthiveQl processor configuration:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="17459-puthiveql.png" style="width: 953px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/16518iFA9B3046FC93E36D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="17459-puthiveql.png" alt="17459-puthiveql.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Complete flow:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="17460-complete-flow.png" style="width: 1515px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/16519i71D0E42D013E2DBA/image-size/medium?v=v2&amp;amp;px=400" role="button" title="17460-complete-flow.png" alt="17460-complete-flow.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;BR /&gt;&lt;IMG src="https://community.cloudera.com/t5/image/serverpage/image-id/7331iA8C93AADF34207A6/image-size/large?v=1.0&amp;amp;px=999" border="0" alt="puthiveql.png" title="puthiveql.png" /&gt;</description>
      <pubDate>Sun, 18 Aug 2019 03:41:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213050#M63251</guid>
      <dc:creator>shivanandk</dc:creator>
      <dc:date>2019-08-18T03:41:54Z</dc:date>
    </item>
    <item>
      <title>Re: PutHiveQL and putHiveStreaming processors in Apache Nifi are very slow</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213051#M63252</link>
      <description>&lt;P&gt;What version of NiFi/HDF are you using? As of NiFi 1.2.0 / HDF 3.0.0, &lt;A target="_blank" href="https://issues.apache.org/jira/browse/NIFI-3031"&gt;PutHiveQL can accept multiple statements&lt;/A&gt; in one flow file, so if you are currently dealing with one INSERT statement per flow file, try MergeContent to batch them up into a single flow file. This should increase performance, but since Hive is an auto-commit database, PutHiveQL is probably not the best choice for large/fast ingest needs. You may be better off putting the data into HDFS and creating/loading a table from it.&lt;/P&gt;&lt;P&gt;For PutHiveStreaming, there is a &lt;A target="_blank" href="https://issues.apache.org/jira/browse/NIFI-3418"&gt;known issue&lt;/A&gt; that can reduce performance, it also was fixed in NiFi 1.2.0 / HDF 3.0.0.&lt;/P&gt;</description>
      <pubDate>Wed, 21 Jun 2017 02:47:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213051#M63252</guid>
      <dc:creator>mburgess</dc:creator>
      <dc:date>2017-06-21T02:47:14Z</dc:date>
    </item>
    <item>
      <title>Re: PutHiveQL and putHiveStreaming processors in Apache Nifi are very slow</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213052#M63253</link>
      <description>&lt;P&gt;What is the most straightforward way to load data into Hive tables using Nifi? We use Hive 1.1 and have ingested the data and put it into HDFS as Avro files. &lt;A href="https://community.hortonworks.com/questions/108049/puthiveql-and-puthivestreaming-processors-in-apach.html#"&gt;@Matt Burgess&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 11 Feb 2019 06:04:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213052#M63253</guid>
      <dc:creator>manekshaw</dc:creator>
      <dc:date>2019-02-11T06:04:48Z</dc:date>
    </item>
    <item>
      <title>Re: PutHiveQL and putHiveStreaming processors in Apache Nifi are very slow</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213053#M63254</link>
      <description>&lt;P&gt;In an upcoming release you'll be able to use Hive 1.1 processors, so in your case you'd want to keep what you have (Avro in HDFS) and use PutHive_1_1QL to issue a LOAD DATA or CREATE EXTERNAL TABLE statement so Hive can see your data.&lt;/P&gt;</description>
      <pubDate>Tue, 12 Feb 2019 02:28:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/PutHiveQL-and-putHiveStreaming-processors-in-Apache-Nifi-are/m-p/213053#M63254</guid>
      <dc:creator>mburgess</dc:creator>
      <dc:date>2019-02-12T02:28:09Z</dc:date>
    </item>
  </channel>
</rss>

