<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: how to use puthivestreaming in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/how-to-use-puthivestreaming/m-p/108432#M71287</link>
    <description>&lt;P&gt;Great answer! Just to add a caveat, if you are using HDF 2.0 and HDP 2.5, please see the following: &lt;A href="https://community.hortonworks.com/questions/59681/puthivestreaming-nifi-processor-various-errors.html" target="_blank"&gt;https://community.hortonworks.com/questions/59681/puthivestreaming-nifi-processor-various-errors.html&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 04 Oct 2016 20:08:53 GMT</pubDate>
    <dc:creator>mburgess</dc:creator>
    <dc:date>2016-10-04T20:08:53Z</dc:date>
    <item>
      <title>how to use puthivestreaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-use-puthivestreaming/m-p/108430#M71285</link>
      <description>&lt;P&gt;I want to move data from hdfs to hive using puthivestreaming of nifi. does anyone have example?&lt;/P&gt;</description>
      <pubDate>Sun, 02 Oct 2016 21:09:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-use-puthivestreaming/m-p/108430#M71285</guid>
      <dc:creator>121285904</dc:creator>
      <dc:date>2016-10-02T21:09:35Z</dc:date>
    </item>
    <item>
      <title>Re: how to use puthivestreaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-use-puthivestreaming/m-p/108431#M71286</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;Hive streaming is supported against tables having the following :&lt;/P&gt;&lt;OL&gt;
&lt;LI&gt;ORC is the only format supported currently. So your table must have "stored as orc"&lt;/LI&gt;&lt;LI&gt;transactional = "true" should be set in the table create statement&lt;/LI&gt;&lt;LI&gt;Bucketed but not sorted. So your table must have "clustered by (colName) into &lt;EM&gt;(n)&lt;/EM&gt; buckets"&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Also, Hive must have the following properties set&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;hive.txn.manager = org.apache.hadoop.hive.ql.lockmgr.DbTxnManager&lt;/LI&gt;&lt;LI&gt;hive.compactor.initiator.on = true&lt;/LI&gt;&lt;LI&gt;hive.compactor.worker.threads &amp;gt; 0&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Please follow the documentation here &lt;A href="https://cwiki.apache.org/confluence/display/Hive/Streaming+Data+Ingest" target="_blank" rel="nofollow noopener noreferrer"&gt;https://cwiki.apache.org/confluence/display/Hive/Streaming+Data+Ingest&lt;/A&gt;&lt;/P&gt;&lt;P&gt;PutHiveStreaming requires your input data to be in Avro format, as is provided in the documentation here: &lt;/P&gt;&lt;P&gt;&lt;A href="https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.hive.PutHiveStreaming/" target="_blank" rel="nofollow noopener noreferrer"&gt;https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.hive.PutHiveStreaming/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Considering your data is JSON, we can construct a Nifi workflow as follows:&lt;/P&gt;&lt;P&gt;ListHDFS--&amp;gt; FetchHDFS--&amp;gt; ConvertJsonToAvro--&amp;gt;PutHiveStreaming&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="8202-screen-shot-2016-10-03-at-122338-pm-copy.jpg" style="width: 2324px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/23417i68D0F2BCF33E960B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="8202-screen-shot-2016-10-03-at-122338-pm-copy.jpg" alt="8202-screen-shot-2016-10-03-at-122338-pm-copy.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="8201-screen-shot-2016-10-03-at-121935-pm.png" style="width: 1586px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/23418i561B7C6DC7747DDA/image-size/medium?v=v2&amp;amp;px=400" role="button" title="8201-screen-shot-2016-10-03-at-121935-pm.png" alt="8201-screen-shot-2016-10-03-at-121935-pm.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Let me know if this helps.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 11:39:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-use-puthivestreaming/m-p/108431#M71286</guid>
      <dc:creator>mkalyanpur</dc:creator>
      <dc:date>2019-08-19T11:39:58Z</dc:date>
    </item>
    <item>
      <title>Re: how to use puthivestreaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-use-puthivestreaming/m-p/108432#M71287</link>
      <description>&lt;P&gt;Great answer! Just to add a caveat, if you are using HDF 2.0 and HDP 2.5, please see the following: &lt;A href="https://community.hortonworks.com/questions/59681/puthivestreaming-nifi-processor-various-errors.html" target="_blank"&gt;https://community.hortonworks.com/questions/59681/puthivestreaming-nifi-processor-various-errors.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 04 Oct 2016 20:08:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-use-puthivestreaming/m-p/108432#M71287</guid>
      <dc:creator>mburgess</dc:creator>
      <dc:date>2016-10-04T20:08:53Z</dc:date>
    </item>
  </channel>
</rss>

