<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Loading HBase from Hive ORC Tables in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96412#M9851</link>
    <description>&lt;P&gt;Demo article has been added here:&lt;/P&gt;&lt;P&gt;&lt;A href="http://community.hortonworks.com/articles/2745/creating-hbase-hfiles-from-an-existing-hive-table.html"&gt;creating-hbase-hfiles-from-an-existing-hive-table&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 04 Nov 2015 07:56:23 GMT</pubDate>
    <dc:creator>skumpf</dc:creator>
    <dc:date>2015-11-04T07:56:23Z</dc:date>
    <item>
      <title>Loading HBase from Hive ORC Tables</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96407#M9846</link>
      <description>&lt;P&gt;I'm looking for approaches to loading HBase tables when all I have is the data in an ORC-backed Hive table.&lt;/P&gt;&lt;P&gt;I would prefer a bulk load approach, given that there are several hundred million rows in the ORC-backed Hive table.&lt;/P&gt;&lt;P&gt;I found the following. Does anyone have experience with Hive's HBase bulk load feature? Or would it be better to create a CSV table, CTAS from ORC into the CSV table, and then use ImportTsv on the HBase side?&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="https://cwiki.apache.org/confluence/display/Hive/HBaseBulkLoad"&gt;HiveHBaseBulkLoad&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Any experiences here would be appreciated.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Nov 2015 04:49:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96407#M9846</guid>
      <dc:creator>skumpf</dc:creator>
      <dc:date>2015-11-03T04:49:12Z</dc:date>
    </item>
    <item>
      <title>Re: Loading HBase from Hive ORC Tables</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96408#M9847</link>
      <description>&lt;P&gt;Hey&lt;/P&gt;&lt;P&gt;You can bulk load into HBase in several different ways. The ImportTsv tool has been out there for a while. However, if your data is in ORC with a Hive table on top, the Hive bulk load is an easier option with fewer moving parts.&lt;/P&gt;&lt;P&gt;This slide deck from Nick has a lot of info: &lt;A href="http://fr.slideshare.net/HBaseCon/ecosystem-session-3a"&gt;http://fr.slideshare.net/HBaseCon/ecosystem-session-3a&lt;/A&gt;. Slide 12 is the one you want to look at.&lt;/P&gt;&lt;P&gt;Essentially:&lt;/P&gt;&lt;P&gt;set hive.hbase.generatehfiles=true&lt;/P&gt;&lt;P&gt;set hfile.family.path=/tmp/somewhere (this can also be a property)&lt;/P&gt;&lt;P&gt;This allows you to do an INSERT INTO with the result of a SQL statement, which is a little more agile than having to go the CSV route. Be careful: the HBase user will be the one picking up the generated files.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Nov 2015 05:02:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96408#M9847</guid>
      <dc:creator>nmaillard1</dc:creator>
      <dc:date>2015-11-03T05:02:33Z</dc:date>
    </item>
    <item>
      <title>Re: Loading HBase from Hive ORC Tables</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96409#M9848</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/157/rgelhausen.html" nodeid="157"&gt;@Randy Gelhausen&lt;/A&gt; was recently able to get the Phoenix CsvBulkLoadTool to work after adjusting the classpath:&lt;PRE&gt;HADOOP_CLASSPATH=/usr/hdp/current/hbase-client/lib/hbase-protocol.jar:/etc/hbase/conf hadoop jar /usr/hdp/current/phoenix-client/phoenix-client.jar org.apache.phoenix.mapreduce.CsvBulkLoadTool --table test --input /user/root/test --zookeeper localhost:2181:/hbase-unsecure
&lt;/PRE&gt;</description>
      <pubDate>Tue, 03 Nov 2015 05:09:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96409#M9848</guid>
      <dc:creator>abajwa</dc:creator>
      <dc:date>2015-11-03T05:09:10Z</dc:date>
    </item>
    <item>
      <title>Re: Loading HBase from Hive ORC Tables</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96410#M9849</link>
      <description>&lt;P&gt;While I've yet to use this on the large table, it worked very well on a small sample. There were some gotchas that aren't explicitly called out anywhere. I will put together a guide and post it to AH, and link it back here when ready.&lt;/P&gt;&lt;P&gt;I've scripted out an example of using this feature here:&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/sakserv/hive-hbase-generatehfiles"&gt;https://github.com/sakserv/hive-hbase-generatehfiles&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Wed, 04 Nov 2015 07:47:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96410#M9849</guid>
      <dc:creator>skumpf</dc:creator>
      <dc:date>2015-11-04T07:47:06Z</dc:date>
    </item>
    <item>
      <title>Re: Loading HBase from Hive ORC Tables</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96411#M9850</link>
      <description>&lt;P&gt;This shows promise as well. I plan to give this a try soon. However, the accepted answer avoids needing to go from ORC back to CSV, so it gets the win. &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Nov 2015 07:48:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96411#M9850</guid>
      <dc:creator>skumpf</dc:creator>
      <dc:date>2015-11-04T07:48:24Z</dc:date>
    </item>
    <item>
      <title>Re: Loading HBase from Hive ORC Tables</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96412#M9851</link>
      <description>&lt;P&gt;Demo article has been added here:&lt;/P&gt;&lt;P&gt;&lt;A href="http://community.hortonworks.com/articles/2745/creating-hbase-hfiles-from-an-existing-hive-table.html"&gt;creating-hbase-hfiles-from-an-existing-hive-table&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Nov 2015 07:56:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Loading-HBase-from-Hive-ORC-Tables/m-p/96412#M9851</guid>
      <dc:creator>skumpf</dc:creator>
      <dc:date>2015-11-04T07:56:23Z</dc:date>
    </item>
  </channel>
</rss>

