<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: What is the best, most performant, method to join a Hive table with data in Hbase? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-the-best-most-performant-method-to-join-a-Hive-table/m-p/96985#M10535</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/267/hrongali.html" nodeid="267"&gt;@hrongali@hortonworks.com&lt;/A&gt; I think a Hive UDF could implement the same logic and would be easier to consume than a MapReduce program. I think this UDF from Brickhouse does this:&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="https://github.com/klout/brickhouse/blob/master/src/main/java/brickhouse/hbase/CachedGetUDF.java"&gt;https://github.com/klout/brickhouse/blob/master/src/main/java/brickhouse/hbase/CachedGetUDF.java&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 13 Nov 2015 03:33:18 GMT</pubDate>
    <dc:creator>gbraccialli3</dc:creator>
    <dc:date>2015-11-13T03:33:18Z</dc:date>
    <item>
      <title>What is the best, most performant, method to join a Hive table with data in Hbase?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-the-best-most-performant-method-to-join-a-Hive-table/m-p/96983#M10533</link>
      <description>&lt;P&gt;I have batch data stored in Hive and real-time streaming data stored in HBase. I would like to create a view in Hive that joins a Hive table with data in HBase. Using Hive on HBase is extremely slow. Is there a better way to accomplish this?&lt;/P&gt;</description>
      <pubDate>Thu, 12 Nov 2015 23:56:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-the-best-most-performant-method-to-join-a-Hive-table/m-p/96983#M10533</guid>
      <dc:creator>SQLShaw</dc:creator>
      <dc:date>2015-11-12T23:56:16Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best, most performant, method to join a Hive table with data in Hbase?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-the-best-most-performant-method-to-join-a-Hive-table/m-p/96984#M10534</link>
      <description>&lt;P&gt;Sort and bucket the Hive table, read the bucketed table in a MapReduce program, and hit HBase only when the key changes. This requires programming effort, but it is very effective. Bucketing the Hive table ensures that a particular key goes to only one bucket, so you hit HBase once per key.&lt;/P&gt;</description>
      <pubDate>Fri, 13 Nov 2015 00:00:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-the-best-most-performant-method-to-join-a-Hive-table/m-p/96984#M10534</guid>
      <dc:creator>hrongali</dc:creator>
      <dc:date>2015-11-13T00:00:00Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best, most performant, method to join a Hive table with data in Hbase?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-the-best-most-performant-method-to-join-a-Hive-table/m-p/96985#M10535</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/267/hrongali.html" nodeid="267"&gt;@hrongali@hortonworks.com&lt;/A&gt; I think a Hive UDF could implement the same logic and would be easier to consume than a MapReduce program. I think this UDF from Brickhouse does this:&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="https://github.com/klout/brickhouse/blob/master/src/main/java/brickhouse/hbase/CachedGetUDF.java"&gt;https://github.com/klout/brickhouse/blob/master/src/main/java/brickhouse/hbase/CachedGetUDF.java&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 13 Nov 2015 03:33:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-the-best-most-performant-method-to-join-a-Hive-table/m-p/96985#M10535</guid>
      <dc:creator>gbraccialli3</dc:creator>
      <dc:date>2015-11-13T03:33:18Z</dc:date>
    </item>
  </channel>
</rss>

