<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: What is Sort Merge Bucket (SMB) Join in Hive? When it is used? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123658#M86402</link>
    <description>&lt;P&gt;I found the answer below:&lt;/P&gt;&lt;P&gt;In
an SMB join in Hive, each mapper reads a bucket from the first table and
the corresponding bucket from the second table, and then performs a
merge join on the two sorted buckets. Sort Merge Bucket (SMB) join is
mainly used because it imposes no limit on the file, partition, or table
size in the join, so it works best when the tables are large. For an SMB
join, the tables must be bucketed and sorted on the join columns, and all
tables should have the same number of buckets.&lt;/P&gt;</description>
    <pubDate>Sat, 12 Mar 2016 18:04:18 GMT</pubDate>
    <dc:creator>rushikeshdeshmu</dc:creator>
    <dc:date>2016-03-12T18:04:18Z</dc:date>
    <item>
      <title>What is Sort Merge Bucket (SMB) Join in Hive? When it is used?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123655#M86399</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Can anyone explain what a Sort Merge Bucket (SMB) join in Hive is and when it is used?&lt;/P&gt;</description>
      <pubDate>Sat, 12 Mar 2016 17:19:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123655#M86399</guid>
      <dc:creator>rushikeshdeshmu</dc:creator>
      <dc:date>2016-03-12T17:19:08Z</dc:date>
    </item>
    <item>
      <title>Re: What is Sort Merge Bucket (SMB) Join in Hive? When it is used?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123656#M86400</link>
      <description>&lt;P&gt;Please refer to the Hive wiki: &lt;A href="https://cwiki.apache.org/confluence/display/Hive/LanguageManual+JoinOptimization" target="_blank"&gt;https://cwiki.apache.org/confluence/display/Hive/LanguageManual+JoinOptimization&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 12 Mar 2016 17:45:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123656#M86400</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-12T17:45:15Z</dc:date>
    </item>
    <item>
      <title>Re: What is Sort Merge Bucket (SMB) Join in Hive? When it is used?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123657#M86401</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/393/aervits.html"&gt;Artem Ervits&lt;/A&gt;, thanks for the reply and the link.&lt;/P&gt;</description>
      <pubDate>Sat, 12 Mar 2016 18:01:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123657#M86401</guid>
      <dc:creator>rushikeshdeshmu</dc:creator>
      <dc:date>2016-03-12T18:01:02Z</dc:date>
    </item>
    <item>
      <title>Re: What is Sort Merge Bucket (SMB) Join in Hive? When it is used?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123658#M86402</link>
      <description>&lt;P&gt;I found the answer below:&lt;/P&gt;&lt;P&gt;In
an SMB join in Hive, each mapper reads a bucket from the first table and
the corresponding bucket from the second table, and then performs a
merge join on the two sorted buckets. Sort Merge Bucket (SMB) join is
mainly used because it imposes no limit on the file, partition, or table
size in the join, so it works best when the tables are large. For an SMB
join, the tables must be bucketed and sorted on the join columns, and all
tables should have the same number of buckets.&lt;/P&gt;</description>
      <pubDate>Sat, 12 Mar 2016 18:04:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123658#M86402</guid>
      <dc:creator>rushikeshdeshmu</dc:creator>
      <dc:date>2016-03-12T18:04:18Z</dc:date>
    </item>
    <item>
      <title>Re: What is Sort Merge Bucket (SMB) Join in Hive? When it is used?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123659#M86403</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/2769/rushikeshdeshmukh007.html"&gt;Rushikesh Deshmukh&lt;/A&gt;&lt;/P&gt;&lt;P&gt;What is the purpose of merging the tables used in joins? Can you please explain?&lt;/P&gt;</description>
      <pubDate>Mon, 19 Sep 2016 12:00:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123659#M86403</guid>
      <dc:creator>shivanageshch</dc:creator>
      <dc:date>2016-09-19T12:00:05Z</dc:date>
    </item>
    <item>
      <title>Re: What is Sort Merge Bucket (SMB) Join in Hive? When it is used?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123660#M86404</link>
      <description>&lt;P&gt;Do the configurations mentioned on this page work on the Tez engine? I could see SMB working only on MR.&lt;/P&gt;</description>
      <pubDate>Fri, 09 Jun 2017 22:11:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/What-is-Sort-Merge-Bucket-SMB-Join-in-Hive-When-it-is-used/m-p/123660#M86404</guid>
      <dc:creator>viswanath_kammu</dc:creator>
      <dc:date>2017-06-09T22:11:47Z</dc:date>
    </item>
  </channel>
</rss>

