<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hbase region split in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108384#M38223</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11799/sunny1154.html" nodeid="11799"&gt;@sunny malik&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Which HBase version are you using? The default split policy for HBase 0.94 and above is not based on a fixed size. It is "IncreasingToUpperBoundRegionSplitPolicy". Assuming this is your split policy, and given that your regions are smaller than 3.5GB, what you are seeing is expected behavior.&lt;/P&gt;&lt;P&gt;IncreasingToUpperBoundRegionSplitPolicy splits more aggressively, based on the number of regions hosted on the same region server. The policy takes the split size as min(R^2 * “hbase.hregion.memstore.flush.size”, “hbase.hregion.max.filesize”), where R is the number of regions of the same table hosted on the same region server. For example, with the default memstore flush size of 128MB and the default max store file size of 10GB, the first region on the region server is split just after the first flush at 128MB. As the number of regions hosted on the region server increases, the policy uses increasing split sizes: 512MB, 1152MB, 2GB, 3.2GB, 4.6GB, 6.2GB, etc. Once there are 9 regions, the computed split size exceeds the configured “hbase.hregion.max.filesize”, and a 10GB split size is used from then on.&lt;/P&gt;&lt;P&gt;Please see the following link.&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/blog/apache-hbase-region-splitting-and-merging/" target="_blank"&gt;http://hortonworks.com/blog/apache-hbase-region-splitting-and-merging/&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 18 Aug 2016 06:21:04 GMT</pubDate>
    <dc:creator>mqureshi</dc:creator>
    <dc:date>2016-08-18T06:21:04Z</dc:date>
    <item>
      <title>Hbase region split</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108382#M38221</link>
      <description>&lt;P&gt;Is it possible for the load balancer to create new regions?&lt;/P&gt;&lt;P&gt;I have a table with 100 regions, and every region is smaller than 3.5 GB.&lt;/P&gt;&lt;P&gt;The max region size is set to 10 GB (the default).&lt;/P&gt;&lt;P&gt;The daily load goes through the bulk loader and adds approximately 250 MB to each region, but I see new regions added to this table every day and I do not know why.&lt;/P&gt;&lt;P&gt;New regions should be created only if a region grows beyond 10 GB and a region split happens.&lt;/P&gt;&lt;P&gt;Any help or thoughts are appreciated.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Aug 2016 04:05:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108382#M38221</guid>
      <dc:creator>sunny11541</dc:creator>
      <dc:date>2016-08-18T04:05:49Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region split</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108383#M38222</link>
      <description>&lt;P&gt;Can you provide more information, for example by attaching the region server log?&lt;/P&gt;&lt;P&gt;The load balancer would not create new regions.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Aug 2016 04:45:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108383#M38222</guid>
      <dc:creator>tyu</dc:creator>
      <dc:date>2016-08-18T04:45:44Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region split</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108384#M38223</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11799/sunny1154.html" nodeid="11799"&gt;@sunny malik&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Which HBase version are you using? The default split policy for HBase 0.94 and above is not based on a fixed size. It is "IncreasingToUpperBoundRegionSplitPolicy". Assuming this is your split policy, and given that your regions are smaller than 3.5GB, what you are seeing is expected behavior.&lt;/P&gt;&lt;P&gt;IncreasingToUpperBoundRegionSplitPolicy splits more aggressively, based on the number of regions hosted on the same region server. The policy takes the split size as min(R^2 * “hbase.hregion.memstore.flush.size”, “hbase.hregion.max.filesize”), where R is the number of regions of the same table hosted on the same region server. For example, with the default memstore flush size of 128MB and the default max store file size of 10GB, the first region on the region server is split just after the first flush at 128MB. As the number of regions hosted on the region server increases, the policy uses increasing split sizes: 512MB, 1152MB, 2GB, 3.2GB, 4.6GB, 6.2GB, etc. Once there are 9 regions, the computed split size exceeds the configured “hbase.hregion.max.filesize”, and a 10GB split size is used from then on.&lt;/P&gt;&lt;P&gt;Please see the following link.&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/blog/apache-hbase-region-splitting-and-merging/" target="_blank"&gt;http://hortonworks.com/blog/apache-hbase-region-splitting-and-merging/&lt;/A&gt;&lt;/P&gt;</description>
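      <content:encoded><![CDATA[
The split-size formula quoted in this reply can be checked with a few lines of arithmetic. What follows is only a sketch, assuming the defaults mentioned in the thread (128MB memstore flush size, 10GB max file size) and binary units. The function name split_size and its parameters are hypothetical and are not part of the HBase API.

```python
MB = 1024 * 1024
GB = 1024 * MB


def split_size(regions_on_server, flush_size=128 * MB, max_file_size=10 * GB):
    """Return min(R^2 * flush size, max file size), the formula quoted in the reply.

    regions_on_server is R: the number of regions of the same table hosted
    on one region server. All names here are illustrative assumptions.
    """
    if regions_on_server < 1:
        raise ValueError("regions_on_server must be at least 1")
    return min(regions_on_server ** 2 * flush_size, max_file_size)


# Print the split threshold for 1 to 10 regions of the table on one server.
for r in range(1, 11):
    print(f"R={r:2d}: split at {split_size(r) // MB} MB")
```

Under these assumptions the threshold stays below the 10GB cap for R from 1 to 8 and is capped from R = 9 onward. A server hosting five regions of the table would use a 3200MB threshold, so regions well under 10GB can still be split.
]]></content:encoded>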
      <pubDate>Thu, 18 Aug 2016 06:21:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108384#M38223</guid>
      <dc:creator>mqureshi</dc:creator>
      <dc:date>2016-08-18T06:21:04Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region split</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108385#M38224</link>
      <description>&lt;P&gt;Yes, with IncreasingToUpperBoundRegionSplitPolicy a region can be split while it is still far from the maximum size. This is expected behavior. The reason is that HBase tries to create many regions while they are small and distribute them across the cluster. You will need to switch to ConstantSizeRegionSplitPolicy if you do not want this.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;hbase.regionserver.region.split.policy controls the setting per HBase table.&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 19 Aug 2016 07:12:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hbase-region-split/m-p/108385#M38224</guid>
      <dc:creator>vrodionov</dc:creator>
      <dc:date>2016-08-19T07:12:32Z</dc:date>
    </item>
  </channel>
</rss>

