<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Number of regions increases after major compaction in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126158#M88895</link>
    <description>&lt;P&gt;We have a table.&lt;/P&gt;&lt;P&gt;The size of the table is 8.7 GB and the number of regions was 5.&lt;/P&gt;&lt;P&gt;We ran a major compaction on the table.&lt;/P&gt;&lt;P&gt;The size increased to 21.7 GB, but after some time it came back down to 8.7 GB as before. However, the number of regions increased from 5 to 27, then came down to 17, and never came back down to 5.&lt;/P&gt;&lt;P&gt;Why did the number of regions increase from 5 to 17 although the size of the data remains the same?&lt;/P&gt;</description>
    <pubDate>Wed, 13 Jul 2016 08:21:49 GMT</pubDate>
    <dc:creator>sunny11541</dc:creator>
    <dc:date>2016-07-13T08:21:49Z</dc:date>
    <item>
      <title>Number of regions increases after major compaction</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126158#M88895</link>
      <description>&lt;P&gt;We have a table.&lt;/P&gt;&lt;P&gt;The size of the table is 8.7 GB and the number of regions was 5.&lt;/P&gt;&lt;P&gt;We ran a major compaction on the table.&lt;/P&gt;&lt;P&gt;The size increased to 21.7 GB, but after some time it came back down to 8.7 GB as before. However, the number of regions increased from 5 to 27, then came down to 17, and never came back down to 5.&lt;/P&gt;&lt;P&gt;Why did the number of regions increase from 5 to 17 although the size of the data remains the same?&lt;/P&gt;</description>
      <pubDate>Wed, 13 Jul 2016 08:21:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126158#M88895</guid>
      <dc:creator>sunny11541</dc:creator>
      <dc:date>2016-07-13T08:21:49Z</dc:date>
    </item>
    <item>
      <title>Re: Number of regions increases after major compaction</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126159#M88896</link>
      <description>&lt;P&gt;The size of a table, in bytes, is not necessarily tied to the number of regions. For example, a change in configuration might cause more or fewer regions for the same amount of data.&lt;/P&gt;&lt;P&gt;I don't have any definitive explanation for why you saw the number of regions spike to 27; it might have just been transient. The number of regions likely increased from 5 to 17 due to splitting of the regions in this table as a part of the compaction.&lt;/P&gt;&lt;P&gt;You can investigate the RegionServer and Master logs on your cluster for the given table to understand if the regions underwent any splits. There are many reasons that the number of regions might have increased -- it is hard to say definitively why, given the information you have provided so far.&lt;/P&gt;&lt;P&gt;I would not be worried about 17 regions instead of only 5.&lt;/P&gt;</description>
      <pubDate>Wed, 13 Jul 2016 08:53:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126159#M88896</guid>
      <dc:creator>elserj</dc:creator>
      <dc:date>2016-07-13T08:53:44Z</dc:date>
    </item>
    <item>
      <title>Re: Number of regions increases after major compaction</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126160#M88897</link>
      <description>&lt;P&gt;Hi Josh,&lt;/P&gt;&lt;P&gt;Thanks for the reply.&lt;/P&gt;&lt;P&gt;I have a table in production that holds only 1TB of data, and max.region.size is set to 10GB.&lt;/P&gt;&lt;P&gt;I would expect 100 - 200 regions for this dataset, but I see ~800 regions for the table.&lt;/P&gt;&lt;P&gt;----------&lt;/P&gt;&lt;P&gt;I created a demo table and shared the results.&lt;/P&gt;&lt;P&gt;In the test:&lt;/P&gt;&lt;P&gt;The data size is 9GB, which is less than the max region size (10GB), yet the table has 5 regions.&lt;/P&gt;&lt;P&gt;Why did the table grow to 5 regions in the first place when 1 region was enough? The table was not pre-split.&lt;/P&gt;&lt;P&gt;Before the major compaction, all 5 regions held less than 10GB of data and no new data was added. Why, then, would a major compaction increase the number of regions? It should only have merged the HFiles into a single HFile for each of the 5 regions.&lt;/P&gt;&lt;P&gt;Any information or explanation will help.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 13 Jul 2016 21:28:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126160#M88897</guid>
      <dc:creator>sunny11541</dc:creator>
      <dc:date>2016-07-13T21:28:50Z</dc:date>
    </item>
    <item>
      <title>Re: Number of regions increases after major compaction</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126161#M88898</link>
      <description>&lt;P&gt;
	&lt;A rel="user" href="https://community.cloudera.com/users/11799/sunny1154.html" nodeid="11799"&gt;@sunny malik&lt;/A&gt;, are you still using the &lt;CODE&gt;org.apache.hadoop.hbase.regionserver.IncreasingToUpperBoundRegionSplitPolicy&lt;/CODE&gt; split policy? Look at &lt;CODE&gt;hbase.regionserver.region.split.policy&lt;/CODE&gt; in hbase-site.xml.&lt;/P&gt;&lt;P&gt;
	Try using the ConstantSizeRegionSplitPolicy instead, which will only split a region once it reaches 10GB. The IncreasingToUpperBoundRegionSplitPolicy splits more aggressively in the beginning and lets regions grow larger before splitting as the number of regions for the table increases.&lt;/P&gt;&lt;P&gt;You can find more information in the &lt;A href="http://hbase.apache.org/book.html#config.files"&gt;HBase Book&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Wed, 13 Jul 2016 21:36:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126161#M88898</guid>
      <dc:creator>elserj</dc:creator>
      <dc:date>2016-07-13T21:36:43Z</dc:date>
    </item>
    <item>
      <title>Re: Number of regions increases after major compaction</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126162#M88899</link>
      <description>&lt;P&gt;Thanks, guys, for sharing your thoughts.&lt;/P&gt;&lt;P&gt;We were using &lt;B&gt;IncreasingToUpperBoundRegionSplitPolicy&lt;/B&gt; and have now changed it to &lt;B&gt;ConstantSizeRegionSplitPolicy&lt;/B&gt;.&lt;/P&gt;&lt;P&gt;That solved the mystery.&lt;/P&gt;&lt;P&gt;Thanks again for the help!&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jul 2016 07:07:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126162#M88899</guid>
      <dc:creator>sunny11541</dc:creator>
      <dc:date>2016-07-14T07:07:19Z</dc:date>
    </item>
    <item>
      <title>Re: Number of regions increases after major compaction</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126163#M88900</link>
      <description>&lt;P&gt;Thanks, mqureshi, for the reference doc; that is exactly what happened.&lt;/P&gt;&lt;P&gt;............&lt;/P&gt;&lt;P&gt;I created a new table and loaded data 6 times, which created a single region for the table with 6 HFiles.&lt;/P&gt;&lt;P&gt;The total size of the table was 24.2GB, with a 10GB region limit.&lt;/P&gt;&lt;P&gt;I ran a major compaction; it created 12 new regions and deleted the parent region.&lt;/P&gt;&lt;P&gt;............&lt;/P&gt;&lt;P&gt;It looks like whenever a split happens, the number of new regions follows a formula.&lt;/P&gt;&lt;P&gt;My guess at the formula:&lt;/P&gt;&lt;P&gt;new regions added after split = ~(number of HFiles * 2) - 1 (original region is removed)&lt;/P&gt;&lt;P&gt;Or is there an actual way to determine the number of regions after a split?&lt;/P&gt;</description>
      <pubDate>Mon, 18 Jul 2016 06:04:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/126163#M88900</guid>
      <dc:creator>sunny11541</dc:creator>
      <dc:date>2016-07-18T06:04:46Z</dc:date>
    </item>
    <item>
      <title>Re: Number of regions increases after major compaction</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/294039#M217008</link>
      <description>&lt;P&gt;I am using &lt;A href="https://hbase.apache.org/devapidocs/org/apache/hadoop/hbase/regionserver/ConstantSizeRegionSplitPolicy.html" target="_blank"&gt;ConstantSizeRegionSplitPolicy&lt;/A&gt; and MaxFileSize is set to 30 GB.&lt;/P&gt;&lt;P&gt;However, I found that regions are not split when the file size reaches 30 GB.&lt;/P&gt;&lt;P&gt;In some regions the file size is 300 GB.&lt;/P&gt;&lt;P&gt;Can you please help me solve this problem?&lt;/P&gt;&lt;P&gt;I have a huge volume of data: 10 TB.&lt;/P&gt;</description>
      <pubDate>Wed, 15 Apr 2020 12:10:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Number-of-regions-increases-after-major-compaction/m-p/294039#M217008</guid>
      <dc:creator>raus</dc:creator>
      <dc:date>2020-04-15T12:10:30Z</dc:date>
    </item>
  </channel>
</rss>

