<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Kudu T-server data distribution in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/318901#M227591</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/29186"&gt;@kingpin&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I did execute the script, ran rebalance report &amp;amp; did a rebalance too however the result I was looking for was not archived (space is still over consumed in 1 TS). I think rebalance just distributes tablets evenly to all TS what I am looking to achieve is like HDFS rebalancer and I don’t think it is there in Kudu, correct me if I am wrong.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&amp;nbsp;&lt;/P&gt;&lt;P&gt;Wert&lt;/P&gt;</description>
    <pubDate>Fri, 18 Jun 2021 03:43:27 GMT</pubDate>
    <dc:creator>wert_1311</dc:creator>
    <dc:date>2021-06-18T03:43:27Z</dc:date>
    <item>
      <title>Kudu T-server data distribution</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/318849#M227560</link>
      <description>&lt;P&gt;Hello,&lt;BR /&gt;I would like some guidance/ information on data distribution in Kudu T-Servers.&lt;BR /&gt;We have Kudu cluster of 3 Masters and 9 T-Servers (each t-server has storage of 1TB). We are noticing that space in some t-server is getting consumed rapidly whereas in other its not that much being consumed. Would like to know why this is happening and is there any way this can be overcome, so that data can be distributed evenly across of 9 t-servers.&lt;/P&gt;&lt;P&gt;Kudu 1.7.0-cdh5.16.2/ CM 5.16.2&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Appreciate any assistance in this regard.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Wert&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 14:42:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/318849#M227560</guid>
      <dc:creator>wert_1311</dc:creator>
      <dc:date>2022-09-16T14:42:02Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu T-server data distribution</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/318891#M227584</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/29490"&gt;@wert_1311&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Check for Tablet distribution across tablet servers. For some reason if one tablet server goes down/unavailable, the data will be replicated to other tablet servers.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You get can get number of tablets per tablet server using this command :-&lt;/P&gt;&lt;P&gt;sudo -u kudu kudu table list &amp;lt;csv of master addresses&amp;gt;&amp;nbsp; -list_tablets | grep "^&amp;nbsp;&amp;nbsp;&amp;nbsp; " | cut -d' ' -f6,7 | sort | uniq -c&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If you find the tablet distribution is uneven. You can go ahead with kudu rebalance tool to balance your cluster.&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.cloudera.com/runtime/7.2.2/administering-kudu/topics/kudu-running-tablet-rebalancing-tool.html" target="_blank"&gt;https://docs.cloudera.com/runtime/7.2.2/administering-kudu/topics/kudu-running-tablet-rebalancing-tool.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Let me know how did that go.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If that answers your question, Please mark this post as "accept as solution"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jun 2021 19:39:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/318891#M227584</guid>
      <dc:creator>kingpin</dc:creator>
      <dc:date>2021-06-17T19:39:47Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu T-server data distribution</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/318901#M227591</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/29186"&gt;@kingpin&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I did execute the script, ran rebalance report &amp;amp; did a rebalance too however the result I was looking for was not archived (space is still over consumed in 1 TS). I think rebalance just distributes tablets evenly to all TS what I am looking to achieve is like HDFS rebalancer and I don’t think it is there in Kudu, correct me if I am wrong.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&amp;nbsp;&lt;/P&gt;&lt;P&gt;Wert&lt;/P&gt;</description>
      <pubDate>Fri, 18 Jun 2021 03:43:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/318901#M227591</guid>
      <dc:creator>wert_1311</dc:creator>
      <dc:date>2021-06-18T03:43:27Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu T-server data distribution</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/319045#M227658</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/29490"&gt;@wert_1311&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;That's right, balancer just balances the tablet across the kudu cluster. If one host is consuming more space, it could be that the size of tablets is huge.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thats right, Kudu cant rebalance like HDFS based on dfs usage.&amp;nbsp;&lt;/P&gt;&lt;P&gt;one of the workaround you can try:-&lt;/P&gt;&lt;P&gt;- Stop that specific kudu TS role&lt;/P&gt;&lt;P&gt;- Run ksck until it comes healthy.&amp;nbsp;&lt;/P&gt;&lt;P&gt;- once ksck is healthy, rebuild that particular Kudu TS (rebuilding = wiping all data and wal dir)&lt;/P&gt;&lt;P&gt;&lt;A href="https://kudu.apache.org/docs/administration.html#rebuilding_kudu" target="_blank"&gt;https://kudu.apache.org/docs/administration.html#rebuilding_kudu&lt;/A&gt;&lt;/P&gt;&lt;P&gt;- start that specific TS&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Run rebalance again&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;That should help. Let me know how did that go.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;~ If that answers your question - Please&amp;nbsp; give the thumbs up &amp;amp; mark the post as accept as solution.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Jun 2021 08:29:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/319045#M227658</guid>
      <dc:creator>kingpin</dc:creator>
      <dc:date>2021-06-21T08:29:51Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu T-server data distribution</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/319148#M227692</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/29490"&gt;@wert_1311&lt;/a&gt;, has &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/29186"&gt;@kingpin&lt;/a&gt;'s reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 23 Jun 2021 05:29:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kudu-T-server-data-distribution/m-p/319148#M227692</guid>
      <dc:creator>VidyaSargur</dc:creator>
      <dc:date>2021-06-23T05:29:49Z</dc:date>
    </item>
  </channel>
</rss>

