<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Kudu Master Directories in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66008#M76819</link>
    <description>&lt;P&gt;&lt;STRONG&gt;&amp;gt; Should multiple directories be used for storing the Kudu master data?&lt;/STRONG&gt;&lt;BR /&gt;The master nodes generally don't see a huge amount of disk IO, as their role is&lt;BR /&gt;primarily focused tablet placement, rather than data storage. The reason&lt;BR /&gt;fs_data_dirs is plural for the master is that tablet servers and master nodes&lt;BR /&gt;leverage the same FS configuration code. Feel free to use a single directory.&lt;/P&gt;&lt;P&gt;I wouldn't expect it to bottleneck your cluster.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&amp;gt; Are there significant benefits of having multiple Kudu master data&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&amp;gt; directories or inherit risks with just a single master data directory?&lt;/STRONG&gt;&lt;BR /&gt;Not really. The master isn't a bottleneck for the most part, and they only&lt;BR /&gt;store a few GBs on disk. Also disk failures are not handled for masters as they&lt;BR /&gt;are on tablet servers, so the extra disks don't provide any added fault&lt;BR /&gt;tolerance either.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&amp;gt; I've read that SSDs are recommended for the WAL directories. Is there a major&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&amp;gt; performance impact if the WAL directory is on the same mount point as one of&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&amp;gt; the data directories?&lt;/STRONG&gt;&lt;BR /&gt;It's not uncommon to see this, where the fs_wal_dir is the same as the first&lt;BR /&gt;entry of fs_data_dirs. There is a caveat to this that in Kudu 1.5 and below,&lt;BR /&gt;the first data directory also stored tablet-specific metadata that is used for&lt;BR /&gt;the Raft consensus protocol, and we've seen this lead to occasional dips in&lt;BR /&gt;performance when tablet server ingest workloads coincide with periods of high&lt;BR /&gt;Raft election traffic. This is less relevant for masters, which generally don't&lt;BR /&gt;get bottlenecked by disk IO.&lt;/P&gt;</description>
    <pubDate>Tue, 03 Apr 2018 22:17:55 GMT</pubDate>
    <dc:creator>awong</dc:creator>
    <dc:date>2018-04-03T22:17:55Z</dc:date>
    <item>
      <title>Kudu Master Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66006#M76818</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We have just installed Kudu in our test environment, and are currently running CDH 5.13.1. Due to this being a small POC environment, we only have 2 tablet servers, and a single master, making it only usable for functional testing.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There were 4 requried configuration properties upon installation of Kudu with CDH 5.13, for whichthe following were configured:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kudu Master WAL Directory:&lt;/P&gt;&lt;P&gt;/data/kudu/master_wal&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kudu Master Data Directories&lt;/P&gt;&lt;P&gt;/data/kudu/master_wal&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kudu Tablet Server WAL Directory&lt;/P&gt;&lt;P&gt;/data1/kudu/tablet_wal&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kudu Tablet Server&amp;nbsp;Data Directories&lt;/P&gt;&lt;P&gt;/data1/kudu/tablet_data&lt;/P&gt;&lt;P&gt;/data2/kudu/tablet_data&lt;/P&gt;&lt;P&gt;/data3/kudu/tablet_data&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My question is concerning the Master data directories configuration property:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Should multiple directories be used for storing the Kudu master data?&amp;nbsp;It appears this is expected with the configuration property being plural, and it's set up to be configured similar to the tablet server data directories from Cloudera Manager. But&amp;nbsp;if the Kudu Master server resides on one of the master/utility nodes, then there are not multiple JBOD mount points like a worker node.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Are there significant benefits of having multiple Kudu master data directories or inherit risks with just a single master data directory? If we configured an additional master data directory on the OS disk (such as under /var or /opt), would this be a concern?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've read that SSDs are recommended for the WAL directories. Is there a major performance impact if the WAL directory is on the same mount point as one of the data directories?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;Braz&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 13:03:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66006#M76818</guid>
      <dc:creator>Braz</dc:creator>
      <dc:date>2022-09-16T13:03:41Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu Master Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66008#M76819</link>
      <description>&lt;P&gt;&lt;STRONG&gt;&amp;gt; Should multiple directories be used for storing the Kudu master data?&lt;/STRONG&gt;&lt;BR /&gt;The master nodes generally don't see a huge amount of disk IO, as their role is&lt;BR /&gt;primarily focused tablet placement, rather than data storage. The reason&lt;BR /&gt;fs_data_dirs is plural for the master is that tablet servers and master nodes&lt;BR /&gt;leverage the same FS configuration code. Feel free to use a single directory.&lt;/P&gt;&lt;P&gt;I wouldn't expect it to bottleneck your cluster.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&amp;gt; Are there significant benefits of having multiple Kudu master data&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&amp;gt; directories or inherit risks with just a single master data directory?&lt;/STRONG&gt;&lt;BR /&gt;Not really. The master isn't a bottleneck for the most part, and they only&lt;BR /&gt;store a few GBs on disk. Also disk failures are not handled for masters as they&lt;BR /&gt;are on tablet servers, so the extra disks don't provide any added fault&lt;BR /&gt;tolerance either.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&amp;gt; I've read that SSDs are recommended for the WAL directories. Is there a major&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&amp;gt; performance impact if the WAL directory is on the same mount point as one of&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&amp;gt; the data directories?&lt;/STRONG&gt;&lt;BR /&gt;It's not uncommon to see this, where the fs_wal_dir is the same as the first&lt;BR /&gt;entry of fs_data_dirs. There is a caveat to this that in Kudu 1.5 and below,&lt;BR /&gt;the first data directory also stored tablet-specific metadata that is used for&lt;BR /&gt;the Raft consensus protocol, and we've seen this lead to occasional dips in&lt;BR /&gt;performance when tablet server ingest workloads coincide with periods of high&lt;BR /&gt;Raft election traffic. This is less relevant for masters, which generally don't&lt;BR /&gt;get bottlenecked by disk IO.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Apr 2018 22:17:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66008#M76819</guid>
      <dc:creator>awong</dc:creator>
      <dc:date>2018-04-03T22:17:55Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu Master Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66039#M76820</link>
      <description>Thanks for these answers. Having issues with replying, but for your last answer concerning the WAL directory and the metadata, would you recommend having a separate directory for the Tablet Server WAL?&lt;BR /&gt;&lt;BR /&gt;Thanks</description>
      <pubDate>Wed, 04 Apr 2018 21:02:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66039#M76820</guid>
      <dc:creator>Braz</dc:creator>
      <dc:date>2018-04-04T21:02:57Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu Master Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66041#M76821</link>
      <description>&lt;P&gt;Yep, that would be ideal in that background flushes/compactions would not affect&amp;nbsp;write performance&amp;nbsp;and Raft elections.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Apr 2018 22:12:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66041#M76821</guid>
      <dc:creator>awong</dc:creator>
      <dc:date>2018-04-04T22:12:28Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu Master Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66072#M76822</link>
      <description>Okay, and I hope I'm not asking too much in this one forum post, but since it's related: what is the recommended number of Tablet Server directories?&lt;BR /&gt;&lt;BR /&gt;Could we number of directories to each JBOD disk used by the DataNode? Of course without using a sub-directory of the DataNode.</description>
      <pubDate>Thu, 05 Apr 2018 17:06:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66072#M76822</guid>
      <dc:creator>Braz</dc:creator>
      <dc:date>2018-04-05T17:06:31Z</dc:date>
    </item>
    <item>
      <title>Re: Kudu Master Directories</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66073#M76823</link>
      <description>&lt;P&gt;That is up to your workload and how much storage you need per node. It's common to see anywhere from 6 to 12 disks per tablet server. Check out the &lt;A href="https://www.cloudera.com/documentation/enterprise/5-13-x/topics/kudu_limitations.html#concept_cws_n4n_5z" target="_blank"&gt;limitations&amp;nbsp;documentation&lt;/A&gt;&amp;nbsp;for some guidance there.&lt;/P&gt;</description>
      <pubDate>Thu, 05 Apr 2018 17:24:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Kudu-Master-Directories/m-p/66073#M76823</guid>
      <dc:creator>awong</dc:creator>
      <dc:date>2018-04-05T17:24:16Z</dc:date>
    </item>
  </channel>
</rss>

