<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Location in disk where data is flushed Kafka in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/340007#M233239</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/11191"&gt;@araujo&lt;/a&gt;&amp;nbsp;Thanks for the answer. I am not saying where the logs are saved but where the data is saved when it is flushed. In the documentation of the log.flush.interval.messages says "&lt;SPAN&gt;The number of messages accumulated on a log partition before messages are flushed to disk". What I refer in this question is about the location where the messages are flushed to disk after receiving 10 messages (in the case of the previous example). At first I thought the properties you are talking about could be also used to specify the directory where I wanted the messages to be flushed to, but they aren't.&lt;/SPAN&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 30 Mar 2022 08:02:44 GMT</pubDate>
    <dc:creator>PabloO</dc:creator>
    <dc:date>2022-03-30T08:02:44Z</dc:date>
    <item>
      <title>Location in disk where data is flushed Kafka</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/339904#M233211</link>
      <description>&lt;P&gt;I have a basic kafka 2.13_3.10 cluster with one broker, consumer and producer.&lt;/P&gt;&lt;P&gt;I was testing if the "log.flush.interval.messages = 10"&lt;/P&gt;&lt;P&gt;worked properly. It is supposed to write the kafka messages of the topics after 10 messages, but where are they saved in the disk by default??&lt;/P&gt;&lt;P&gt;And, is there any way to specify the directory where you want data to be flushed overriding the default configuration?&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Tue, 29 Mar 2022 11:18:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/339904#M233211</guid>
      <dc:creator>PabloO</dc:creator>
      <dc:date>2022-03-29T11:18:43Z</dc:date>
    </item>
    <item>
      <title>Re: Location in disk where data is flushed Kafka</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/339985#M233233</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/96930"&gt;@PabloO&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The logs are written to the directories configured in the&amp;nbsp;&lt;FONT face="courier new,courier"&gt;log.dirs&lt;/FONT&gt; and/or&amp;nbsp;&lt;FONT face="courier new,courier"&gt;log.dir&lt;/FONT&gt; properties of the Kafka broker. You can modify those properties to configured to the broker to use the directory that you want.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers,&lt;/P&gt;&lt;P&gt;André&lt;/P&gt;</description>
      <pubDate>Wed, 30 Mar 2022 00:50:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/339985#M233233</guid>
      <dc:creator>araujo</dc:creator>
      <dc:date>2022-03-30T00:50:05Z</dc:date>
    </item>
    <item>
      <title>Re: Location in disk where data is flushed Kafka</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/340007#M233239</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/11191"&gt;@araujo&lt;/a&gt;&amp;nbsp;Thanks for the answer. I am not saying where the logs are saved but where the data is saved when it is flushed. In the documentation of the log.flush.interval.messages says "&lt;SPAN&gt;The number of messages accumulated on a log partition before messages are flushed to disk". What I refer in this question is about the location where the messages are flushed to disk after receiving 10 messages (in the case of the previous example). At first I thought the properties you are talking about could be also used to specify the directory where I wanted the messages to be flushed to, but they aren't.&lt;/SPAN&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Mar 2022 08:02:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/340007#M233239</guid>
      <dc:creator>PabloO</dc:creator>
      <dc:date>2022-03-30T08:02:44Z</dc:date>
    </item>
    <item>
      <title>Re: Location in disk where data is flushed Kafka</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/340012#M233241</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/96930"&gt;@PabloO&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In Kafka's terminology, a topic is a "distributed logs". The data for each topic's partitions is saved in what's called "log segment files".&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, the "log.dirs" and "log.dir" parameters point to the directories where the actual messages are saved, *not* the "error logs".&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For example, if your "log.dirs" is set to "/data1" and you have a topic named "mytopic". The data for the partition 0 of that topic will be saved in files under the directory "/data1/mytopic-0".&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers,&lt;/P&gt;&lt;P&gt;André&lt;/P&gt;</description>
      <pubDate>Wed, 30 Mar 2022 09:14:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Location-in-disk-where-data-is-flushed-Kafka/m-p/340012#M233241</guid>
      <dc:creator>araujo</dc:creator>
      <dc:date>2022-03-30T09:14:31Z</dc:date>
    </item>
  </channel>
</rss>

