<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Kafka logs folder is growing for more than 100 G in 24 hours. in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Kafka-logs-folder-is-growing-for-more-than-100-G-in-24-hours/m-p/333449#M231436</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I've a single Kafka node configured which receives data from telegraf agents and then it is passed to Influx Database.&lt;/P&gt;&lt;P&gt;I've a log retention set for 1h in kafka.server.properties file as -&amp;nbsp;log.retention.hours=1&lt;/P&gt;&lt;P&gt;But Kafka is having log queues formed and log getting full in 24 hours of time:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;kafka/bin/kafka-run-class.sh kafka.admin.ConsumerGroupCommand --bootstrap-server localhost:9101 --describe --group kafka_consumer --command-config admin.props | awk '/&amp;lt;topic&amp;gt;/{print $5}'&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;-20333796490&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is it because of the throughput when logs can grow more than for an hour if not consumed?&lt;/P&gt;&lt;P&gt;In another environment where the load is even more, this issue is not observed.&lt;/P&gt;&lt;P&gt;Any suggestion?&lt;/P&gt;</description>
    <pubDate>Mon, 10 Jan 2022 07:47:35 GMT</pubDate>
    <dc:creator>danurag</dc:creator>
    <dc:date>2022-01-10T07:47:35Z</dc:date>
    <item>
      <title>Kafka logs folder is growing for more than 100 G in 24 hours.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kafka-logs-folder-is-growing-for-more-than-100-G-in-24-hours/m-p/333449#M231436</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I've a single Kafka node configured which receives data from telegraf agents and then it is passed to Influx Database.&lt;/P&gt;&lt;P&gt;I've a log retention set for 1h in kafka.server.properties file as -&amp;nbsp;log.retention.hours=1&lt;/P&gt;&lt;P&gt;But Kafka is having log queues formed and log getting full in 24 hours of time:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;kafka/bin/kafka-run-class.sh kafka.admin.ConsumerGroupCommand --bootstrap-server localhost:9101 --describe --group kafka_consumer --command-config admin.props | awk '/&amp;lt;topic&amp;gt;/{print $5}'&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;-20333796490&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is it because of the throughput when logs can grow more than for an hour if not consumed?&lt;/P&gt;&lt;P&gt;In another environment where the load is even more, this issue is not observed.&lt;/P&gt;&lt;P&gt;Any suggestion?&lt;/P&gt;</description>
      <pubDate>Mon, 10 Jan 2022 07:47:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kafka-logs-folder-is-growing-for-more-than-100-G-in-24-hours/m-p/333449#M231436</guid>
      <dc:creator>danurag</dc:creator>
      <dc:date>2022-01-10T07:47:35Z</dc:date>
    </item>
    <item>
      <title>Re: Kafka logs folder is growing for more than 100 G in 24 hours.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kafka-logs-folder-is-growing-for-more-than-100-G-in-24-hours/m-p/334003#M231587</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/94886"&gt;@danurag&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;It's recommended to set up retention at the topic level (unless you want all your topics to use 24 hours by default), example:&lt;/P&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;kafka-configs --bootstrap-server &amp;lt;brokerHost:brokerPort&amp;gt; --alter --entity-type topics --entity-name &amp;lt;topicName&amp;gt; --add-config&amp;nbsp;&lt;/SPAN&gt;retention.ms&lt;SPAN&gt;=3600000&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;P class="p1"&gt;The most common configuration for how long Kafka will retain messages is by time.&amp;nbsp;The default is specified in the configuration file using the &lt;SPAN class="s1"&gt;log.retention.hours&amp;nbsp;&lt;/SPAN&gt;parameter, and it is set to 168 hours, or one week. However, there are two other&amp;nbsp;parameters allowed, &lt;SPAN class="s1"&gt;log.retention.minutes &lt;/SPAN&gt;and &lt;SPAN class="s1"&gt;log.retention.ms&lt;/SPAN&gt;. All three of&lt;/P&gt;&lt;P class="p1"&gt;these control the same goal (the amount of time after which messages may be&amp;nbsp;deleted) but the recommended parameter to use is &lt;SPAN class="s1"&gt;log.retention.ms&lt;/SPAN&gt;, as the smaller&amp;nbsp;unit size will take precedence if more than one is specified. This will make sure that the value set for &lt;SPAN class="s1"&gt;log.retention.ms &lt;/SPAN&gt;is always the one used. If more than one is specified, the smaller unit size will take precedence.&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 17 Jan 2022 13:17:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kafka-logs-folder-is-growing-for-more-than-100-G-in-24-hours/m-p/334003#M231587</guid>
      <dc:creator>ManuelCalvo</dc:creator>
      <dc:date>2022-01-17T13:17:07Z</dc:date>
    </item>
  </channel>
</rss>

