<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: __consumer_offsets  partition size is huge in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/343844#M234021</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/83706"&gt;@shrikantbm&lt;/a&gt;&amp;nbsp;Since we will need more details to resolve this, w&lt;SPAN&gt;e recommend&amp;nbsp;to raise a support case and get assistance.&amp;nbsp;Thanks.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 16 May 2022 18:47:18 GMT</pubDate>
    <dc:creator>DianaTorres</dc:creator>
    <dc:date>2022-05-16T18:47:18Z</dc:date>
    <item>
      <title>__consumer_offsets  partition size is huge</title>
      <link>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/343557#M233963</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have come across a issue where my disk is filling up because of topic&amp;nbsp;__consumer_offsets as i see the size as below.&lt;BR /&gt;24G __consumer_offsets-44&lt;BR /&gt;29G __consumer_offsets-33&lt;BR /&gt;42G __consumer_offsets-13&lt;BR /&gt;&lt;BR /&gt;How do i reduce the size of these partitions, from the logs below is the exception what i am seeing.&lt;BR /&gt;2022-04-05 03:03:30,041 ERROR kafka.log.LogCleaner: [kafka-log-cleaner-thread-0]: Error due to&lt;BR /&gt;java.lang.IllegalArgumentException: inconsistent range&lt;/P&gt;&lt;P&gt;CDH version is 5.16 and kafka is&amp;nbsp;&lt;SPAN&gt;KAFKA-3.1.1-1.3.1.1.p0.2&lt;BR /&gt;&lt;BR /&gt;Assuming that log cleaner can be crashed i tried referring below link but that did not help.&lt;BR /&gt;&lt;A href="https://my.cloudera.com/knowledge/Disk-utilisation-growing-because-Kafka-LogCleaner-stopped?id=86234" target="_blank" rel="noopener"&gt;https://my.cloudera.com/knowledge/Disk-utilisation-growing-because-Kafka-LogCleaner-stopped?id=86234&lt;/A&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Need your help and advise to fix the issue.&lt;BR /&gt;&lt;BR /&gt;Thanking in advance&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 10 May 2022 22:19:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/343557#M233963</guid>
      <dc:creator>shrikantbm</dc:creator>
      <dc:date>2022-05-10T22:19:13Z</dc:date>
    </item>
    <item>
      <title>Re: __consumer_offsets  partition size is huge</title>
      <link>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/343844#M234021</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/83706"&gt;@shrikantbm&lt;/a&gt;&amp;nbsp;Since we will need more details to resolve this, w&lt;SPAN&gt;e recommend&amp;nbsp;to raise a support case and get assistance.&amp;nbsp;Thanks.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 16 May 2022 18:47:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/343844#M234021</guid>
      <dc:creator>DianaTorres</dc:creator>
      <dc:date>2022-05-16T18:47:18Z</dc:date>
    </item>
    <item>
      <title>Re: __consumer_offsets  partition size is huge</title>
      <link>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/343947#M234050</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/83706"&gt;@shrikantbm&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;How old is the oldest log segment on those partitions?&lt;/P&gt;&lt;P&gt;What's the default log retention set for your cluster? Have you set any retention for the __consumer_offsets topic specifically?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers,&lt;/P&gt;&lt;P&gt;André&lt;/P&gt;</description>
      <pubDate>Wed, 18 May 2022 00:24:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/343947#M234050</guid>
      <dc:creator>araujo</dc:creator>
      <dc:date>2022-05-18T00:24:34Z</dc:date>
    </item>
    <item>
      <title>Re: __consumer_offsets  partition size is huge</title>
      <link>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/349662#M235717</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/83706"&gt;@shrikantbm&lt;/a&gt;&amp;amp; team,&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Yes, in this case, we need to check cleanup.policy of the topic&amp;nbsp;&lt;SPAN&gt;__consumer_offsets. If the existing cleanup.policy=compact then the log segment of this topic will not be deleted.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;You should follow the below steps to conclude and resolve this issue initially.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;1) Check what is current&amp;nbsp;cleanup.policy of the topic&amp;nbsp;__consumer_offsets. You can check it using the command:&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;kafka-topics.sh --bootstrap-server &amp;lt;broker-hostname:9092&amp;gt; --describe&lt;/P&gt;&lt;P class="p1"&gt;OR&lt;/P&gt;&lt;P class="p1"&gt;kafka-topics.sh --zookeeper &amp;lt;zookeeper-hostname:2181&amp;gt; --describe --topics-with-overrides&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;Note: topic_name is the name for which you are facing an issue&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;2)&amp;nbsp; If you want to clear the old log segment of this topic, then you should set cleanup.policy like cleanup.policy=compact,delete,retention.ms=&amp;lt;30days&amp;gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p2"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;compact = when the kafka-log is rolled over, it will be compacted&lt;/P&gt;&lt;P class="p1"&gt;delete - once the offset.retention.ms is reached, the older logs will be removed&lt;/P&gt;&lt;P class="p1"&gt;retention.ms=&amp;lt;30days&amp;gt;&lt;SPAN class="Apple-converted-space"&gt;&amp;nbsp; &lt;/SPAN&gt;&amp;gt; the old log segment will be deleted after 30 days.&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;Note: 30 days are just an example here and this setting will be in&amp;nbsp;ms. You should set it as per your requirement after checking it with the application team and their need.&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;For "delete", the property "log.cleaner.enable" must be set to "true"&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;After configuring this cleanup policy data will be deleted as per retention.ms as suggested above. If you will not set retention.ms then old log segment will be deleted as per retention period set in the CM / Ambari &amp;gt;&amp;gt; kafka &amp;gt;&amp;gt; Conf. The setting is log.retention.hours = &amp;lt;7 Days default&amp;gt; in CM &amp;gt;&amp;gt; Kafka, check what it is in your case so that log segment older than 7 days will be deleted. Kafka will keep checking the old log segment with the help of the property log.retention.check.interval.ms .&lt;/P&gt;&lt;P class="p2"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;Important note: The "delete" on consumer offsets is that you may lose offsets which can lead to duplication/data loss. So check it with your application team before setting a deletion policy.&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;3) If you still face the same issue, then broker logs need to be reviewed for the root cause of the issue and make the changes accordingly.&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;If you found this information helped with your query, please take a moment to log in and click on&lt;/P&gt;&lt;P class="p1"&gt;KUDOS&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; and "Accept as Solution" below this post.&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;Thank you.&lt;/P&gt;</description>
      <pubDate>Sat, 06 Aug 2022 16:17:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/consumer-offsets-partition-size-is-huge/m-p/349662#M235717</guid>
      <dc:creator>Babasaheb</dc:creator>
      <dc:date>2022-08-06T16:17:38Z</dc:date>
    </item>
  </channel>
</rss>

