<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: KUDU master consensus conflicts in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/KUDU-master-consensus-conflicts/m-p/397550#M249895</link>
    <description>&lt;P&gt;Hi Team as per the issue Master A is the fault one because once its in line there is no election happening please follow the below step to sync it you need to cleansafe the data from this master and replicate it back from leader master&lt;/P&gt;&lt;P class="p1"&gt;A)stop the problematic Master&lt;/P&gt;&lt;P class="p1"&gt;B)sudo -u kudu kudu local_replica delete --fs_wal_dir=/wal/kudu/wal --fs_data_dirs=/wal/kudu/data 00000000000000000000000000000000 -clean_unsafe&lt;/P&gt;&lt;P class="p1"&gt;C)sudo -u kudu kudu local_replica copy_from_remote --fs_wal_dir=/wal/kudu/wal --fs_data_dirs=/wal/kudu/data 00000000000000000000000000000000 &amp;lt;active_leader_fqdn&amp;gt;:7051&lt;/P&gt;&lt;P class="p1"&gt;D)stop remaining two masters&lt;/P&gt;&lt;P class="p1"&gt;E)then start all the three masters.&lt;/P&gt;</description>
    <pubDate>Fri, 15 Nov 2024 10:07:07 GMT</pubDate>
    <dc:creator>Asfahan</dc:creator>
    <dc:date>2024-11-15T10:07:07Z</dc:date>
    <item>
      <title>KUDU master consensus conflicts</title>
      <link>https://community.cloudera.com/t5/Support-Questions/KUDU-master-consensus-conflicts/m-p/357448#M237598</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;In a CDP 7.1.6 + Cloudera Manager 7.3.1 cluster.&lt;/P&gt;&lt;P&gt;3Masters +3Workers.&lt;/P&gt;&lt;P&gt;I'm getting the error all the time:&amp;nbsp;Corruption: master consensus error: there are master consensus conflicts&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is the cluster ksck:&lt;/P&gt;&lt;P&gt;Master Summary&lt;BR /&gt;UUID | Address | Status&lt;BR /&gt;----------------------------------+--------------------------+---------&lt;BR /&gt;5620e4a103894151b7bdee5e436f37d8 | master-2.local | HEALTHY&lt;BR /&gt;9cea3b56cc9b4be4846a02c0d89be753 | master-1.local | HEALTHY&lt;BR /&gt;a98a1f26d0254293b6e17e9daf8f6ef8 | master-3.local | HEALTHY&lt;BR /&gt;All reported replicas are:&lt;BR /&gt;A = 9cea3b56cc9b4be4846a02c0d89be753&lt;BR /&gt;B = 5620e4a103894151b7bdee5e436f37d8&lt;BR /&gt;C = a98a1f26d0254293b6e17e9daf8f6ef8&lt;BR /&gt;The consensus matrix is:&lt;BR /&gt;Config source | Replicas | Current term | Config index | Committed?&lt;BR /&gt;---------------+--------------+--------------+--------------+------------&lt;BR /&gt;A | A B C | 10120 | -1 | Yes&lt;BR /&gt;B | A B* C | 10120 | -1 | Yes&lt;BR /&gt;C | A B* C | 10120 | -1 | Yes&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It seems the A node is not voting, this is the log output:&lt;BR /&gt;W1111 11:12:00.526211 18688 leader_election.cc:334] T 00000000000000000000000000000000 P 9cea3b56cc9b4be4846a02c0d89be753 [CANDIDATE]: Term 10122 pre-election: RPC error from VoteRequest() call to peer 5620e4a103894151b7bdee5e436f37d8 (master-2:7051): Network error: Client connection negotiation failed: client connection to 10.157.136.55:7051: connect: Connection refused (error 111)&lt;BR /&gt;W1111 11:12:22.683107 18688 leader_election.cc:334] T 00000000000000000000000000000000 P 9cea3b56cc9b4be4846a02c0d89be753 [CANDIDATE]: Term 10122 pre-election: RPC error from VoteRequest() call to peer 5620e4a103894151b7bdee5e436f37d8 (master-2:7051): Timed out: RequestConsensusVote RPC to 10.157.136.55:7051 timed out after 7.916s (SENT)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;there is conectivity:&lt;/P&gt;&lt;P&gt;# nc -z -v 10.157.136.55 7051&lt;BR /&gt;Ncat: Version 7.50 ( &lt;A href="https://nmap.org/ncat" target="_blank" rel="noopener"&gt;https://nmap.org/ncat&lt;/A&gt; )&lt;BR /&gt;Ncat: Connected to 10.157.136.55:7051.&lt;BR /&gt;Ncat: 0 bytes sent, 0 bytes received in 0.01 seconds.&lt;/P&gt;&lt;P&gt;and the masters have been restarted several times, and the whole cluster...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any idea to fix this? Thanks!&lt;/P&gt;</description>
      <pubDate>Fri, 11 Nov 2022 11:54:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/KUDU-master-consensus-conflicts/m-p/357448#M237598</guid>
      <dc:creator>Juanes</dc:creator>
      <dc:date>2022-11-11T11:54:33Z</dc:date>
    </item>
    <item>
      <title>Re: KUDU master consensus conflicts</title>
      <link>https://community.cloudera.com/t5/Support-Questions/KUDU-master-consensus-conflicts/m-p/397550#M249895</link>
      <description>&lt;P&gt;Hi Team as per the issue Master A is the fault one because once its in line there is no election happening please follow the below step to sync it you need to cleansafe the data from this master and replicate it back from leader master&lt;/P&gt;&lt;P class="p1"&gt;A)stop the problematic Master&lt;/P&gt;&lt;P class="p1"&gt;B)sudo -u kudu kudu local_replica delete --fs_wal_dir=/wal/kudu/wal --fs_data_dirs=/wal/kudu/data 00000000000000000000000000000000 -clean_unsafe&lt;/P&gt;&lt;P class="p1"&gt;C)sudo -u kudu kudu local_replica copy_from_remote --fs_wal_dir=/wal/kudu/wal --fs_data_dirs=/wal/kudu/data 00000000000000000000000000000000 &amp;lt;active_leader_fqdn&amp;gt;:7051&lt;/P&gt;&lt;P class="p1"&gt;D)stop remaining two masters&lt;/P&gt;&lt;P class="p1"&gt;E)then start all the three masters.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2024 10:07:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/KUDU-master-consensus-conflicts/m-p/397550#M249895</guid>
      <dc:creator>Asfahan</dc:creator>
      <dc:date>2024-11-15T10:07:07Z</dc:date>
    </item>
  </channel>
</rss>

