<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Kudo table not upserting data in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Kudo-table-not-upserting-data/m-p/403673#M252157</link>
    <description>&lt;P&gt;Hi-&lt;/P&gt;&lt;DIV class="p-rich_text_section"&gt;There are number of things to note here:&lt;/DIV&gt;&lt;OL class="p-rich_text_list p-rich_text_list__ordered p-rich_text_list--nested"&gt;&lt;LI&gt;Replication factor seems to be 1 for above listed tables which is not a good setting given customer is facing failures and running into unavailability situation. This needs to be at least 3 in order to deal with transient errors or even a permanent data loss on a replica.&lt;/LI&gt;&lt;LI&gt;If the ksck report is showing every TS up and running, where is the problem now? Is there any tablet which is not accessible (for read/write) to the user?&lt;/LI&gt;&lt;LI&gt;You could list all the tablets using&lt;SPAN&gt;&amp;nbsp;"&lt;/SPAN&gt;kudu table list &amp;lt;master_addresses&amp;gt; --list_tablets" and search for the tablet server hosting the replica in question from the output;&amp;nbsp;&lt;SPAN&gt;you can search for tablet uuid in old logs from the time when issue was seen, from there corresponding tablet server entry should have logged at some point.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;But - generally speaking - if your KSCK is fixed and clean - let's say, it finished bootstrapping and/or copying after some time - and you still see INSERT problems from Spark - you need to open a case with Cloudera since the issue may require certain configs' evaluation and tuning.&lt;/LI&gt;&lt;/OL&gt;</description>
    <pubDate>Thu, 06 Mar 2025 19:31:25 GMT</pubDate>
    <dc:creator>Boris G</dc:creator>
    <dc:date>2025-03-06T19:31:25Z</dc:date>
    <item>
      <title>Kudo table not upserting data</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kudo-table-not-upserting-data/m-p/403550#M252098</link>
      <description>&lt;P&gt;Hi, I’m currently experiencing an issue with inserting data into a Kudu table using Spark. Although I can successfully read the data via Hue, the insert operation through Spark is failing. Upon reviewing the tablet details for the table, I noticed an issue highlighted with the tablets, but I'm uncertain about the next steps to resolve it.&lt;/P&gt;&lt;P&gt;sudo -u kudu kudu cluster ksck tstewputil1&lt;BR /&gt;&lt;BR /&gt;Connected to the Master&lt;BR /&gt;Fetched info from all 3 Tablet Servers&lt;BR /&gt;Tablet 0d811e918fd94b31aed4bfe337982ea8 of table 'impala::mrs.NumberofSubs' is unavailable: 1 replica(s) not RUNNING&lt;BR /&gt;24483fcd36ce45d78d80beb04b3b0cf4: TS unavailable [LEADER]&lt;/P&gt;&lt;P&gt;Tablet 1a27865b4f0c4dedaf43f490272c4f24 of table 'impala::mrs.NumberofSubs' is unavailable: 1 replica(s) not RUNNING&lt;BR /&gt;24483fcd36ce45d78d80beb04b3b0cf4: TS unavailable [LEADER]&lt;/P&gt;&lt;P&gt;Tablet 260af5f38836442f873d862d04de7615 of table 'impala::mrs.NumberofSubs' is unavailable: 1 replica(s) not RUNNING&lt;BR /&gt;24483fcd36ce45d78d80beb04b3b0cf4: TS unavailable [LEADER]&lt;/P&gt;&lt;P&gt;Tablet f738d1e1455044899230ead58c0a23e7 of table 'impala::mrs.NumberofSubs' is unavailable: 1 replica(s) not RUNNING&lt;BR /&gt;24483fcd36ce45d78d80beb04b3b0cf4: TS unavailable [LEADER]&lt;/P&gt;&lt;P&gt;Tablet 3f6347556e0649598f08edf4c5b9eaa1 of table 'impala::mrs.NumberofSubs' is unavailable: 1 replica(s) not RUNNING&lt;BR /&gt;24483fcd36ce45d78d80beb04b3b0cf4: TS unavailable [LEADER]&lt;/P&gt;&lt;P&gt;Tablet 255ed126c2fb46fcae4acc3eb3481a69 of table 'impala::mrs.NumberofSubs' is unavailable: 1 replica(s) not RUNNING&lt;BR /&gt;24483fcd36ce45d78d80beb04b3b0cf4: TS unavailable [LEADER]&lt;/P&gt;&lt;P&gt;Tablet b082a852dd40413f87cf3db126eb0f83 of table 'impala::mrs.NumberofSubs' is unavailable: 1 replica(s) not RUNNING&lt;BR /&gt;24483fcd36ce45d78d80beb04b3b0cf4: TS unavailable [LEADER]&lt;/P&gt;&lt;P&gt;Table impala::mrs.NumberofSubs has 7 unavailable tablet(s)&lt;/P&gt;&lt;P&gt;Could you please help me out from this insertion problem?&lt;/P&gt;&lt;P&gt;Thank you for your support&lt;/P&gt;</description>
      <pubDate>Wed, 05 Mar 2025 06:44:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kudo-table-not-upserting-data/m-p/403550#M252098</guid>
      <dc:creator>zeeshanmcs</dc:creator>
      <dc:date>2025-03-05T06:44:14Z</dc:date>
    </item>
    <item>
      <title>Re: Kudo table not upserting data</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kudo-table-not-upserting-data/m-p/403673#M252157</link>
      <description>&lt;P&gt;Hi-&lt;/P&gt;&lt;DIV class="p-rich_text_section"&gt;There are number of things to note here:&lt;/DIV&gt;&lt;OL class="p-rich_text_list p-rich_text_list__ordered p-rich_text_list--nested"&gt;&lt;LI&gt;Replication factor seems to be 1 for above listed tables which is not a good setting given customer is facing failures and running into unavailability situation. This needs to be at least 3 in order to deal with transient errors or even a permanent data loss on a replica.&lt;/LI&gt;&lt;LI&gt;If the ksck report is showing every TS up and running, where is the problem now? Is there any tablet which is not accessible (for read/write) to the user?&lt;/LI&gt;&lt;LI&gt;You could list all the tablets using&lt;SPAN&gt;&amp;nbsp;"&lt;/SPAN&gt;kudu table list &amp;lt;master_addresses&amp;gt; --list_tablets" and search for the tablet server hosting the replica in question from the output;&amp;nbsp;&lt;SPAN&gt;you can search for tablet uuid in old logs from the time when issue was seen, from there corresponding tablet server entry should have logged at some point.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;But - generally speaking - if your KSCK is fixed and clean - let's say, it finished bootstrapping and/or copying after some time - and you still see INSERT problems from Spark - you need to open a case with Cloudera since the issue may require certain configs' evaluation and tuning.&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Thu, 06 Mar 2025 19:31:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kudo-table-not-upserting-data/m-p/403673#M252157</guid>
      <dc:creator>Boris G</dc:creator>
      <dc:date>2025-03-06T19:31:25Z</dc:date>
    </item>
    <item>
      <title>Re: Kudo table not upserting data</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Kudo-table-not-upserting-data/m-p/403801#M252196</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/124524"&gt;@zeeshanmcs&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;It seem you're having an issue with unavailable tablets in your Kudu table, which is preventing Spark from inserting data. The output from &lt;STRONG&gt;kudu cluster ksck &lt;/STRONG&gt;clearly shows the problem: the leader replicas for all tablets in the &lt;STRONG&gt;impala::mrs.NumberofSubs &lt;/STRONG&gt;table are on a tablet server that's unavailable.&lt;/P&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;The key issue is that the tablet server with ID &lt;STRONG&gt;24483fcd36ce45d78d80beb04b3b0cf4&lt;/STRONG&gt; is not running, and it happens to be the leader for all 7 tablets in your table.&lt;/P&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;Here's a solution to resolve this issue:&lt;/P&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;1. First, check the status of all Kudu tablet servers&lt;/P&gt;&lt;LI-SPOILER&gt;sudo systemctl status kudu-tserver&lt;/LI-SPOILER&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;&lt;SPAN&gt;2.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Look specifically for the tablet server with ID&lt;/SPAN&gt;&lt;STRONG&gt; 24483fcd36ce45d78d80beb04b3b0cf4&lt;/STRONG&gt;&lt;/P&gt;&lt;LI-SPOILER&gt;sudo -u kudu kudu tserver list tstewputil1&lt;/LI-SPOILER&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;3.&amp;nbsp;If the tablet server is down, start it.&lt;/P&gt;&lt;LI-SPOILER&gt;sudo systemctl start kudu-tserver&lt;/LI-SPOILER&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;4.&amp;nbsp;If the tablet server is running but not responding, restart it&lt;/P&gt;&lt;LI-SPOILER&gt;sudo systemctl restart kudu-tserver&lt;/LI-SPOILER&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;5.&amp;nbsp;After restarting the tablet server, wait a few minutes for it to rejoin the cluster and for leadership transitions to occur, then check the status again&lt;/P&gt;&lt;LI-SPOILER&gt;sudo -u kudu kudu cluster ksck tstewputil1&lt;/LI-SPOILER&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;If the tablet server is permanently lost or damaged, you'll need to recover the tablets&lt;BR /&gt;&lt;SPAN&gt;a.&amp;nbsp;Check if you have enough replicas (you should have at least 3 for production)&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-SPOILER&gt;sudo -u kudu kudu table describe impala::mrs.NumberofSubs tstewputil1&lt;/LI-SPOILER&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;&lt;SPAN&gt;b. If you have other healthy replicas, you can delete the failed server from the cluster and Kudu will automatically recover&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-SPOILER&gt;sudo -u kudu kudu tserver delete tstewputil1 &amp;lt;tablet_server_uuid&amp;gt;&lt;/LI-SPOILER&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;c.&amp;nbsp;If this is the only replica and you don't have backups, you may need to:&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Create a new table with the same schema&lt;/LI&gt;&lt;LI&gt;Load data from your source systems&lt;/LI&gt;&lt;LI&gt;Or restore from a backup if available&lt;/LI&gt;&lt;/UL&gt;&lt;P class="whitespace-pre-wrap break-words"&gt;If, after restarting, you still have issues, the problem might be:&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;Disk space issues on the tablet server&lt;/LI&gt;&lt;LI&gt;Configuration problems&lt;/LI&gt;&lt;LI&gt;Network connectivity problems between servers&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Check the Kudu tablet server logs for more details&lt;/P&gt;&lt;LI-SPOILER&gt;less /var/log/kudu/kudu-tserver.log&lt;/LI-SPOILER&gt;&lt;P&gt;Once the tablet server is back online and healthy, your Spark job should be able to insert data into the table successfully&lt;BR /&gt;&lt;BR /&gt;Happy hadooping&lt;/P&gt;</description>
      <pubDate>Sun, 09 Mar 2025 12:25:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Kudo-table-not-upserting-data/m-p/403801#M252196</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2025-03-09T12:25:05Z</dc:date>
    </item>
  </channel>
</rss>

