<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: When the number of indexers of lily hbase indexer service is created to 200+, a large number of TIME_WAIT ports appear in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-the-number-of-indexers-of-lily-hbase-indexer-service-is/m-p/203582#M83914</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I've been experiencing a similar problem with a large number of TIME_WAIT sockets. I knew it was related to replication, so I started researching the replication options and found the following:&lt;/P&gt;&lt;P&gt;I had set &lt;STRONG&gt;replication.source.sleepforretries&lt;/STRONG&gt; to 1 according to these instructions:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.hortonworks.com/HDPDocuments/HDP3/HDP-3.0.1/fault-tolerance/content/spreading_queue_failover_load.html" target="_blank"&gt;https://docs.hortonworks.com/HDPDocuments/HDP3/HDP-3.0.1/fault-tolerance/content/spreading_queue_failover_load.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;They say that a value of 1 means 1 second, but if you look at the &lt;A href="https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSource.java#L162" target="_blank"&gt;HBase source code&lt;/A&gt;, you'll see that the value is in milliseconds, so it should be 1000 for 1 second. After I changed &lt;STRONG&gt;replication.source.sleepforretries&lt;/STRONG&gt; from 1 to 1000 with replication enabled, the number of TIME_WAIT (TIME-WAIT) sockets dropped to a normal level.&lt;/P&gt;&lt;P&gt;So check it; maybe you have it set to 1 as well. And to Hortonworks: please fix the docs.&lt;/P&gt;</description>
    <pubDate>Tue, 27 Nov 2018 21:57:50 GMT</pubDate>
    <dc:creator>gpchelkin</dc:creator>
    <dc:date>2018-11-27T21:57:50Z</dc:date>
    <item>
      <title>When the number of indexers of lily hbase indexer service is created to 200+, a large number of TIME_WAIT ports appear</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-the-number-of-indexers-of-lily-hbase-indexer-service-is/m-p/203581#M83913</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="91567-time-wait-副本.jpg" style="width: 2000px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/17267iCB7DD00655B841F6/image-size/medium?v=v2&amp;amp;px=400" role="button" title="91567-time-wait-副本.jpg" alt="91567-time-wait-副本.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Version:
CDH 5.13.3, which corresponds to HBase 1.2.0&lt;/P&gt;&lt;P&gt;Nodes:
Nine worker nodes and three management nodes&lt;/P&gt;&lt;P&gt;Main role assignment:
Nine RegionServers, DataNodes, and Solr servers; three of these nodes also run the Lily HBase Indexer&lt;/P&gt;&lt;P&gt;Background:
HBase is paired with Solr: Solr stores the secondary index for HBase, and the index is synchronized automatically through the Lily HBase Indexer&lt;/P&gt;&lt;P&gt;Question:&lt;/P&gt;&lt;P&gt;After increasing the number of indexers to 200+, we found a large number of sockets in TIME_WAIT (almost 30,000), and the RegionServer log contained this message: "Retrying connect to server: xx.xx.com/ipAddress:50020. Already tried 1 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)". When we delete all the indexers, the socket count returns to normal and HBase recovers.
Our initial suspicion was that multi-WAL in this HBase version conflicts with replication. We originally configured 3 WALs, then switched to a single WAL and re-created the 200+ indexers. The TIME_WAIT count dropped to about 10,000, but the problem is still not solved. Can you give us some advice?&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 05:10:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-the-number-of-indexers-of-lily-hbase-indexer-service-is/m-p/203581#M83913</guid>
      <dc:creator>13333855372</dc:creator>
      <dc:date>2019-08-18T05:10:37Z</dc:date>
    </item>
    <item>
      <title>Re: When the number of indexers of lily hbase indexer service is created to 200+, a large number of TIME_WAIT ports appear</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-the-number-of-indexers-of-lily-hbase-indexer-service-is/m-p/203582#M83914</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I've been experiencing a similar problem with a large number of TIME_WAIT sockets. I knew it was related to replication, so I started researching the replication options and found the following:&lt;/P&gt;&lt;P&gt;I had set &lt;STRONG&gt;replication.source.sleepforretries&lt;/STRONG&gt; to 1 according to these instructions:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.hortonworks.com/HDPDocuments/HDP3/HDP-3.0.1/fault-tolerance/content/spreading_queue_failover_load.html" target="_blank"&gt;https://docs.hortonworks.com/HDPDocuments/HDP3/HDP-3.0.1/fault-tolerance/content/spreading_queue_failover_load.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;They say that a value of 1 means 1 second, but if you look at the &lt;A href="https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSource.java#L162" target="_blank"&gt;HBase source code&lt;/A&gt;, you'll see that the value is in milliseconds, so it should be 1000 for 1 second. After I changed &lt;STRONG&gt;replication.source.sleepforretries&lt;/STRONG&gt; from 1 to 1000 with replication enabled, the number of TIME_WAIT (TIME-WAIT) sockets dropped to a normal level.&lt;/P&gt;&lt;P&gt;So check it; maybe you have it set to 1 as well. And to Hortonworks: please fix the docs.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Nov 2018 21:57:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-the-number-of-indexers-of-lily-hbase-indexer-service-is/m-p/203582#M83914</guid>
      <dc:creator>gpchelkin</dc:creator>
      <dc:date>2018-11-27T21:57:50Z</dc:date>
    </item>
  </channel>
</rss>

