Hello, I've been experiencing a similar problem with a large number of TIME_WAIT sockets. I knew it was related to replication, so I started looking into the replication options and found the following. I had set replication.source.sleepforretries to 1 according to these instructions: https://docs.hortonworks.com/HDPDocuments/HDP3/HDP-3.0.1/fault-tolerance/content/spreading_queue_failover_load.html They say that 1 means 1 second, but if you look at the HBase source code, you'll see that the value is in milliseconds, so it should be 1000 for 1 second. After changing replication.source.sleepforretries from 1 to 1000 with replication enabled, the number of TIME_WAIT sockets dropped back to a normal value. So check your setting; maybe you have it set to 1 as well. And to Hortonworks: please fix the docs.
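If you want to check your own setting quickly, here is a rough sketch in Python. It assumes the property is set in the usual Hadoop-style XML layout (as in hbase-site.xml); the function names, the embedded sample and the 100 ms warning threshold are all made up for illustration and are not part of HBase.

```python
import xml.etree.ElementTree as ET

PROPERTY = "replication.source.sleepforretries"

def read_sleep_for_retries(xml_text):
    """Return the configured value (milliseconds), or None if it is not set."""
    root = ET.fromstring(xml_text)
    for prop in root.iter("property"):
        if prop.findtext("name", default="").strip() == PROPERTY:
            value = prop.findtext("value", default="").strip()
            return int(value) if value else None
    return None

def looks_like_seconds(value_ms, threshold_ms=100):
    """Heuristic only: a very small value was probably meant as seconds.

    The 100 ms threshold is an arbitrary assumption for this sketch.
    """
    return value_ms is not None and value_ms < threshold_ms

# Hypothetical sample mirroring the misconfiguration described above.
SAMPLE = """<configuration>
  <property>
    <name>replication.source.sleepforretries</name>
    <value>1</value>
  </property>
</configuration>"""

if __name__ == "__main__":
    value = read_sleep_for_retries(SAMPLE)
    if looks_like_seconds(value):
        print(f"{PROPERTY} = {value} ms; did you mean {value * 1000}?")
```

To use it on a real cluster, pass the contents of your own hbase-site.xml instead of SAMPLE.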