I am using CDH 6.3.0 HBase. Several regionserver's WALs logs directory(/hbase/wals) keep growing and logs are not moved to /hbase/oldWals for deletion.
When digging the regionserver logs, i found the following logs, it happens when there are heavy write load.
2021-01-27 14:49:16,370 WARN org.apache.hadoop.hbase.regionserver.LogRoller: Failed to schedule flush of region_id_xxxxxxxx, region=null, requester=null
I suspect that we come across HBASE-16721(Concurrency issue in WAL unflushed seqId tracking), however CDH 5.8.x has already fix this issue, so how come we meet this in 6.3.0.
HBase regionserver WAL relevant config:
hbase.regionserver.maxlogs: 64
heap size: 36G
wal blocksize: 256M
so if wal logs not archived for Concurrency issue, how to fix it; or are there any other reasons that could cause the phenomenon
Any help would be appreciated !!
@gsharma @elserj