We have been getting regular warning messages of Swap Memory being utilized more than the threshold limit.
Currently, we have set swap memory threshold as below:
HDFS = 100 MB
Impala = 30 MB
Yarn = 500 MB (Could be +-20%, I dont remember the exact number).
Swap Memory usage for each of the component is crossing the threshold and reaches upto 720 MB (in case of Yarn). Because of this, we usually see warnings on our CM dashboard.
I know increasing the Swap Memory Threshold could remove these warnings, however, we would prefer rather reducing the usage.
Would be great if anyone could suggestion any Memory Tuning options which would reduce the usage of Swap Memory. Also, if there is a best recommendations of setting memory usage threshold, kindly share that as well.
Disk IO intensive workloads can cause the kernel to swap even if swappiness is set to 1.
Have you tried increasing the vm.vfs_cache_pressure kernel parameter on all worker nodes? Here is a post on how to fine tune the parameter: http://datavelo.com/en/2018/04/10/kernel-swapping-vm-swappiness-1
Any working solution for the swap problem?
We have same problem on CentOS 7 with CDH Enterprise 5.14.0
DataNode, NodeManager, YarnChild processes are swapping
As a temporary fix you can use linux command swapoff -a && swapon -a to move swap. Just make sure you have enough free memory to move swap to (top command)