distcp - How to determine number of mappers used by distcp job(s) at cluster level?
Sometime we run into network bandwidth issue caused by distcp job(s) running too many mappers or too many distcp jobs.
Our plan is to trigger DataDog alert when the total number of mappers used by distcp jobs (at cluster level) reach at defined number (ex: 100). We are open to explore the "-bandwidth" option.
We have many users who will be submitting a job from diff edge nodes. so, we don't want to use the "ps" command at server level.
Please help us rectify the issue. Thanks in advance.