Created 06-22-2025 08:31 PM
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 process INFO [4545-collect-host-statistics] Instantiating process
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 process INFO [4545-collect-host-statistics] Updating process: True {}
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 process INFO First time to activate the process [4545-collect-host-statistics].
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 agent INFO Created /var/run/cloudera-scm-agent/process/4545-collect-host-statistics
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 agent INFO Chowning /var/run/cloudera-scm-agent/process/4545-collect-host-statistics to root (0) root (0)
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 agent INFO Chmod'ing /var/run/cloudera-scm-agent/process/4545-collect-host-statistics to 0751
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 agent INFO Created /var/run/cloudera-scm-agent/process/4545-collect-host-statistics/logs
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 agent INFO Chowning /var/run/cloudera-scm-agent/process/4545-collect-host-statistics/logs to root (0) root (0)
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 agent INFO Chmod'ing /var/run/cloudera-scm-agent/process/4545-collect-host-statistics/logs to 0751
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 process INFO [4545-collect-host-statistics] Refreshing process files: None
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 parcel INFO prepare_environment begin: {}, [], []
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 parcel INFO No parcels activated for use
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 __init__ INFO Extracted 1 files and 0 dirs to /var/run/cloudera-scm-agent/process/4545-collect-host-statistics.
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 process INFO [4545-collect-host-statistics] Launching process. one-off True, command support/collect_host_stats.sh, args []
[22/Jun/2025 12:40:02 +0000] 16316 Thread-15 supervisor WARNING Failed while getting process info. Retrying. (<Fault 10: 'BAD_NAME: 4545-collect-host-statistics'>)
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 supervisor INFO Triggering supervisord update.
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 util INFO Using generic audit plugin for process collect-host-statistics
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 util INFO Creating metadata plugin for process collect-host-statistics
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 util INFO Using specific metadata plugin for process collect-host-statistics
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 util INFO Using generic metadata plugin for process collect-host-statistics
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 process INFO Begin audit plugin refresh
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 throttling_logger INFO (3 skipped) Scheduling a refresh for Audit Plugin for collect-host-statistics with count 1 pipelines names [''].
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 process INFO Begin metadata plugin refresh
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 process INFO Not creating a monitor for 4545-collect-host-statistics: should_monitor returns false
[22/Jun/2025 12:40:04 +0000] 16316 Thread-15 process INFO Daemon refresh complete for process 4545-collect-host-statistics.
[22/Jun/2025 12:40:05 +0000] 16316 Audit-Plugin navigator_plugin INFO Pipelines updated for Audit Plugin: []
[22/Jun/2025 12:40:05 +0000] 16316 Audit-Plugin throttling_logger INFO (3 skipped) Refreshing Audit Plugin for collect-host-statistics with count 0 pipelines names [].
[22/Jun/2025 12:40:05 +0000] 16316 Metadata-Plugin navigator_plugin INFO Pipelines updated for Metadata Plugin: []
Created 06-22-2025 10:34 PM
Hello @Jecky
Thank you for reaching out to cloudera community
Could you please confirm the CM and CDP versions here?
Also, could you please elaborate on the actual problem? Are you facing issues while collecting the diagnostic bundle?
Thank you
Kshitij Upadhyay
Created 06-22-2025 10:57 PM
Hello @upadhyayk04
cm version is 6.3.1
In actual use, there is a server that often does not release memory after executing the lsof - n - P command to be collected, until the server crashes. The problem has not been found on other servers, and there are no abnormal tasks on the abnormal server. The logs show that only normal collection is being done, and there are no other obvious errors
thank you
Created 06-22-2025 11:00 PM
Hello @Jecky
On the abnormal server, do you have anything suspicious in /var/log/messages or /var/log/cloudera-scm-agent/cloudera-scm-agent.log
Created on 06-22-2025 11:13 PM - edited 06-22-2025 11:15 PM
Created 06-22-2025 11:23 PM
Hello @upadhyayk04
The suspicious content is as follows
[22/Jun/2025 13:40:38 +0000] 16316 Thread-14 process INFO [4545-collect-host-statistics] Updating process: False {u'running': (True, False), u'run_generation': (1, 2)}
[22/Jun/2025 13:40:38 +0000] 16316 Thread-14 process INFO [4545-collect-host-statistics] Deactivating process
[22/Jun/2025 13:40:39 +0000] 16316 Thread-14 process INFO [4545-collect-host-statistics] Unregistered supervisor process STOPPED
[22/Jun/2025 13:40:41 +0000] 16316 Thread-14 supervisor INFO Triggering supervisord update.
[22/Jun/2025 13:40:41 +0000] 16316 Thread-14 process INFO [4545-collect-host-statistics] stopping monitors
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 process INFO [4550-host-inspector] Instantiating process
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 process INFO [4550-host-inspector] Updating process: True {}
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 process INFO First time to activate the process [4550-host-inspector].
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 agent INFO Created /var/run/cloudera-scm-agent/process/4550-host-inspector
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 agent INFO Chowning /var/run/cloudera-scm-agent/process/4550-host-inspector to root (0) root (0)
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 agent INFO Chmod'ing /var/run/cloudera-scm-agent/process/4550-host-inspector to 0751
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 agent INFO Created /var/run/cloudera-scm-agent/process/4550-host-inspector/logs
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 agent INFO Chowning /var/run/cloudera-scm-agent/process/4550-host-inspector/logs to root (0) root (0)
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 agent INFO Chmod'ing /var/run/cloudera-scm-agent/process/4550-host-inspector/logs to 0751
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 process INFO [4550-host-inspector] Refreshing process files: None
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 parcel INFO prepare_environment begin: {}, [], []
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 parcel INFO No parcels activated for use
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 __init__ INFO Extracted 1 files and 0 dirs to /var/run/cloudera-scm-agent/process/4550-host-inspector.
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 process INFO [4550-host-inspector] Launching process. one-off True, command mgmt/mgmt.sh, args [u'inspector', u'input.json', u'output.json', u'ALL']
[22/Jun/2025 13:40:41 +0000] 16316 Thread-17 supervisor WARNING Failed while getting process info. Retrying. (<Fault 10: 'BAD_NAME: 4550-host-inspector'>)
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 supervisor INFO Triggering supervisord update.
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 util INFO Using generic audit plugin for process host-inspector
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 util INFO Creating metadata plugin for process host-inspector
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 util INFO Using specific metadata plugin for process host-inspector
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 util INFO Using generic metadata plugin for process host-inspector
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 process INFO Begin audit plugin refresh
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 throttling_logger INFO (1 skipped) Scheduling a refresh for Audit Plugin for host-inspector with count 1 pipelines names [''].
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 process INFO Begin metadata plugin refresh
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 process INFO Not creating a monitor for 4550-host-inspector: should_monitor returns false
[22/Jun/2025 13:40:43 +0000] 16316 Thread-17 process INFO Daemon refresh complete for process 4550-host-inspector.
[22/Jun/2025 13:40:43 +0000] 16316 MainThread agent WARNING Long HB processing time: 5.26821017265
[22/Jun/2025 13:40:43 +0000] 16316 Audit-Plugin navigator_plugin INFO stopping Audit Plugin for collect-host-statistics with count 0 pipelines names [].
[22/Jun/2025 13:40:43 +0000] 16316 Audit-Plugin navigator_plugin INFO Pipelines updated for Audit Plugin: []
[22/Jun/2025 13:40:43 +0000] 16316 Audit-Plugin throttling_logger INFO (1 skipped) Refreshing Audit Plugin for host-inspector with count 0 pipelines names [].
[22/Jun/2025 13:40:44 +0000] 16316 Metadata-Plugin navigator_plugin INFO stopping Metadata Plugin for collect-host-statistics with count 0 pipelines names [].
[22/Jun/2025 13:40:44 +0000] 16316 Metadata-Plugin navigator_plugin INFO Pipelines updated for Metadata Plugin: []
[22/Jun/2025 13:40:47 +0000] 16316 MainThread process INFO [4550-host-inspector] Unregistered supervisor process EXITED
[22/Jun/2025 13:40:49 +0000] 16316 MainThread supervisor INFO Triggering supervisord update.
[22/Jun/2025 13:41:02 +0000] 16316 Thread-16 process INFO [4550-host-inspector] Updating process: False {u'running': (True, False), u'run_generation': (1, 2)}
[22/Jun/2025 13:41:02 +0000] 16316 Thread-16 process INFO [4550-host-inspector] Deactivating process (skipped)
[22/Jun/2025 13:41:02 +0000] 16316 Thread-16 process INFO [4550-host-inspector] stopping monitors
[22/Jun/2025 13:41:03 +0000] 16316 Audit-Plugin navigator_plugin INFO stopping Audit Plugin for host-inspector with count 0 pipelines names [].
[22/Jun/2025 13:41:04 +0000] 16316 Metadata-Plugin navigator_plugin INFO stopping Metadata Plugin for host-inspector with count 0 pipelines names [].
Created 06-23-2025 03:00 AM
Hello @upadhyayk04
The following information was found in the /var/log/cloudera-scm-agent/status-stdout.log log file.
Created 06-26-2025 10:29 PM
Hello @Jecky
Was a reboot performed on the node? Can it be tried once?
Also this might need further deep level analysis regarding the threads so if you have a Cloudera license would it be possible to raise a support case for this
Thank you
Kshitij Upahdyay