Member since
07-01-2025
18
Posts
2
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
431 | 07-01-2025 05:37 PM |
07-23-2025
01:48 PM
I just found this information on my validations in assets. I have made these changes and will report back tomorrow if it helps. The Checkpoint transaction-limit set to 1000000. Cloudera recommends a limit of 4,000,000. The checkpoint period is set to 3600 seconds. Cloudera recommends at least 7200 seconds (2 hours) in production clusters. Please see the following documentation for complete details: https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/data-protection/topics/hdfs-configuration-properties.html.
... View more
07-23-2025
01:05 PM
I am going to create a support ticket for this as well. I was hoping this was going to be an easy one.
... View more
07-22-2025
01:58 PM
Here are the logs from one of the datanode servers. Thank you very much.
... View more
07-21-2025
12:00 PM
The weird thing is, when I restart HDFS, it seems fine for about a day and then I get those alerts again. One thing I just did though, was I did the ssh-copy-id from the secondary name node to all the data nodes. Not sure if that will help or not though.
... View more
07-21-2025
11:29 AM
Yes, they are having no problems communicating with each other. They all have two IPs and all of the internal communication is going over a private 192.168.x.x network. I can ping back and forth with no problem. I also turned the firewall off and that doesn't seem to be an issue either.
... View more
07-17-2025
12:55 PM
Sorry, I meant which logs do you want from both of those servers? Are there specific logs that you want? HDFS, agent, alert publisher, event server, firehose, etc?
... View more
07-16-2025
03:51 PM
In cloudera-scm-server.log, when I do a tail -f, I get a bunch of these logs. Does this mean anything? 2025-07-16 17:48:22,763 WARN avro-servlet-hb-processor-24:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=164 name=null host=45264634-8596-4805-b797-998d053db296/dmidlkprdls01.svr.luc.edu 2025-07-16 17:48:22,763 WARN avro-servlet-hb-processor-24:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=166 name=null host=45264634-8596-4805-b797-998d053db296/dmidlkprdls01.svr.luc.edu 2025-07-16 17:48:22,763 WARN avro-servlet-hb-processor-24:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=224 name=null host=45264634-8596-4805-b797-998d053db296/dmidlkprdls01.svr.luc.edu 2025-07-16 17:48:22,763 WARN avro-servlet-hb-processor-24:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=226 name=null host=45264634-8596-4805-b797-998d053db296/dmidlkprdls01.svr.luc.edu 2025-07-16 17:48:22,763 WARN avro-servlet-hb-processor-24:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=225 name=null host=45264634-8596-4805-b797-998d053db296/dmidlkprdls01.svr.luc.edu 2025-07-16 17:48:22,763 WARN avro-servlet-hb-processor-24:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=227 name=null host=45264634-8596-4805-b797-998d053db296/dmidlkprdls01.svr.luc.edu 2025-07-16 17:48:22,763 WARN avro-servlet-hb-processor-24:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=228 name=null host=45264634-8596-4805-b797-998d053db296/dmidlkprdls01.svr.luc.edu 2025-07-16 17:48:22,763 WARN avro-servlet-hb-processor-24:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=288 name=null host=45264634-8596-4805-b797-998d053db296/dmidlkprdls01.svr.luc.edu 2025-07-16 17:48:24,737 INFO scm-web-20423:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:48:25,556 WARN avro-servlet-hb-processor-6:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=133 name=null host=009ec263-928b-4af1-8088-785b315f3e21/dmidlkprdls02.svr.luc.edu 2025-07-16 17:48:34,937 INFO scm-web-20423:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:48:45,064 INFO scm-web-21021:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:48:46,516 WARN avro-servlet-hb-processor-18:com.cloudera.server.cmf.AgentProtocolImpl: (119 skipped) Received Process Heartbeat for unknown (or duplicate) process. Ignoring. This is expected to happen once after old process eviction or process deletion (as happens in restarts). id=134 name=null host=2406c3be-dd14-481f-8a19-462efa8c5f8c/dmidlkprdls03.svr.luc.edu 2025-07-16 17:48:55,306 INFO avro-servlet-hb-processor-10:com.cloudera.server.common.AgentAvroServlet: (35 skipped) AgentAvroServlet: heartbeat processing stats: average=20ms, min=11ms, max=67ms. 2025-07-16 17:48:55,352 INFO scm-web-20423:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:48:57,424 INFO pool-10-thread-1:com.cloudera.server.cmf.components.CmServerStateSynchronizer: (30 skipped) Synced up 2025-07-16 17:49:05,667 INFO scm-web-20422:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:49:14,429 INFO pool-10-thread-1:com.cloudera.server.cmf.components.CmServerStateSynchronizer: (30 skipped) Cleaned up 2025-07-16 17:49:15,826 INFO scm-web-20423:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:49:26,104 INFO scm-web-20423:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:49:36,228 INFO scm-web-20422:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:49:47,376 INFO scm-web-21021:com.cloudera.cmf.cluster.AbstractParallelClusterServiceCommand: Cluster Start command with purpose START found all the services already in started state, no further action to perform on cluster DAMICluster 2025-07-16 17:49:55,368 INFO avro-servlet-hb-processor-4:com.cloudera.server.common.AgentAvroServlet: (35 skipped) AgentAvroServlet: heartbeat processing stats: average=21ms, min=11ms, max=67ms. 2025-07-16 17:49:59,425 INFO pool-10-thread-1:com.cloudera.server.cmf.components.CmServerStateSynchronizer: (30 skipped) Synced up
... View more
07-16-2025
10:19 AM
Hi, thank you very much for your response. What logs would you need? The cloudera-scm-server and/or cloudera-scm-agent?
... View more