Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

CM host/service monitor TS question

avatar
Contributor

Hi,

 

I have a 6 node cluster running in HA and noticed the growth rate of the TS data. I understand that the data storage for CM host/service monitor has a limit set which can be adjusted but is this normal for a HA cluster as for the rate of growth, considering that there are no jobs running in the cluster? Granted it will cap out at 10gb for each directory but is there a way to slow down the rate of consumption?

 

I saw an option in the CM manager "event publication log quiet time period" would this change the rate at which each "monitor storage directory"  collects data?

 

Again, if this is normal and expected then I just need to ensure that there is enough overhead for the TS data to begin rolling.

 

Capture.PNG

Capture.PNG

Thanks

 

CM 5.2

1 ACCEPTED SOLUTION

avatar
Super Collaborator
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login
8 REPLIES 8

avatar
Super Collaborator
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login

avatar
Explorer

hi,

 

Can i delete old data sitting in ts in cloudera-host/service monitors to retain some space ? the data was 4 years old.

 

please confirm me 

 

Thanks

Yasmeen

avatar
Super Collaborator

The minimum disk space limit for each of Host Monitor and Service Monitor is 10 Gb. If you are at this size or below, a cleanup does not make much sense as the disk space will be re-acquired over time.

 

The correct solution is to make sufficient disk space available for the /var/lib mountpoint.

 

If there is no other option then a short term help is to 

 

  • stop the Service Monitor role instance
  • empty the data directory /var/lib/cloudera-service-monitor/
  • start the Service Monitor role instance

Similar for the Host Monitor. Be aware this means all historical data showing up in the charts in CM will be lost with this method. Only monitoring data gathered after this procedure will be showing up.

avatar
Explorer

Thanks ton for your reply, few more clarifications -

 

Recently i came cross an issue where we have the partition metadata files under cloudera-host-monitor been zipped up and resulted in host monitor down for complete cluster. I rresolved this by unzipping all files.

 

 What does cloudera[host-monitor/service-monitor] do while they restart? do they perform to read files under partition/partition metadata during statr?

 

 as per your reply, 

  • empty the data directory /var/lib/cloudera-service-monitor/--- do i need to cleanup complete underlying directories under clouder-service-monitor? does it impact anything while starting service-monitor? We do have enough mountpoint of 50 GB, still i see files under ts sitting in CB's

Thanks

Yasmeen

avatar
Super Collaborator

Zipping files in the directory effectively breaks the index thus Host Monitor (same for Service Monitor) is expected to fail during startup. Regarding your question, yes please totally empty that directory while Service Monitor is shut down. The next startup will initialize the directory with the new index files, no issues expected.

avatar
Explorer

Hello,

 

Thanks for your reply.

 

Before i cleanup from scratch dir for cloudera [host /service monitor] few more clarifications-

 

I see under subfolders of partitions/partition metadata there are few older files 2014, 2015 which showing up an amount of 2 GB, can i alone clean up these files? will it result in any failure during host/service monitor when they actually restart ?

 

Thanks

Yasmeen

avatar
Super Collaborator

Sorry for the delay @yasmin 

Not sure which files exactly you are referring to, but options are these are still valid data files from back then or they have been manually copied to there. You can easily verify with this procedure:

  • Stop Service Monitor
  • Make a backup of the whole data directory
  • Delete the files in question
  • Start Service Monitor

Then monitor the Service Monitor logs after startup. If you see any errors or issues reported then those files are needed, then please revert the above procedure and restore the data directory from the backup. If there are no errors then you are fine with the deleted files.

avatar
Explorer

Thank you gzigldrum 

 

Ill follow the same below.Appreciate your time here!

 

Regards

Yasmeen