Community,
Any best practices for the maintenance for production clusters. When we should restart the services generally? - weekly once or monthly once.
Cleaning up log files.
And, also found that yarr logs in HDFS not get deleted according to the retention period. Ref: YARN logs under /tmp/logs/{user.name}/logs not cleared properly
Note: Mainly looking for a cluster in Bank, with regular ingestion using Sqoop and analytics using spark and impala.