Created on 07-28-2023 01:09 AM - edited 07-28-2023 01:10 AM
Hello everyone,
we are currently working on automaticlly patching the operating system of our CDP Private Cloud Base cluster. We enabled/configured our cluster to be HA, but for two components (YARN Job History Server and Spark History Server) we couldn't find any solution. We designed our automation to be split over the timespan of three days. Starting at 8 PM each day we update one of the master nodes, one of the edge nodes (e.g. know) and one third of the worker nodes in sequential order. Our problem is how to handle an unexpected failure. Is it possible to wait until the next day or is the availabilty in danger if one server (with the mentioned compontens) isn't booting up etc.
How important are the YARN Job History Server and Spark History Server for the successful execution. As far as I understand both are only required for storing/providing logs for executed applications. Do I miss anything here?
Created 07-31-2023 05:16 AM
Hi @MrPoseido
By enabling the YARN HA you can access the Job History Server any time simply you need to assign to the new host.
* https://hadoop.apache.org/docs/stable/hadoop-yarn/hadoop-yarn-site/ResourceManagerHA.html
Created 07-28-2023 01:39 AM
@MrPoseido, Welcome to our community! To help you get the best possible answer, I have tagged in our Yarn/Spark experts @Babasaheb @RangaReddy who may be able to assist you further.
Please feel free to provide any additional information or details about your query, and we hope that you will find a satisfactory solution to your question.
Regards,
Vidya Sargur,Created 07-28-2023 02:12 AM
Hi @MrPoseido
By using SHS, just we can the see the list of applications and its status. If SHS is down still you can copy the files to another cluster where SHS service is running and you can visualize.
Cloudera team is working on new feature to enable Spark History Server HA feature in future releases.
Created 07-31-2023 12:24 AM
Hi @RangaReddy
Thanks for your information about the SHS HA feature in future release. Let me rephrase your answer SHS is not required for a sucessful execution of any spark application. What I am missing now is the information if this states also for Yarn JHS or not?
Created 07-31-2023 05:16 AM
Hi @MrPoseido
By enabling the YARN HA you can access the Job History Server any time simply you need to assign to the new host.
* https://hadoop.apache.org/docs/stable/hadoop-yarn/hadoop-yarn-site/ResourceManagerHA.html
Created 08-01-2023 11:43 PM
@MrPoseido, Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future.
Regards,
Vidya Sargur,