how do we troubleshoot spark hung jobs?
have a job where several executers execute in no time and last few executers hung and there are running since 24 hrs.
trying to understand what that process is doing? where it got hung.
You can login to resource manager url at port 8088 get the application Id specific to the submitted job,then run the following command to see the log info
yarn logs -applicationId <application_id> | more