Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

What is the best way to troubleshoot an application/job that is taking longer than usual?

avatar
Contributor

Hi experts,

I was wondering what is the best way to troubleshoot an application or job that is taking longer than usual. Maybe a 5minute job that is taking 1 hour or longer to complete etc.

What are some things I should start looking at first? Or can someone bring me through the process?

 

Thanks,

2 REPLIES 2

avatar
Rising Star

@ryu 
1.Check the queue that job is running in. See if that is allocated enough resources.
2.See if the queue is pending containers is showing or not.
3.If all these things are fine then start checking the locality if the job is running in node local or rack local.
4. Then go to node manager level and debug for local unix level slowness

avatar
Rising Star

@ryu  Please accept this as solution if your issue is resolved by the suggestion