I'm getting the same error. We just went through a power outage over the weekend and when we returned, there was a lot of data queued
up to be stored on hdfs. We let it catch up, but while doing so, oozie jobs we had scheduled were being kicked off and many times failing or
being suspended.
When I run ifconfig, I see what I would expect to see on the interface. Is there anything we did or didn't do on power up that is having us thrashing
and seemingly in a resource battle as these things run?
I'm two days into this research effort so any insight would be greatly appreciated.