Created on 05-21-2018 07:41 PM - edited 09-16-2022 06:15 AM
My cluster memory is 147GB and I get this error when the server has not used it's entire memory.
I can see there is memory free and yet my jobs get killed with this error. There is no error in logs and I don't get any error using dmesg command or in /var/log/messages
Also, it happens randomly and on any of the nodes. Please suggest. Been trying to get in touch with Cloudera sales support but no luck and it's urgent.
Created 05-21-2018 08:57 PM
Created on 05-21-2018 09:39 PM - edited 05-21-2018 09:41 PM
These are not spark jobs but hive and sqoop jobs I am running. These randomly get killed throughout the day, with the same configuration sometimes run and sometimes don't.
Created 05-24-2018 06:07 PM
@Harsh J : Could you please respond? It's a production cluster and it is disturbing our workflows when we run into this error