Created 11-14-2015 09:13 PM
Containers are being killed with error:
Container: container_e81_1447433228005_0589_01_001207 on datanode1.company.com_45454 ======================================================================================= LogType:stderr Log Upload Time:Fri Nov 13 13:14:59 -0500 2015 LogLength:78 Log Contents: Error: Could not find or load main class org.apache.tez.runtime.task.TezChild -- Container: container_e81_1447433228005_0589_01_003434 on datanode2.company.com_45454 ======================================================================================= LogType:stderr Log Upload Time:Fri Nov 13 13:14:58 -0500 2015 LogLength:78 Log Contents: Error: Could not find or load main class org.apache.tez.runtime.task.TezChild -- Container: container_e81_1447433228005_0589_01_003219 on datanode2.company.com_45454 ======================================================================================= LogType:stderr Log Upload Time:Fri Nov 13 13:14:58 -0500 2015 LogLength:78 Log Contents: Error: Could not find or load main class org.apache.tez.runtime.task.TezChild
Any ideas?
Created 11-14-2015 09:13 PM
Please remove customer name from the logs
Created 11-14-2015 09:15 PM
HDP version ?
Created 11-14-2015 09:25 PM
Thanks for the reminder
v2.2.4.2
Created 11-14-2015 11:27 PM
Can you check the installation of Tez? Detailed manual instructions are available here. Typically you see this when tez.lib.uris is not configured correctly.
Created 11-16-2015 02:24 PM
Thanks for the reply, Deepesh.
This cluster was configured by ambari and the Tez settings are correct.
Specifically, the page you cited talks about tez.lib.uris which is configured correctly:
<property> <name>tez.lib.uris</name> <value>/hdp/apps/2.2.4.5-6/tez/tez.tar.gz</value> </property>
All the other properties stated in that page also look correct to me.
Again, this is an intermittent issue. If we are missing properties, is not because of misconfiguration but rather the properties are being lost somehow.
Ideas?
Created 11-16-2015 03:07 PM
Suspect that the HDFS has some issues, can you look for exceptions in the DN and NN logs. Also scan for DFSClient exceptions in the the container logs.
Created 11-16-2015 07:10 PM
Thank you, Deepesh. I will do that.