I just built a new cluster using Cloudera Manager V7.0.3 and I am testing it. All of the services show "green" in CM, and I have run some tests on HDFS and YARN (both MR and Spark) with no problems. I am now trying to test Hive. I connected to Hive using the beeline prompt, and I was able to successfully create a table, but when I try to insert a row, it fails with this error:
Error while processing statement: FAILED: Execution Error, return code 30041 from org.apache.hadoop.hive.ql.exec.spark.SparkTask. Failed to create Spark client for Spark session b0cb962c-b8d5-4254-9ff8-6c3faedb9d21 (state=42000,code=30041)
That made me think that the CLASSPATH beeline is using is somehow wrong, but I don't know how to fix that. The Spark classes should be present on that machine, because I ran my YARN Spark test from there.
I also installed Hive on Tez on this cluster, so I tried to use that, and that DID work. I connected to Hive on Tez from the beeline prompt, and I was able to insert rows into the table (the one that I created earlier) and also query it.
So Hive itself seems OK; only Hive on Spark is the problem. Can anyone help? I have configured everything through CM, so I would prefer to fix it there, but if the only option is to "hack" a configuration file somewhere, I am willing to try that.