Created on 08-26-2019 01:40 PM - last edited on 08-26-2019 05:57 PM by ask_bill_brooks
I'm studying CDH 6.3.0 with Hive and Spark, and I'm facing a problem that has held me up for a week.
I already reinstalled everything from scratch, but that did not solve it.
The timeout occurs when I try to select from a table.
Consider this:
DROP TABLE dashboard.top10;
CREATE TABLE dashboard.top10 (id VARCHAR(100), floatVal DOUBLE)
STORED AS ORC TBLPROPERTIES ("compress.mode"="SNAPPY");
INSERT INTO dashboard.top10 SELECT * FROM analysis.total_raw ORDER BY floatVal DESC LIMIT 10;
Error while processing statement: FAILED: Execution Error, return code
30041 from org.apache.hadoop.hive.ql.exec.spark.SparkTask. Failed to
create Spark client for Spark session
faf8afcb-0e43-4097-8dcb-44f3f1445005_0:
java.util.concurrent.TimeoutException: Client
'faf8afcb-0e43-4097-8dcb-44f3f1445005_0' timed out waiting for
connection from the Remote Spark Driver
The container is exiting and here is the full log:
Created 08-27-2019 07:09 AM
My guess is that the timeout settings are not being taken into account.
Since this is my test environment, latency can be greater than 1 s.
I found some warnings that seem to support my guess:
2019-08-27T10:52:10,045 INFO [spark-submit-stderr-redir-05681b44-ae8a-42d9-a80d-20dad05faa98 main] client.SparkClientImpl: Warning: Ignoring non-spark config property: hive.spark.client.server.connect.timeout=90000
2019-08-27T10:52:10,046 INFO [spark-submit-stderr-redir-05681b44-ae8a-42d9-a80d-20dad05faa98 main] client.SparkClientImpl: Warning: Ignoring non-spark config property: hive.spark.client.rpc.threads=8
2019-08-27T10:52:10,046 INFO [spark-submit-stderr-redir-05681b44-ae8a-42d9-a80d-20dad05faa98 main] client.SparkClientImpl: Warning: Ignoring non-spark config property: hive.spark.client.future.timeout=60000
2019-08-27T10:52:10,046 INFO [spark-submit-stderr-redir-05681b44-ae8a-42d9-a80d-20dad05faa98 main] client.SparkClientImpl: Warning: Ignoring non-spark config property: hive.spark.client.connect.timeout=1000
2019-08-27T10:52:10,046 INFO [spark-submit-stderr-redir-05681b44-ae8a-42d9-a80d-20dad05faa98 main] client.SparkClientImpl: Warning: Ignoring non-spark config property: hive.spark.client.secret.bits=256
2019-08-27T10:52:10,053 INFO [spark-submit-stderr-redir-05681b44-ae8a-42d9-a80d-20dad05faa98 main] client.SparkClientImpl: Warning: Ignoring non-spark config property: hive.spark.client.rpc.max.size=52428800
Created 08-28-2019 12:50 AM
Created 08-30-2019 11:17 AM
Thanks for your reply, @EricL.
The connections between the nodes are fine. I edited hive-site.xml with these parameters and it's working now, but I'm not sure why the timeout was happening:
hive.spark.client.connect.timeout=360000ms
hive.spark.client.server.connect.timeout=360000ms
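In case it helps anyone else, here is a minimal sketch of how those two entries could look in hive-site.xml, assuming the usual Hadoop-style property layout. The values are simply the ones listed above, not a tuned recommendation, and the comments paraphrase the Hive property descriptions as I understand them.

```xml
<!-- Fragment to place inside the <configuration> element of hive-site.xml -->
<property>
  <!-- Timeout for the remote Spark driver when connecting back to the Hive client -->
  <name>hive.spark.client.connect.timeout</name>
  <value>360000ms</value>
</property>
<property>
  <!-- Timeout for the handshake between the Hive client and the remote Spark driver -->
  <name>hive.spark.client.server.connect.timeout</name>
  <value>360000ms</value>
</property>
```

HiveServer2 most likely needs a restart before the new values are picked up; I am assuming that rather than having verified it.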
BR
Created 08-30-2019 05:48 PM