Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

spark HiveThriftServer2 sql AnalysisException: LOAD DATA input path does not exist. if sql query is existed wild card characters

avatar
New Member

1. I'm using a ETL tool and connecting to Spark2-HiveThriftServer2 over connection string URL: "jdbc:hive2://10.30.164.132:10000/nat"

2. I'm facing with below error, if the sql have existed wild card characters "*" : LOAD DATA LOCAL INPATH '${SOURCE_DB_FILE}/V_SOURCE_BTS/${YYYYMMDD}/s0_V_SOURCE_BTS_\*_part_\*' INTO TABLE ${SCHEMA_BI}V_SOURCE_BTS; Couldn't execute SQL: LOAD DATA LOCAL INPATH '/u02/CDR/HAITI/MakeFile/V_SOURCE_BTS/20160927/s0_V_SOURCE_BTS_\*_part_\*' INTO TABLE nat.V_SOURCE_BTS 2016/10/05 14:38:43 - V_SOURCE_BTS - 2016/10/05 14:38:43 - V_SOURCE_BTS - org.apache.spark.sql.AnalysisException: LOAD DATA input path does not exist: /u02/CDR/HAITI/MakeFile/V_SOURCE_BTS/20160927/s0_V_SOURCE_BTS_\*_part_\*;

3. The sql query can execute without error if I changed the sql and remove wild cards to: LOAD DATA LOCAL INPATH '/u02/CDR/HAITI/MakeFile/V_SOURCE_BTS/20160927/s0_V_SOURCE_BTS_20160927_part_0000000' INTO TABLE ${SCHEMA_BI}V_SOURCE_BTS;

4. The problem's happened from Spark 2.0.0.

1 ACCEPTED SOLUTION

avatar
New Member

I've posted to spark community. It's a bug of SparkSQL.

https://issues.apache.org/jira/browse/SPARK-17796.

whenever the bug is fixed , It'll be updated to HDP 2.5. with spark 2.0.0 version.

View solution in original post

3 REPLIES 3

avatar
Super Guru

@Tran Quyet Thang

According to the hive documentation, "filepath can refer to a file (in which case Hive will move the file into the table) or it can be a directory (in which case Hive will move all the files within that directory into the table)."

I don't think, wild cards are allowed in the path.

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DML#LanguageManualDML-Loadingfilesin...

avatar
New Member

I've posted to spark community. It's a bug of SparkSQL.

https://issues.apache.org/jira/browse/SPARK-17796.

whenever the bug is fixed , It'll be updated to HDP 2.5. with spark 2.0.0 version.

avatar
New Member

Hi Mqureshi,

It's already fixed in the dfdfc46. patched.

When it'll merged to HDP 2.5 (spark 2.0) ?