Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

.xlsx parsing in spark giving java.text.ParseException: Unparseable number

Highlighted

.xlsx parsing in spark giving java.text.ParseException: Unparseable number

New Contributor

We are using HDP 3.0, while we are loading .xlsx file to spark data frame, strangely string type of column was taken as number in data frame. giving the exception as java.text.ParseException: Unparseable number: "TMF Study 331-102-00088 Contributor" .

the code we used is

df_load_temp = spark.read.format("com.crealytics.spark.excel").option("treatEmptyValuesAsNulls", "true") \

.option("location",file_name) \

.option("useHeader", "true") \

.option("inferSchema", "true") \

.option("addColorColumns", "False").load()


Hope any body help as early as possible.

thanks in advance....