Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Quote data at writing to HDFS does not work

avatar
New Member

I am trying to save a csv to hdfs using quotes, but it does not work. Could you please suggest what i am doing wrong

code:

df.write.format('com.databricks.spark.csv').mode('overwrite').option("header", "false").option("quote","\"").save(output_path)

i am calling it with the following:

--packages com.databricks:spark-csv_2.10:1.5.0 --repositories http://repo.hortonworks.com/content/groups/public/

also tried some other versions, but still writes it without quoting. also tested with other quote character.

#output:
aaa,bbb

#expected output:
"aaa","bbb"

thanks

1 ACCEPTED SOLUTION

avatar
New Member

finally i was able to to it:

#code:
df.write.format('com.databricks.spark.csv').mode('overwrite').option("header", "false").option("quoteMode", "ALL").save(output_path)

#call:
--packages com.databricks:spark-csv_2.10:1.5.0

View solution in original post

2 REPLIES 2

avatar
Rising Star

Try this option.

.option("quote", ".option("quote", "\u0000")

avatar
New Member

finally i was able to to it:

#code:
df.write.format('com.databricks.spark.csv').mode('overwrite').option("header", "false").option("quoteMode", "ALL").save(output_path)

#call:
--packages com.databricks:spark-csv_2.10:1.5.0