
saveAsTextFile creates Junk characters if the Rdd has French Characters


I am reading a Parquet file that contains French characters, transforming the RDD to a new format, and writing the output with saveAsTextFile. In the output I see junk characters wherever the input has French characters. How do I preserve the French characters? Please help me resolve the issue.

gffRdd
  .filter(row => row._1 == source)
  .map(row => row._2)
  .repartition(1)
  .saveAsTextFile(s"$outputLocalPath/$source")


Re: saveAsTextFile creates Junk characters if the Rdd has French Characters


Which characters are causing the issue? I have a feeling this will need to go into a JIRA, and we can put together a manual workaround for now.
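One way to pin down which characters are affected is to compare the UTF-8 bytes of a known French string with what the output looks like when those bytes are decoded with the wrong charset. The snippet below is only a diagnostic sketch, not a confirmed fix: `EncodingCheck`, `utf8Hex`, `misdecode`, and the sample word are hypothetical names chosen for illustration. It assumes the junk comes from UTF-8 bytes being read as ISO-8859-1, which is a common cause of this symptom but has not been verified for this job.

```scala
import java.nio.charset.StandardCharsets

object EncodingCheck {
  // Hex dump of the UTF-8 bytes of a string, so the bytes in the
  // output file can be compared with what is expected.
  def utf8Hex(s: String): String =
    s.getBytes(StandardCharsets.UTF_8).map(b => f"${b & 0xff}%02x").mkString(" ")

  // Simulates a reader that wrongly decodes UTF-8 bytes as ISO-8859-1.
  // If the junk in the output matches this result, the data is probably
  // fine and only the decoding step (viewer, terminal, downstream reader)
  // is using the wrong charset.
  def misdecode(s: String): String =
    new String(s.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1)

  def main(args: Array[String]): Unit = {
    // Hypothetical sample; \u00e9 is "e" with an acute accent.
    val sample = "Soci\u00e9t\u00e9"
    println(utf8Hex(sample))   // each \u00e9 appears as the byte pair c3 a9
    println(misdecode(sample)) // each \u00e9 appears as two junk characters
  }
}
```

If the junk in the saved file matches `misdecode`, the file is likely valid UTF-8 and is being viewed with the wrong encoding. If it does not match, the corruption may happen earlier, for example where a String is built from bytes using the JVM default charset. In that case it is worth checking the `file.encoding` setting on the driver and executors.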
