Created on 05-21-2017 11:25 AM - edited 08-18-2019 02:11 AM
Hi,
I'm writing files in HDFS but special characters like "ã", "ç" appear like that "Precipita??????o" instead "Precipitação".
How can i handle this encondings?
Here is my flow:
Thanks
Created 05-22-2017 03:18 AM
What is the encoding of incoming file? ReplaceText default encoding is UTF-8 which I think should be sufficient for this. But you are also writing raw file. Check content of the flow file by looking at data provenance. I think the raw file is likely being written the right way but whatever you are using to read the file is not showing you the content in the encoding you need to read this data properly.
Created 05-22-2017 09:05 PM
The raw file written have the same problem when i view in "Files View" in Ambari. Perhaps the problem is when visualizing although the right encoding in UTF-8.