I am looking for information on an XML SerDe that can handle multi-byte character data. Please advise.

1 ACCEPTED SOLUTION

Super Guru

@milind pandit

@Joseph Niemiec mentioned the use of this XML SerDe: http://search.maven.org/remotecontent?filepath=com/ibm/spss/hive/serde2/xml/hivexmlserde/1.0.5.3/hiv...

I understand that you are looking for an XML SerDe, but you may be open to an alternative. NiFi provides the ConvertCharacterSet processor, which re-encodes content from one character set to another. Maybe that helps.
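To illustrate what a character set conversion does to the data, here is a minimal Python sketch. It is not NiFi code and not how ConvertCharacterSet is implemented; the function name `convert_charset` and the Shift_JIS sample are assumptions chosen only for illustration.

```python
def convert_charset(data: bytes, source: str, target: str = "utf-8") -> bytes:
    """Re-encode raw bytes from the source character set to the target one."""
    # Decode with the source charset, then encode with the target charset.
    # A UnicodeDecodeError here means the bytes are not valid in `source`.
    return data.decode(source).encode(target)


# Hypothetical sample: Japanese text stored as Shift_JIS, re-encoded as UTF-8.
original = "<name>日本語</name>".encode("shift_jis")
converted = convert_charset(original, "shift_jis")
print(converted.decode("utf-8"))
```

If the XML carries a declaration such as `<?xml version="1.0" encoding="Shift_JIS"?>`, that declaration would also need to be updated after re-encoding, otherwise a parser may still try to read the bytes with the old character set.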

3 REPLIES

@Constantin Stanca I am exploring the NiFi option, as the IBM XML SerDe is having issues processing multi-byte characters.

Super Guru

Another option would be to pre-convert XML to JSON.
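
As a rough sketch of that pre-conversion step, the Python below uses only the standard library. The helper names `element_to_dict` and `xml_to_json` and the sample record are assumptions for illustration; attributes and mixed content are ignored, so real data may need more handling.

```python
import json
import xml.etree.ElementTree as ET


def element_to_dict(elem):
    """Recursively convert an Element into nested dicts, lists, and strings."""
    children = list(elem)
    if not children:
        # Leaf element: keep its text (attributes are ignored in this sketch).
        return (elem.text or "").strip()
    result = {}
    for child in children:
        value = element_to_dict(child)
        if child.tag in result:
            # A repeated tag is collected into a list.
            if not isinstance(result[child.tag], list):
                result[child.tag] = [result[child.tag]]
            result[child.tag].append(value)
        else:
            result[child.tag] = value
    return result


def xml_to_json(xml_bytes: bytes) -> str:
    """Parse XML bytes and return a JSON string with non-ASCII text kept as-is."""
    # Passing bytes lets the parser honor the encoding in the XML declaration.
    root = ET.fromstring(xml_bytes)
    # ensure_ascii=False writes multi-byte characters directly, not as \uXXXX.
    return json.dumps({root.tag: element_to_dict(root)}, ensure_ascii=False)


# Hypothetical sample record containing multi-byte characters.
sample = (
    '<?xml version="1.0" encoding="UTF-8"?>'
    "<person><name>山田</name><city>東京</city></person>"
).encode("utf-8")
print(xml_to_json(sample))
```

When writing the result to a file, open it with an explicit `encoding="utf-8"` so the multi-byte characters are stored consistently.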