Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

I am looking for some information on an XML SerDe that can handle multi-byte character data. Please advise.

1 ACCEPTED SOLUTION

Super Guru

@milind pandit

@Joseph Niemiec mentioned the use of this XML SerDe: http://search.maven.org/remotecontent?filepath=com/ibm/spss/hive/serde2/xml/hivexmlserde/1.0.5.3/hiv...

I understand that you are looking for an XML SerDe, but you may be open to an alternative. NiFi provides the ConvertCharacterSet processor, which converts content from one character set to another. Maybe that helps.
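To illustrate what a character set conversion does to the data, here is a minimal Python sketch. It is not NiFi code and does not reflect how ConvertCharacterSet is implemented; the function name `convert_charset` and the sample ISO-8859-1 input are assumptions made for the example.

```python
def convert_charset(data: bytes, source: str, target: str = "utf-8") -> bytes:
    """Decode bytes from the source charset and re-encode them in the target charset."""
    return data.decode(source).encode(target)


# Hypothetical sample: "é" is one byte in ISO-8859-1 but two bytes in UTF-8.
latin1_xml = "<name>José</name>".encode("iso-8859-1")
utf8_xml = convert_charset(latin1_xml, "iso-8859-1")

print(len(latin1_xml), len(utf8_xml))
```

If the XML carries an encoding declaration in its prolog, that declaration would also need to match the new character set, otherwise a downstream parser may misread the bytes.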


3 REPLIES


@Constantin Stanca I am exploring the NiFi option because the IBM XML SerDe is having issues processing multi-byte characters.

Super Guru

Another option would be to pre-convert XML to JSON.
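As a rough sketch of what such a pre-conversion could look like, the Python snippet below uses only the standard library. The function names `element_to_dict` and `xml_to_json`, the sample record, and the mapping convention (attributes prefixed with "@", mixed text stored under "#text") are assumptions for illustration, not something prescribed in this thread.

```python
import json
import xml.etree.ElementTree as ET


def element_to_dict(elem):
    """Recursively map an element to a dict; repeated child tags become lists."""
    node = {f"@{key}": value for key, value in elem.attrib.items()}
    for child in elem:
        value = element_to_dict(child)
        if child.tag in node:
            # Second or later occurrence of the same tag: collect into a list.
            if not isinstance(node[child.tag], list):
                node[child.tag] = [node[child.tag]]
            node[child.tag].append(value)
        else:
            node[child.tag] = value
    text = (elem.text or "").strip()
    if not node:
        return text  # leaf element: just its text
    if text:
        node["#text"] = text
    return node


def xml_to_json(xml_bytes: bytes) -> str:
    """Parse XML bytes (honoring the declared encoding) and emit JSON text."""
    root = ET.fromstring(xml_bytes)
    # ensure_ascii=False keeps multi-byte characters readable in the output.
    return json.dumps({root.tag: element_to_dict(root)}, ensure_ascii=False)


# Hypothetical sample record containing multi-byte characters.
sample = "<?xml version='1.0' encoding='UTF-8'?><record id='1'><name>東京</name></record>".encode("utf-8")
result = xml_to_json(sample)
print(result)
```

Passing bytes rather than a decoded string lets the parser pick up the encoding from the XML declaration, which matters for multi-byte data.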