Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant.

query related to xml input to hive

Expert Contributor

Hi,

I am trying to import XML data into Hive.

Below is an example record, followed by the table definition:

<ns2:reqValAdd xmlns:ns2="http://www.ss.ss"></ns2:reqValAdd>

CREATE TABLE xml_test4(ns2 STRING)
ROW FORMAT SERDE 'com.ibm.spss.hive.serde2.xml.XmlSerDe'
WITH SERDEPROPERTIES (
  "column.xpath.ns2"="/ns2:ReqValAdd/@ns2"
)
STORED AS
  INPUTFORMAT 'com.ibm.spss.hive.serde2.xml.XmlInputFormat'
  OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.IgnoreKeyTextOutputFormat'
TBLPROPERTIES (
  "xmlinput.start"="<ns2:reqValAdd xmlns:ns2",
  "xmlinput.end"="</ns2:reqValAdd>"
);

The output comes back as NULL, while I am expecting "http://www.ss.ss".

Can you please suggest what is wrong and how to fix it?

thanks,

Rishit Shah

1 ACCEPTED SOLUTION

Expert Contributor

Thanks, but this was solved by using XPath.
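
The reply does not show which XPath expression was used, so the following is only a sketch of why the original query plausibly returned NULL. It is not the poster's actual fix. Two things stand out in the question: the XPath names the element ReqValAdd while the data has reqValAdd (XML names are case-sensitive), and @ns2 selects an attribute named ns2, whereas xmlns:ns2 is a namespace declaration, which a namespace-aware parser does not expose as an ordinary attribute. The example uses Python's standard library instead of Hive so it can be run standalone; the helper names namespace_uri and root_attributes are hypothetical.

```python
import xml.etree.ElementTree as ET

# Sample record copied from the question.
SAMPLE = '<ns2:reqValAdd xmlns:ns2="http://www.ss.ss"></ns2:reqValAdd>'


def root_attributes(xml_text):
    """Return the root element's ordinary attributes as a dict."""
    return dict(ET.fromstring(xml_text).attrib)


def namespace_uri(xml_text):
    """Return the namespace URI of the root element, or '' if it has none."""
    root = ET.fromstring(xml_text)
    # ElementTree stores a namespaced tag as '{uri}localname'.
    if root.tag.startswith("{"):
        return root.tag[1:].split("}", 1)[0]
    return ""


if __name__ == "__main__":
    # The xmlns:ns2 declaration is not listed among the attributes.
    print(root_attributes(SAMPLE))
    # The URI is reachable through the element's namespace instead.
    print(namespace_uri(SAMPLE))
```

In XPath 1.0 terms, the same value is normally obtained with the namespace-uri() function applied to the element, not with an attribute step. Whether the XML SerDe or Hive's xpath UDFs evaluate that expression in a namespace-aware way is an assumption that should be checked against the versions in use.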
