Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

How to load XML file into Hbase table? Can anyone knows please share steps with sample data?

How to load XML file into Hbase table? Can anyone knows please share steps with sample data?

New Contributor
 
1 REPLY 1
Highlighted

Re: How to load XML file into Hbase table? Can anyone knows please share steps with sample data?

Hello

XML has a specific structure which will probably change a little in the way you model it in Hbase, for example picking what is the rowkey or how xml fields get projected in column families. This previous statement may be true unless you want to store the while whole XML document just as a raw blob and with no work on it. This is another option

With this in mind, in the former approach, usually you would use a parsing engine or ETL to load the data in hbase with the right data model for Hbase. Popular choices would be Spark parsing and loading into Hbase, or a java job this github project may give you some ideas:

https://github.com/sreejithpillai/HBaseBulkImportXML