Created 09-01-2016 07:20 AM
Hi. What is - in your opinion - the best way to import XML file into Hive table?
Is there any way to import XML file to Hive directly? My currently idea is: import XML to Oracle table, and then import Oracle table to Hive using Sqoop. Do you have better idea?
Created 09-01-2016 07:44 AM
You can try a direct import using com.ibm.spss.hive.serde2.xml.XmlSerDe. Check https://community.hortonworks.com/content/kbentry/972/hive-and-xml-pasring.html and https://community.hortonworks.com/questions/40979/hive-xml-parising-null-value-returned.html for examples.
Created 09-01-2016 07:44 AM
You can try a direct import using com.ibm.spss.hive.serde2.xml.XmlSerDe. Check https://community.hortonworks.com/content/kbentry/972/hive-and-xml-pasring.html and https://community.hortonworks.com/questions/40979/hive-xml-parising-null-value-returned.html for examples.
Created 10-17-2017 08:28 AM
You can automate the whole process of generating ORC/Parquet for Hive in a relational structure. This blog post shows how to convert MISMO XML to Hive and Parquet
,You can automate the whole process of generating ORC/Parquet for Hive in a relational structure. This blog post shows how to convert MISMO XML to Hive and Parquet
Created 02-23-2019 11:03 AM
Try this NIFI groovy processor it converts XML files to CSV or AVRO