Member since
02-13-2019
1
Post
0
Kudos Received
0
Solutions
02-13-2019
07:11 PM
Spark is great for XML processing. It is based on a massively parallel distributed compute paradigm. I think you cam find some useful info in this examples:
https://stackoverflow.com/questions/33078221/xml-processing-in-spark
https://community.hortonworks.com/questions/71538/parsing-xml-in-spark-rdd.html
Also, check on https://anonymous-essay.com/ XSD/XML complexity. And finally you can view this thread to find out how do it without databricks package.
... View more