question Re: com.databricks.spark.xml parsing xml takes a very long time in Support Questions

https://community.cloudera.com/t5/Support-Questions/com-databricks-spark-xml-parsing-xml-takes-a-very-long-time/m-p/130456#M93142

Thanks Mark. I have looked into your suggestions, which have led me to LZO compression: http://blog.cloudera.com/blog/2009/11/hadoop-at-twitter-part-1-splittable-lzo-compression/

I think this may be what I try next. Do you have any suggestions on this? Doesn't HDP already come with LZO? The link is a good few years old. Should I try something else before I spend a few hours on this? My company is not keen on me spending a few hours writing a Java SequenceFile jar.

Mon, 20 Mar 2017 21:45:03 GMT, antin_leszczysz