09-04-2019 02:49 AM
Hello,
I'm just starting out and was wondering if there's a concrete way of setting the compression codec when writing files in Spark.
One way I've used is the "option" method when writing files:
exampleDF.write.option("compression", "snappy").avro("output path")
but when I check where the Avro files are saved, I can't tell from the file names whether they've been compressed or not. Also, just to note, this is after I've imported "com.databricks.spark.avro._", so I'm not having any trouble reading or writing Avro files in general.
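From what I understand, the codec isn't reflected in the file names anyway; Avro records it in the container-file header under the key "avro.codec", so it can be read back directly. A sketch of that check (the Avro library on the classpath and the part-file name below are assumptions, not something from my actual output):

```scala
import java.io.File
import org.apache.avro.file.DataFileReader
import org.apache.avro.generic.{GenericDatumReader, GenericRecord}

// Open one of the written part files (the path here is just a placeholder)
// and read the "avro.codec" entry from the Avro container-file header.
val reader = new DataFileReader(
  new File("output path/part-00000.avro"),
  new GenericDatumReader[GenericRecord]())

// Prints the codec actually used, e.g. "snappy", "deflate", or "null"
// (meaning uncompressed), regardless of what the file is named.
println(reader.getMetaString("avro.codec"))
reader.close()
```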
Another way I've seen is to use "sqlContext.setConf"; these are the commands I'd use in that case:
import org.apache.spark.sql.hive.HiveContext
val sqlContext = new HiveContext(sc)
sqlContext.setConf("spark.sql.avro.compression.codec", "snappy")
exampleDF.write.avro("output path")
Neither way causes any errors, so I was wondering which is the better approach, and whether there are any other, more reliable ways of setting the compression when writing files.
The version of Spark I'm using is 2.2.0.
Thanks in advance
Labels:
Apache Spark