Member since
11-12-2019
10-09-2022
07:22 PM
spark.session()
    .read()
    .option("encoding", "UTF-8")
    .option("delimiter", "^")
    .option("mode", "PERMISSIVE")
    .schema(SCHEMA_STORE.getIPDRschema())
    .csv(
        JavaConverters.collectionAsScalaIterableConverter(_files_to_process)
            .asScala()
            .toSeq())
    .withColumn("filename", org.apache.spark.sql.functions.input_file_name())
    .dropDuplicates();

This is written in Java; please convert it into Scala. Hope this will work 🙂
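A possible Scala equivalent of the Java snippet above, as a sketch: it assumes an existing SparkSession named spark, that the file list is already a Scala Seq[String] (here called filesToProcess, a hypothetical name, which removes the need for the JavaConverters call), and that SCHEMA_STORE.getIPDRschema() returns a StructType as in the Java version:

```scala
import org.apache.spark.sql.functions.input_file_name

// Sketch only: `spark` is an existing SparkSession and
// `filesToProcess: Seq[String]` holds the CSV paths (assumed names).
val df = spark.read
  .option("encoding", "UTF-8")
  .option("delimiter", "^")
  .option("mode", "PERMISSIVE")
  .schema(SCHEMA_STORE.getIPDRschema())
  .csv(filesToProcess: _*)                         // varargs overload takes the paths directly
  .withColumn("filename", input_file_name())       // record which file each row came from
  .dropDuplicates()
```

In Scala the parameterless methods read() and dropDuplicates() are conventionally called without parentheses where idiomatic, and the getter chain otherwise maps one-to-one.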
11-19-2019
06:27 AM
1 Kudo
Infuriatingly, the connector defaults to returning only 1000 rows. This doesn't seem to be documented anywhere I've found. The relevant configuration is exec.results.max, which can be passed in spark-shell by setting spark.datasource.hive.warehouse.exec.results.max. Add the following config to increase the maximum to 20000: --conf "spark.datasource.hive.warehouse.exec.results.max=20000" @russell786 @gopi_gogada
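Besides passing --conf at launch, the same property can in principle be set from inside a session — a sketch only, since whether the Hive Warehouse Connector honors a runtime conf.set (as opposed to a launch-time --conf) may depend on the HWC version:

```scala
// Sketch: raise the HWC result cap from code instead of the CLI.
// Assumes an existing SparkSession `spark`; verify on your HWC version
// that this takes effect when set after startup.
spark.conf.set("spark.datasource.hive.warehouse.exec.results.max", "20000")
```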