Product Announcements

Find the latest product announcements and version updates
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Who agreed with this topic

[ANNOUNCE] Cloudera Distribution of Apache Spark 2.2 Release 1

avatar
Super Collaborator

We are happy to announce Apache Spark 2.2 release 1. You can download the parcel and apply it directly to provisioned clusters without disrupting your currently running Spark workloads.

 

This component is generally available and is supported on CDH 5.8 through CDH 5.12.

 

What's New in Cloudera Distribution of Apache Spark 2.2 Release 1

  • Support for CDH 5.12 and associated features.
  • Support for using Spark 2 jobs to read and write data on the Azure Data Lake Store (ADLS) cloud service.
  • Cloudera Distribution of Apache Spark 2.2 requires JDK 8.

 

Issues Fixed in Cloudera Distribution of Apache Spark 2.2 release 1

  • [SPARK-10364][SQL] Support Parquet logical type TIMESTAMP_MILLIS
  • [SPARK-10849][SQL] Adds option to the JDBC data source write for user to specify database column type for the create table
  • [SPARK-12868][SQL] Allow adding jars from HDFS
  • [SPARK-14503][ML] spark.ml API for FPGrowth
  • [SPARK-16101][HOTFIX] Fix the build with Scala 2.10 by explicit typed argument
  • [SPARK-16122][CORE] Add rest api for job environment

 

For a full list of fixed issues, see the list here.

Download Cloudera Distribution of Apache Spark 2.2 release 1.

Read the documentation.

Want to become a pro Spark user?  Sign up for Apache Spark Training.

Who agreed with this topic