Reply
Highlighted
Posts: 354
Topics: 162
Kudos: 60
Solutions: 27
Registered: ‎06-26-2013

Announcing: Cloudera Enterprise 5.1, Cloudera Navigator 2.0, and Impala 1.4 Released

Dear CDH, Cloudera Manager, Impala and Search Users,

 

 

We are thrilled to announce the GA release of Cloudera Enterprise 5.1 (CDH 5.1 and Cloudera Manager 5.1),  Cloudera Navigator 2.0, Impala 1.4 for CDH 4, Impala ODBC v2.5.17 and Hive ODBC v2.5.10.

 

Cloudera Enterprise 5.1

Cloudera Enterprise 5.1 contains a number of new features and component versions including the ones below:

  • Apache Hadoop

    • HDFS

      • Extended ACL Support

      • NFS Enhancements

      • HTTPS support

    • YARN

      • YARN + Impala GA: CDH 5.1 now supports three modes of Resource Management:

        • Static Partitioning (CDH 4.3+)

        • Admission Control (CDH 5.0+)

        • Dynamic Prioritization (CDH 5.1+)

      • As part of Dynamic Prioritization, Llama now supports HA

      • Configuration similar to other HA elements (Active-Passive)

  • Apache HBase

    • HBase 0.98.1 rebase

    • Improved WAL write performance

    • Reverse scans

    • MapReduce over snapshots

  • Apache Spark

    • Rebase to Spark 1.0

    • Spark Streaming integration with Kerberos

    • Application History Server improves monitoring capabilities

    • Sparse Vector Support in MLLib

    • Improvements to Avro integration

    • Simplified job submission to YARN cluster

    • Security: Authentication of all Spark communications

  • Apache Sentry

    • Rebase to 1.3

    • Support for Grant/Revoke statements via beeline (HiveServer2)

    • Policy files no longer required

    • Sentry fixes: job tracking in Hue, HDFS permissions inheritance

  • Impala

    • Release of  Impala 1.4.0

    • YARN-integrated Resource Management is GA

    • DECIMAL across Hive, Impala, Avro, Parquet, and text file formats

    • Integration with HDFS Caching

    • Fixed folder inheritance issues for better framework interoperability (IMPALA-827)

    • LDAPS Support for more secure AD username/password authentication

    • ORDER BY without LIMIT

    • Performance improvements for COMPUTE STATS and queries

    • DDL support for creating a new table from an existing Parquet file schema

    • DDL support for creating Avro tables

    • Additional built-in functions like TRUNC, EXTRACT, and stats functions

  • Apache Hive

    • DECIMAL across Hive, Impala, Avro, Parquet, and text file format

    • Fixed folder inheritance issues for better framework interoperability (HIVE-6892)

  • Cloudera Search

    • Document Level Security

    • Parquet Support

    • Search QuickStart Document and Script

    • HBase Lily Indexer update

  • Apache Oozie

    • Added ability to submit Sqoop jobs from Oozie CLI (Knox competitive)

    • Completed testing of JMS/Email notification for workflow and Coordinator action status changes

    • Completed testing of SLA notifications

    • Replaced/Improved Metrics instrumentation (for Cloudera Manager monitoring)

    • LAST_ONLY execution mode now works correctly

  • Cloudera Manager

    • Fine grain access for Cloudera Manager

      • Two additional roles in Cloudera Manager - Operator and Configurator

    • Impala Llama HA support

    • Security Updates

      • Integrate directly with Active Directory to set up kerberos principals needed for CDH daemons

      • New wizard to add kerberos to an existing unsecure cluster

      • Manage and deploy kerberos client configurations (krb5.conf).

    • Sentry support for Grant/Revoke

    • Support for SSL

      • Manage Hadoop SSL related configurations

      • Monitor Hadoop services when SSL is enabled

    • Monitoring

      • Updates to Oozie monitoring

      • New Hive Metastore Canary

  • Hue

    • Rebase to 3.6

    • Search App v2

      • 100% dynamic dashboards

      • Drag and drop dashboard builder

      • Text, Timeline, Pie, Line, Bar, Map, Filters, Grid, and HTML Widgets

      • Solr Index Designer (basic up-and-running wizard, from a file in HDFS)

    • View support for Snappy, Avro, Parquet files

    • Multi NT domain

    • LDAP Nested Groups

    • Impala HA

    • Close commands for Impala/Hive sessions and queries

  • Other Rebases

    • Apache Mahout - 0.9

    • Apache Crunch - 0.10

    • Flume 1.5.0

 

We look forward to you trying it out using the information below:

 

 

As always, we are happy to hear your feedback. Please send your comments and suggestions to the user group or through our community forums.  You can also file bugs through our external jira projects on issues.cloudera.org.  

 

Cloudera Navigator 2.0

Cloudera Navigator is a fully-integrated governance, compliance, and risk management solution for Hadoop. Cloudera Navigator provides comprehensive metadata, lineage, and auditing support across the enterprise data hub.

 

  • Unified technical and business metadata

    • Consolidate technical metadata for Hadoop files and tables, plus add custom tags and comments so that you can easily track, annotate and classify data in alignment with business rules.

  • Lineage

    • View upstream and downstream column-level lineage in an easy-to-follow graph so that you can quickly identify the origin of a data set and its impact on downstream analysis.

    • Supports Hive, Pig, Sqoop 1, MapReduce, YARN, Oozie

  • Auditing

    • View and summarize all data access attempts with a simple, queryable interface so you can quickly identify outliers and security breaches

    • Supports Hive, Impala, HBase, HDFS, Sentry

  • Security

    • Cloudera Navigator includes Navigator Encrypt and Navigator Key Trustee, formerly known as Gazzang zNcrypt and Gazzang zTrustee. These products provide enterprise-grade encryption and key management.

  • Links

 

Impala 1.4 on CDH4

Impala 1.4 for CDH4 contains features that are included in Cloudera Enterprise 5.1 which are supported by CDH4:

  • Fixed folder inheritance for Sentry (IMPALA-827)

  • LDAPS Support for more secure AD username/password authentication

  • ORDER BY without LIMIT

  • DDL support for creating a new table from an existing Parquet file schema

  • DDL support for creating Avro tables

  • Fixed folder inheritance issues for better framework interoperability (IMPALA-827)

  • Performance improvements for COMPUTE STATS and queries

  • Additional built-in functions like TRUNC, EXTRACT, and stats functions 

Note that Impala 1.4 does not work with Cloudera Manager 4.7 or earlier. If your environment is managed by Cloudera Manager 4.7 or earlier, you should upgrade to Impala 1.4 and Cloudera Manager 4.8 together.

 

For more details on new Impala features, please see “New Features in Impala” for CDH 4: http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Cloudera-Impala-Release...

 

“Installing and Using Cloudera Impala” guide for CDH 4: http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Installing-and-Using-Im...

 

As always, we are happy to hear your feedback. Please send your comments and suggestions to impal...@cloudera.org or through our new community forums.  You can also file bugs in the Impala project at issues.cloudera.org  

 

Impala ODBC 2.5.17 and Hive ODBC 2.5.10

The new ODBC drivers include the following new functionality:

  • Impala ODBC fixes the default fetch size for faster client returns

  • Improved Mac OSX client installation

We look forward to you trying it out using the links below:

Announcements