Member since
06-26-2013
354
Posts
68
Kudos Received
27
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 5876 | 04-28-2015 09:33 AM | |
| 4198 | 11-14-2014 09:17 AM | |
| 6877 | 11-08-2013 09:48 AM |
08-15-2016
12:42 PM
Hello CDH Users, We are pleased to announce the release of Hive ODBC v2.5.20 driver. This release has the following fixes and enhancements: Enhancements & New Features Delegate Kerberos credentials. You can now have the driver forward your Kerberos user credentials to the server to simplify the authentication process. Optimized Fast SQLPrepare behavior. The Fast SQLPrepare driver configuration option (the FastSQLPrepare key) is now disabled for non-SELECT queries. This ensures that the driver retrieves the necessary result set metadata at prepare time. Resolved Issues Unicode characters in parameter values causing errors. Returning errors for some queries with dates in them. Unable to create new tables when Unicode character types option set. Getting Started with the Cloudera Driver Read the Cloudera ODBC 2.5.20 Driver for Hive Release Notes and Installation Guide. Download the connector from the Cloudera Connectors page. As always, we welcome your feedback. Please send your comments and suggestions to the user group or through our community forums. You can also file bugs through our external JIRA projects on issues.cloudera.org.
... View more
07-28-2016
03:58 PM
1 Kudo
Dear CDH, Cloudera Manager, and Cloudera Navigator users,
We are pleased to announce the release of Cloudera Enterprise 5.7.2 (CDH 5.7.2, Cloudera Manager 5.7.2, and Cloudera Navigator 2.6.2).
This release fixes key bugs and includes the following:
CDH fixes for the following issues:
HADOOP-11409 - FileContext.getFileContext can stack overflow if default fs misconfigured
HADOOP-12787 - KMS SPNEGO sequence does not work with WebHDFS
HADOOP-13251 - Authenticate with Kerberos credentials when renewing KMS delegation token
HDFS-10360 - DataNode may format directory and lose blocks if current/VERSION is missing
HBASE-11625 - Reading data block throws "Invalid HFile block magic" and can not switch to hdfs checksum
HIVE-9499 - hive.limit.query.max.table. partition makes queries fail on non-partitioned tables
HIVE-10685 - Alter table concatenate operator will cause duplicate data
IMPALA-1928 - Fix Thrift client transport wrapping order
HUE-4113 - [Pig] Hue breaks when user has only access to pig app
SQOOP-2846 - Sqoop Export with update-key failing for avro data file
For a full list of upstream JIRAs fixed in CDH 5.7.2, see the issues fixed section of the Release Notes.
Cloudera Manager fixes for the following issues
Unable to start Hue on cluster that's using Kerberos and Isilon, Hue service can now start with Isilon if Kerberos is enabledXSS in Kerberos activation: In lower releases, there was an XSS vulnerability on the Kerberos page. This is now fixed
XSS in host addition: In lower releases, there was an XSS vulnerability on the Add Hosts page. This is now fixed.
Cloudera Manager Agent clears out JN data directories that leads to HDFS not restarting
Files excluded from replication are not replicated if they are renamed: If a file excluded from replication by an exclusion filter is renamed, it is now replicated properly
For a full list of issues fixed in Cloudera Manager 5.7.2, see the issues fixed section of the Release Notes.
Cloudera Navigator fixes for the following issues:
java.lang. IllegalArgumentException error causes Navigator to crash
Navigator Metadata Server fails with nav.batch and a high amount of heap
For a full list of issues fixed in Cloudera Navigator 2.6.2, see the issues fixed section of the Release Notes.
We look forward to you trying it, using the information below:
Download Cloudera Enterprise
View the documentation
As always, we are happy to hear your feedback. Please send your comments and suggestions to the user group or through our community forums. You can also file bugs through our external JIRA projects on issues.cloudera.org.
... View more
07-21-2016
10:53 AM
Cloudera is excited to announce the general availability of Cloudera Enterprise 5.8! Main highlights of this release include Impala read/write support on Amazon S3, a redesigned SQL query editor GUI, the expansion of role-based access control functionality to Cloudera Search, and the GA of Cloudera Navigator Optimizer to facilitate and optimize workload migrations.
For those new to it, Cloudera Navigator Optimizer (previously in beta) is a cloud-based service that helps with offload planning and active data optimization for Apache Hadoop. For example, it provides workload visibility and assessments for building a predictable offload plan, adapting to evolving Hadoop data and workload demands, and optimizing query performance in Impala and Apache Hive. (For more details about Cloudera Navigator Optimizer, check out the release blog post.)
As usual, Cloudera Enterprise 5.8 contains a long list of quality enhancements and bug fixes (learn more about our multi-dimensional hardening/QA process) and other improvements across the stack. Here is a partial list of those improvements (see the Release Notes for a full list):
Performance & Scale
Impala queries now 3x faster on Kerberized clusters
Significant performance improvements to Hive metadata replication in BDR
Usability & Management
Streamlined offload planning and active data optimization with Cloudera Navigator Optimizer
Redesigned SQL editor in Hue
Hive jobs can now be assigned to specific YARN resource pools based on Sentry policy (instead of default “hive” pool)
Native Cloud Support
Apache Sentry support for Amazon S3
Impala can now read and write to Amazon S3
Security & Governance
Role-based access control (Sentry) support for Impala and Hive queries over Amazon S3
Sentry support for Cloudera Search
New policy support for managed metadata assignment
Navigator SDK support for Navigator Optimizer integration
New or Updated Open Source Components
Impala 2.6
Hue 3.10
New or Updated Platform Support
Debian 8.2
Oracle JDK 1.8u74 and 1.8u91
Over the next few weeks, we’ll publish blog posts that cover some of these features in detail. In the meantime:
Download Cloudera Enterprise 5.8
Explore documentation
As always, we value your feedback; please provide any comments and suggestions through our community forums. You can also file bugs via issues.cloudera.org.
... View more
04-15-2016
08:25 AM
All,
Cloudera's docs have a new look and improved performance & usability. Read the details here:
http://blog.cloudera.com/blog/2016/04/check-out-those-new-and-improved-cloudera-docs/
We look forward to your feedback!
... View more
04-12-2016
02:46 PM
The organizers of HBaseCon, the conference for the Apache HBase community, have published the agenda for the conference (May 24, 2016, in San Francisco)—and once again, the impressive geographical and use-case diversity of HBase are on full display.
See full agenda/register here:
http://hbasecon.com
... View more
04-07-2016
04:30 PM
A new release of the RecordService beta (0.3.0) is now available; see details here:
http://community.cloudera.com/t5/Beta-Releases-Apache-Kudu/ANNOUNCE-RecordService-0-3-0-Released/m-p/39488#M190
... View more
04-07-2016
04:21 PM
Cloudera Enterprise 5.7 is now generally available (comprising CDH 5.7, Cloudera Manager 5.7, and Cloudera Navigator 2.6).
Cloudera is excited to announce the general availability of Cloudera Enterprise 5.7! Main highlights of this release include production-ready Hive-on-Spark functionality, which will help users accelerate their use of Apache Spark as a data processing standard; 2x performance gains for Apache Impala (incubating); easier cluster configuration and utilization reporting; and end-to-end encryption for Apache Spark data.
The release also contains a long list of incremental improvements across the stack, in addition to the usual hundreds of bug fixes (some of which were uncovered during our multi-dimensional hardening/QA process). Here is a partial list of those improvements (see the Release Notes for a full list):
Performance & Scale
Hive-on-Spark GA (graduates from Cloudera Labs)
2x performance gains for Impala: Better join ordering and cardinality estimation, faster query startup, codegen and code optimizations, more
Support for the Apache HBase WAL on SSD
Support for the HBase-Spark module (graduates from Cloudera Labs)
Dramatic performance improvement for backups/DR
Usability & Management
New per-tenant cluster utilization reporting for YARN and Impala
Support for portable, scriptable, and versionable cluster configuration
New SQL formatting in HUE query editor
Security & Governance
Improved Apache Sentry HDFS sync feature
Encryption over the wire/on disk for Spark data
Support for Kerberos and LDAP auth on the same HiveServer2 instance
New “business views” for data lineage; new managed/secure metadata within Cloudera Navigator
New or Updated Open Source Components
Apache Spark 1.6 (including support for Spark SQL and Dataframes in PySpark and the spark.ml package and Pipelines API)
Apache HBase 1.2
Apache Impala (incubating) 2.5
Apache Kafka 0.9 (separate install)
New or Updated Platform Support
RHEL/CentOS/OEL 7.2
SLES 11 SP4
Debian 7.8
JDK 7_80 and JDK 8_60
Over the next few weeks, we’ll publish blog posts that cover some of these features in detail. In the meantime:
Install Cloudera Enterprise 5.7
Explore documentation
As always, we value your feedback; please provide any comments and suggestions through our community forums. You can also file bugs via issues.cloudera.org.
... View more
03-15-2016
05:42 PM
Dear Cloudera Users,
We are pleased to announce the general availability of the Cloudera Connector Powered by Teradata 1.5. This release fixes a compatibility issue with CDH 5.5.0 and later. See the download page for more details.
For more details on new features and usage of Cloudera Connector Powered by Teradata, see:
Release Notes Cloudera Connector Powered by Teradata version 1.5
Cloudera Connector Powered by Teradata User Guide, version 1.5
As always, we welcome your feedback. Please send your comments and suggestions through our new community forums. You can also file bugs in the CDH project at issues.cloudera.org.
... View more
02-19-2016
01:11 PM
Dear CDH Users,
We are pleased to announce the release of the Cloudera Distribution of Apache Kafka 2.0 for CDH 5.
Apache Kafka is a highly scalable, distributed, publish-subscribe messaging system. This release is based on Apache Kafka 0.9, and adds security features such as Kerberos authentication, wire encryption, secure mirroring, a new consumer API, per-user throttling, and many other features and bug fixes that solidify Kafka as an enterprise production-grade component of the Hadoop ecosystem. Kafka 2.0 also ships with new management tooling in Cloudera Manager, for point-and-click configuration of each new capability.
New Features in Cloudera Distribution of Apache Kafka 2.0
Kafka is rebased on Apache Kafka 0.9: http://archive.apache.org/dist/kafka/0.9.0.0/RELEASE_NOTES.html.
Kerberos authentication of connections from clients and other brokers, including to ZooKeeper.
Wire encryption of communications from clients and other brokers using SSL.
A new client API for consumers (Java).
A refactored, secure MirrorMaker to prevent data loss and improve reliability of cross-data center replication.
Per-user quotas to throttle producer and consumer throughput in a multitenant cluster.
Requirements for Cloudera Distribution of Apache Kafka 2.0
Cloudera Manager 5.5.3
Any CDH 5.x release is supported.
Notable Issues Fixed in Cloudera Distribution of Apache Kafka 2.0
Notable fixes backported into Kafka 2.0:
KAFKA-2799: WakupException thrown in the followup poll() could lead to data loss
KAFKA-2942: Inadvertent auto-commit when pre-fetching can cause message loss
KAFKA-2878: Kafka broker throws OutOfMemory exception with invalid join group request
KAFKA-2882: Add constructor cache for Snappy and LZ4 Output/Input stream in Compressor.java
KAFKA-2913: GroupMetadataManager unloads all groups in removeGroupsForPartitions
KAFKA-2880: Fetcher.getTopicMetadata NullPointerException when broker cannot be reached
KAFKA-2950: Fix performance regression in the producer
KAFKA-2973: Fix leak of child sensors on remove
KAFKA-2978: Consumer stops fetching when consumed and fetch positions get out of sync
KAFKA-2988: Change default configuration of the log cleaner
KAFKA-3012: Avoid reserved.broker.max.id collisions on upgrade
All backported fixes can be viewed in the git release notes here.
We look forward to you trying Kafka 2.0! , For more information, please use the links below:
Install or upgrade Kafka
Review the documentation
Review the Release Notes
As always, we welcome your feedback. Please send your comments and suggestions through our community forums.
... View more
12-10-2015
10:11 AM
We are pleased to announce the release of the Cloudera Distribution of Apache Kafka 1.4.0 for CDH 5. Apache Kafka is a distributed publish-subscribe messaging system. This release is based on Apache Kafka 0.8.2, adds support for distribution as a package as well as a parcel, and includes fixes for key issues.
New Features:
Cloudera Distribution of Apache Kafka 1.4 is now distributed via native packages as well as a parcel
Notable Fixes:
KAFKA-2633: Default logging from tools to Stderr.
KAFKA-1664: Kafka does not properly parse multiple ZK nodes with non-root chroot.
KAFKA-2477: Fix a race condition between log append and fetch that causes OffsetOutOfRangeException.
KAFKA-2024: Cleaner can generate unindexable log segments.
KAFKA-2118: Cleaner can not clean after shutdown during replaceSegments.
We look forward to you trying it out, using the information below:
Install or upgrade Kafka
Review the Documentation
Review the Release Notes
As always, we welcome your feedback. Please send your comments and suggestions through our community forums.
... View more