02-10-2014
05:34 PM
Cloudera has released the Beta 2 version of Cloudera Enterprise 5 (comprising CDH 5.0.0 and Cloudera Manager 5.0.0).
This release (download) contains a number of new features and component versions including the ones below:
New Components:
  - Apache Spark 0.9
  - Apache Crunch 0.9
  - Parquet 1.2.5
  - Kite SDK 0.10
  - Apache Avro 1.7.5
Apache Hadoop:
  - Rebase to Hadoop 2.2.0+
HDFS:
  - HDFS Caching
  - NFS gateway
Cloudera Manager:
  - Support for Apache Spark
  - Oozie and YARN Resource Manager High Availability
  - Extensibility to support add-on services
  - Enhancements to Monitoring & Charts
Apache HBase:
  - Rebase to HBase 0.96.1.1
Impala:
  - Rebase to Impala 1.2.3
Apache Flume:
  - New Kite Dataset Sink
Apache Hive:
  - Hive 0.12 Rebase
  - Improved JDBC spec coverage
  - SSL encryption support on non-Kerberos authentication
  - Native Parquet support
Apache Sqoop:
  - Rebase to Sqoop 1.99.3
Apache Oozie:
  - Support for cron-like scheduling
Hue:
  - Single Sign On (SSO) support
  - Graphical facets in Search application
  - Result graphing for Hive and Impala
Apache Pig:
  - Rebase to Pig 0.12.0
  - Rebase to DataFu 1.1.0
Cloudera strongly recommends that you install Cloudera Enterprise 5.0 Beta 2 and CDH 5 Beta 2 on test clusters or new clusters only. For existing clusters, Cloudera fully supports upgrading from Cloudera Enterprise 4.x and CDH 4.x to Cloudera Enterprise 5.0 Beta 2 and CDH 5 Beta 2. If you are a customer and need to upgrade a cluster running Cloudera Enterprise 5.0 Beta 1 or CDH 5 Beta 1 to Beta 2, contact Cloudera Support for further instructions. If you are not a customer, ask for assistance in the Cloudera Manager forum.
Please note that Cloudera Manager 5.0 beta 2 does not support CDH 5.0 beta 1.
As part of the open Beta, we encourage the community to try it out. Here is how you can get started:
Download Cloudera Enterprise from cloudera.com/downloads, install it, and try it out.
View the documentation:
CDH 5 Beta 2 Documentation
Cloudera Manager 5 Beta 2 Documentation
Impala Beta 2 Documentation
Search Beta 2 Documentation
Once you get started, we encourage you to provide feedback using any of the following methods:
Ask questions and provide comments on our Beta community forum. Click here to join.
File a bug through our public Jira at:
https://issues.cloudera.org/browse/DISTRO
https://issues.cloudera.org/browse/CM
We look forward to hearing about your experiences of Cloudera Enterprise 5 Beta 2.
02-07-2014
08:15 AM
We are pleased to announce the release of Kite 0.11.0. The main new feature in this release is the views API, which lets you work with a subset of a dataset selected by logical constraints such as field matching or ranges. There are also assorted Morphlines updates, several bug fixes, and minor improvements. For more information, see the release notes and the documentation.
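To make the idea of a view concrete, here is a minimal sketch in plain Python. It is not the Kite API (Kite is a Java library); the `View` class and its `with_value` and `in_range` methods are hypothetical names chosen only to illustrate how field-matching and range constraints can narrow a dataset without copying it.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Iterator, Tuple

Record = Dict[str, Any]
Predicate = Callable[[Record], bool]


@dataclass(frozen=True)
class View:
    """Hypothetical view: the full dataset plus a tuple of constraints.

    Constraints are only evaluated when the view is iterated, so refining
    a view is cheap and never modifies the underlying records.
    """
    records: Tuple[Record, ...]
    constraints: Tuple[Predicate, ...] = ()

    def _refine(self, predicate: Predicate) -> "View":
        # Return a new view with one more constraint appended.
        return View(self.records, self.constraints + (predicate,))

    def with_value(self, name: str, value: Any) -> "View":
        """Field matching: keep records whose field equals the value."""
        return self._refine(lambda r: r.get(name) == value)

    def in_range(self, name: str, low: Any, high: Any) -> "View":
        """Range constraint: keep records with low <= field <= high."""
        return self._refine(lambda r: name in r and low <= r[name] <= high)

    def __iter__(self) -> Iterator[Record]:
        # A record belongs to the view only if every constraint accepts it.
        return (r for r in self.records
                if all(check(r) for check in self.constraints))


if __name__ == "__main__":
    events = (
        {"user": "alice", "ts": 5},
        {"user": "bob", "ts": 7},
        {"user": "alice", "ts": 12},
    )
    recent_alice = View(events).with_value("user", "alice").in_range("ts", 0, 10)
    for record in recent_alice:
        print(record)
```

Each refinement returns a new view, so a base view can be shared and narrowed in different ways. For the real interface, refer to the Kite documentation.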
02-03-2014
03:42 PM
Dear CDH and Cloudera Manager users,
We are pleased to announce the general availability of Cloudera Manager 4.8.1.
Cloudera Manager 4.8.1 is a maintenance release with a number of bug fixes and usability improvements.
For more details, see the release notes: http://www.cloudera.com/content/cloudera-content/cloudera-docs/CM4Ent/latest/Cloudera-Manager-Release-Notes/Cloudera-Manager-Release-Notes.html
All of the Cloudera Manager documentation can be found here:
http://www.cloudera.com/content/support/en/documentation/manager/cloudera-manager-v4-latest.html
You can download this latest Cloudera Manager here or you can upgrade from an existing installation of Cloudera Manager.
Installation and upgrade documentation can be found here.
02-03-2014
02:25 PM
Cloudera is pleased to announce the immediate availability of its first release of Apache Spark for Cloudera Enterprise (comprising CDH and Cloudera Manager).
Spark was created and contributed to the Apache Software Foundation by UC Berkeley, and it has quickly gained adoption for machine learning, interactive analytics, and streaming analytics over large datasets. It features a general programming model for writing applications by composing arbitrary operators, such as mappers, reducers, joins, group-bys, and filters. Spark keeps track of the data that each of the operators produces, enabling applications to reliably store this data in memory, which makes it ideal for low-latency computations and efficient iterative algorithms. Spark applications can be up to 100x faster and require writing 2x to 10x less code than equivalent MapReduce applications.
Cloudera provides enterprise support for Spark through Cloudera Enterprise Flex Edition (as an optional component) and Data Hub Edition (as an included component) subscriptions. This release provides Spark 0.9.0 tested for use with Spark Standalone Mode on CDH 4, from 4.4.0 forward. Expect releases for Cloudera Enterprise 5 (comprising CDH 5 and Cloudera Manager 5) and Spark on YARN in the near future.
To get started now, you can follow these instructions to install Spark using parcels with Cloudera Manager. The instructions will also walk you through the basic configuration, and a simple WordCount example on Spark.
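As a rough illustration of the operator-composition model described above, the sketch below builds WordCount from a flat-map, a filter, a map, and a reduce-by-key. It is plain Python running on in-memory lists, not the Spark API; the helper names `flat_map`, `reduce_by_key`, and `word_count` are assumptions made for this example and only mirror the shape of the operators a Spark application would chain.

```python
from collections import defaultdict
from functools import reduce
from operator import add


def flat_map(func, records):
    """Apply func to each record and flatten the resulting iterables."""
    return [item for record in records for item in func(record)]


def reduce_by_key(func, pairs):
    """Group (key, value) pairs by key, then fold each group's values with func."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return {key: reduce(func, values) for key, values in groups.items()}


def word_count(lines):
    """Compose the operators: split lines, drop non-words, pair, and sum."""
    words = flat_map(lambda line: line.lower().split(), lines)  # flat-map
    words = [w for w in words if w.isalpha()]                   # filter
    pairs = [(w, 1) for w in words]                             # map
    return reduce_by_key(add, pairs)                            # reduce by key


if __name__ == "__main__":
    sample = ["spark keeps data in memory", "spark composes operators"]
    print(word_count(sample))
```

In Spark, the intermediate results of such a chain can be kept in memory across the cluster, which is what makes iterative workloads efficient. This single-process sketch shows only the composition pattern.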
Once you get going, we would love to hear your feedback:
You can ask questions, get help, and share your growing expertise on our community forum for questions about Spark.
You can file a bug through our public Jira instances.
For issues with Spark please use https://issues.cloudera.org/browse/DISTRO.
For issues with the beta integration with Cloudera Manager, please use https://issues.cloudera.org/browse/CM.
01-18-2014
07:01 AM
Apologies - it's difficult after the fact to say why this problem may have occurred. I presume it's not related to the previously reported issue?
Have you checked your profile page for an auto-saved draft?
12-23-2013
09:54 AM
Dear Cloudera Impala users,
We would like to announce the general availability of Cloudera Impala 1.2.3. This release fixes a bug where Impala couldn’t read Parquet files written by MapReduce.
Note that Impala 1.2.3 does not work with Cloudera Manager 4.7 or earlier. If your environment is managed by Cloudera Manager, you should upgrade to Impala 1.2.3 and Cloudera Manager 4.8 together.
For more details on new Impala features, please see “New Features in Impala”: http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Cloudera-Impala-Release-Notes/cirn_new_features.html
“Installing and Using Cloudera Impala” guide: http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Installing-and-Using-Impala/Installing-and-Using-Impala.html
As always, we are happy to hear your feedback. Please send your comments and suggestions to impala-user@cloudera.org or through our new community forums. You can also file bugs in the Impala project at issues.cloudera.org.
12-18-2013
10:59 AM
1 Kudo
Hey Chuck, you are correct that some Hive users will prefer to take advantage of Impala (or Shark); my only point is that those solutions were not designed to displace Hive. CDH 5 (currently in beta) will ship with Hive 0.12, which contains all Stinger code that has gone upstream.
12-17-2013
02:19 PM
Dear CDH, Cloudera Manager, and Cloudera Impala users,
We are pleased to announce the general availability of Cloudera Impala 1.2.2 and version 2.5.13 of the Cloudera ODBC Driver for Cloudera Impala.
Impala 1.2.2 provides key enhancements including a number of the most requested features:
Cost-based join order optimizations
Compute statistics for tables and columns directly from Impala
Cross join
Preliminary support for clients to securely and more easily authenticate with their username and password via LDAP or Active Directory
Cloudera ODBC Driver for Impala version 2.5.13 includes:
Ability to authenticate with a username and password
Ability to secure client traffic via encrypted SSL
Ability for multi-tiered applications to proxy via delegation IDs
Note that Impala 1.2.2 does not work with Cloudera Manager 4.7 or earlier. If your environment is managed by Cloudera Manager, you should upgrade to Impala 1.2.2 and Cloudera Manager 4.8 together.
For more details on new Impala features, please see “New Features in Impala”: http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Cloudera-Impala-Release-Notes/cirn_new_features.html
“Installing and Using Cloudera Impala” guide: http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Installing-and-Using-Impala/Installing-and-Using-Impala.html
As always, we are happy to hear your feedback. Please send your comments and suggestions to impala-user@cloudera.org or through our new community forums. You can also file bugs in the Impala project at issues.cloudera.org.
12-16-2013
05:12 PM
Hi Chuck,
My observations:
1. First, keep in mind that Impala and Hive have different use cases. Impala offers the low latency and high concurrency that analysts doing BI-style queries are going to expect. In contrast, Hive/MR is still more appropriate for batch-oriented processing.
2. Based on #1, it stands to reason that any and all improvements to Hive are good news insofar as they help users with those workloads. To that end, Cloudera employs Hive committers, actively contributes code to Hive (e.g., HiveServer 2), and provides complementary infrastructure (e.g., the incubating Apache Sentry project for RBAC, which is built for both Hive and Impala and which we hope is embraced by the entire ecosystem).
3. Shark (which is a Hive port actually, not an "improvement" to Hive) is another example of having the right tool for the right job. I think most would agree with the premise that Shark is generally used for complex analytics/iterative machine learning, not "mainstream" BI.
12-10-2013
10:23 AM
We are pleased to announce a new name for the CDK: Kite. We've just released Kite version 0.10.0, which is purely a rename of CDK 0.9.0. The new repository and documentation are:
Kite repository: https://github.com/kite-sdk/kite
Kite documentation: http://kitesdk.org/
Kite examples: https://github.com/kite-sdk/kite-examples
Why rename? The goal of Kite, and CDK, is to increase the accessibility of the platform. That isn't Cloudera-specific, and we want the name to correctly represent the project as an open, community-driven set of tools.
What will this break? The rename mainly affects dependencies and package names. Once imports and dependencies are updated, almost everything should work the same. However, there are a couple of configuration changes to make for anyone using Flume or Morphlines. The changes are detailed on our migration page.
Again, this 0.10.0 release is a rename only. There are no feature changes, and 0.9.0 will be supported as long as 0.10.0, so that there is plenty of time to make a smooth transition. For more information, see the release notes.
Cheers,
The Kite/CDK team