Member since
06-26-2013
354
Posts
68
Kudos Received
27
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 4408 | 08-05-2016 10:36 AM | |
| 7137 | 06-02-2016 04:57 PM | |
| 7569 | 05-31-2016 03:47 PM | |
| 6400 | 04-11-2016 11:26 AM | |
| 12057 | 03-07-2016 02:04 PM |
12-04-2014
03:51 PM
Dear CDH, Cloudera Manager, Impala and Cloudera Navigator users,
We are pleased to announce the release of Cloudera Enterprise 5.1.4, Cloudera Enterprise 5.0.5, Cloudera Navigator 2.0.3, CDH 4.7.1, Cloudera Manager 4.8.5 and Impala 1.4.3 and 2.0.1 for CDH 4.
The focus of these releases is to fix the POODLE vulnerability in SSL that was discovered in September.
In addition, Cloudera Enterprise 5.1.4 contains some critical bug fixes including:
Fix for duplicate actions when using the CRON syntax in Apache Oozie
Kerberos ticket renewal fixes in Apache Sentry
Fix for Spark to work with YARN HA
Fixes in Hive to handle white spaces, delimiters, escape sequences, and delegation token cancellation
Cloudera Navigator 2.0.3 includes
Fix an issue with MR and HDFS extractions where solr query OR clause has too many boolean clauses.
We encourage you to try it out using the information below:
Download Cloudera Enterprise from: http://www.cloudera.com/ content/support/en/downloads. html
View the documentation:
CDH 4 Release Notes
Cloudera Manager 4 Release notes
CDH 5 Release Notes
Cloudera Manager 5 Release Notes
Cloudera Navigator Release Notes
Cloudera Documentation
As always, we are happy to hear your feedback. Please send your comments and suggestions to the user group or through our community forums. You can also file bugs through our external jira projects on issues.cloudera.org.
... View more
12-02-2014
03:20 PM
Dear CDH, Cloudera Manager, Impala and Cloudera Navigator users,
We are pleased to announce the release of Cloudera Enterprise 5.2.1 (CDH 5.2.1, Cloudera Manager 5.2.1, and Cloudera Navigator 2.1.1)
This release is focused on fixing key bugs and includes the following:
CDH Fixes
Oozie: Using cron-like syntax for Coordinator frequencies could result in duplicate actions in certain cases; this is now fixed (OOZIE-2063)
YARN: Handle app-recovery failures gracefully (YARN-2010)
Impala: Memory leak with string functions (IMPALA-1397)
Impala: IllegalStateException when inserting results of a window function (IMPALA-1400)
Impala: Read errors with Parquet files (IMPALA-1401)
Impala: Regex functions don’t accept shorthand such as \d (IMPALA-1410)
Impala: Queries fail with metastore exception after upgrade and compute stats (IMPALA-1416)
Impala: Crashes due to bug in ClientCacheHelper (IMPALA-1445)
Cloudera Manager
Fixed metric collection for CDH 5.0 HDFS daemons.
Fixed OutOfMemory crashes on Thrift servers in Reports Manager and Event Server.
Replication commands respects JAVA_HOME if an override has been provided for it.
Fixed ZooKeeper connection leaks from HBase clients used by the Service Monitor.
For parcel-based installations, user home directories are created with umask 022 (instead of the user add default of 077)
A new health check has been added to indicate if HDFS rolling upgrade has not been finalized.
Cloudera Navigator
LDAP lookups in Active Directory to resolve group membership are now working.
Dropping a hive table and creating a view with same name or vice versa no longer raises an error.
HDFS extraction now works after upgrading CDH from 5.1 to 5.2
Setting a property in the Hue advanced configuration snippet no longer throws a "too many Boolean clauses" error in Navigator Metadata
We look forward to you trying it out using the information below:
Download Cloudera Enterprise from: http://www.cloudera.com/ content/support/en/downloads. html
View the documentation:
CDH 5 Release Notes
Cloudera Manager Release Notes
Cloudera Navigator Release Notes
Cloudera Documentation
As always, we are happy to hear your feedback. Please send your comments and suggestions to the user group or through our community forums. You can also file bugs through our external jira projects on issues.cloudera.org.
... View more
10-29-2014
10:04 AM
You can, yes. I encourage you to ask detailed questions in the HBase area. You could also evaluate Apache Phoenix as another SQL-over-HBase option (not currently supported by Cloudera though).
... View more
10-24-2014
09:43 AM
Regarding Impala vs Presto: As of 2.0, Impala supports a wide range of SQL operations (docs). So I would not automatically make that assumption.
And no, Impala does not query Cassandra -- which leads to the question, what is it about your use case that implies Cassandra over HBase, which offers very similar capabilities but also things like strong consistency, coprocessors, and the availability of a nice GUI (Hue), if you're into that?
... View more
10-23-2014
01:36 PM
Have you explored either Impala or Apache HBase for this use case?
... View more
10-14-2014
01:20 PM
We're pleased to announce the release of Cloudera Enterprise 5.2 (comprising CDH 5.2, Cloudera Manager 5.2, Cloudera Director 1.0, and Cloudera Navigator 2.1).
This release reflects our continuing investments in Cloudera Enterprise's main focus areas, including security, integration with the partner ecosystem, and support for the latest innovations in the open source platform (including Impala 2.0, its most significant release yet, and Apache Hive 0.13.1). It also includes a new product, Cloudera Director, that streamlines deployment and management of enterprise-grade Hadoop clusters in cloud environments; new component releases for building real-time applications; and new support for significant partner technologies like EMC Isilon. Furthermore, this release ships the first results of joint engineering with Intel, including WITH GRANT OPTION for Hive and Impala and performance optimizations for MapReduce.
Here are some of the highlights (incomplete; see the respective Release Notes for CDH, Cloudera Manager, and Cloudera Navigator for full lists of features and fixes):
Security
Via Apache Sentry (incubating) 1.4, GRANT and REVOKE statements in Impala and Hive can now include WITH GRANT OPTION, for delegation of granting and revoking privileges (joint work with Intel under Project Rhino).
Hue has a new Sentry UI that supports policy management for visually creating/editing roles in Sentry and permissions on Files in HDFS .
Kerberos authentication is now supported in Apache Accumulo.
Impala, authentication can now be done through a combination of Kerberos and LDAP.
Data Management and Governance
Cloudera Navigator 2.1 features a brand new auditing UI that is unified with lineage and discovery, so you now have access to all Navigator functionality from a single interface.
Navigator 2.1 includes role-based access control so you can restrict access to auditing, metadata and policy management capabilities
We’re also shipping a beta policy engine in Navigator 2.1. Targeted to GA by year-end, the policy engine allows you to set up rules and notifications so you can classify data as it arrives and integrate with data preparation and profiling tools. Try it out and let us know what you think!
And we’ve added lots of top-requested enhancements, such as Sentry auditing for Impala and integration with Hue.
Cloud Deployment
Cloudera Director is a simple and reliable way to deploy, scale, and manage Hadoop in the cloud (initially for AWS) in an enterprise-grade fashion. It’s free to download and use, and supported by default for Cloudera Enterprise customers. Features include:
Simple UI for self-service cluster spin up/teardown
Dynamic scaling for spiky workloads
Simple cloning of clusters
Cloud blueprints for repeatable deployments
Third-party software deployment within same workflow
Support for custom, workload-specific deployments
Support for complex cluster topologies
Minimum size cluster when capacity constrained
Multi-cluster dashboard
Instance tracking for account billing
Real-Time Architecture
Rebase on Apache HBase 0.98.6
Cell-level ACLs for fine-grained access control of data in HBase now supported
Backported improvements to get and put request scheduling and throttling that provide basic QoS for multi-tenant HBase tables and clusters. Lets some production and real-time workloads take priority over ad hoc and analytic jobs.
Backported patches that make Offheap Block Cache (aka bucket cache) production-ready. Now you can use large amounts of memory for read caching without the GC penalties of the past. Bucket cache is now the default.
Backported authentication of clients accessing HBase via the HBase Thrift Proxy.
Rebase on Apache Spark/Streaming 1.1
Rebase on Impala 2.0
Cloudera Search
now provides Spark-indexing - iterative, fast index design
distributed pivot facets
ability to expire documents
node fail recovery
support for deep paging and for multithreaded faceting
Apache Sqoop now supports import into Apache Parquet (incubating) file format
Apache Kafka integration with CDH is now incubating in Cloudera Labs; a Kafka-Cloudera Labs parcel (unsupported) is available for installation. Integration with Flume via special Source and Sink have been provided.
Impala 2.0
Disk-based query processing: enables large queries to "spill to disk" if their in-memory structures are larger than the currently available memory. (Note that this feature only uses disk for the portion that doesn't fit in the available memory.)
Greater SQL compatibility: SQL 2003 analytic (window) functions, support for legacy data types (such as CHAR and VARCHAR), better compliance with SQL standards (WHERE, EXISTS, IN), and additional vendor-specific SQL extensions.
Impala 2.0 is now also available for CDH 4.
New Open Source Releases and Certifications
Cloudera Enterprise 5.2 includes multiple new component releases:
Apache Avro 1.7.6
Apache Crunch 0.11
Apache Hadoop 2.5
Apache HBase 0.98.6
Apache Hive 0.13.1
Apache Parquet (incubating) 1.5 / Parquet-format 2.1.0
Apache Sentry (incubating) 1.4
Apache Spark 1.1
Apache Sqoop 1.4.5
Impala 2.0
Kite SDK 0.15.0
...with new certifications on:
Filesystems: EMC Isilon
OSs: Ubuntu 14.04 (Trusty)
Java: Oracle JDK1.7.0_67
Over the next few weeks, we’ll publish blog posts that cover some of these and other new features in detail. In the meantime:
Download Cloudera Enterprise 5.2
Explore documentation
As always, we value your feedback; please provide any comments and suggestions through our community forums. You can also file bugs via issues.cloudera.org.
... View more
10-10-2014
08:52 AM
We're pleased to announce the release of Kite SDK 0.17.0. This release updates the examples to CDH 5, defaults Parquet to the non-durable mode from 0.14 and prior, adds support for namespaces, and adds a kite-minicluster for easier development and integration testing against single-node Hadoop deployments. For more details see the release notes and the documentation.
... View more
09-25-2014
01:19 PM
Hello CDH and Impala Users,
We are pleased to announce the release of version 2.5.12 of the ODBC Driver for Apache Hive and version 2.5.20 of the ODBC driver for Impala. These versions contain bug fixes including one that affected the decimal data type. These drivers work for previous versions of HiveServer2 in CDH 4.1 or higher and Impala 1.0 or higher.
Getting started with the Cloudera ODBC Drivers:
Read the Cloudera ODBC 2.5 Driver for Impala release notes and installation guide
Read the Cloudera ODBC 2.5 Driver for Apache Hive release notes and installation guide
Download the connector from the Cloudera Connectors page
As always, we are happy to hear your feedback. Please send your comments and suggestions to cdh-user@cloudera.org or post to our new Community Forums.
Kind regards,
The Cloudera Team
... View more
09-23-2014
02:45 PM
Dear CDH and Cloudera Manager users,
We are pleased to announce the release of Cloudera Enterprise 5.1.3.
Cloudera Enterprise 5.1.3
This release is focused on fixing key bugs and includes the following.
CDH Fixes
HADOOP-11035 - distcp on mr1(branch-1) fails with NPE using a short relative source path.
HBASE-11349 - [Thrift] support authentication/impersonation
HBASE-11446 - Reduce the frequency of RNG calls in SecureWALCellCodec#EncryptedKvEncoder
HBASE-11457 - Increment HFile block encoding IVs accounting for ciper's internal use
HBASE-11474 - [Thrift2] support authentication/impersonation
HBASE-11565 - Stale connection could stay for a while
HBASE-11627 - RegionSplitter's rollingSplit terminated with "/ by zero", and the _balancedSplit file was not deleted properly
HBASE-11788 - hbase is not deleting the cell when a Put with a KeyValue, KeyValue.Type.Delete is submitted
HBASE-11828 - callers of SeverName.valueOf should use equals and not ==
HDFS-4257 - The ReplaceDatanodeOnFailure policies could have a forgiving option
HDFS-6776 - Using distcp to copy data between insecure and secure cluster via webdhfs doesn't work
HDFS-6908 - incorrect snapshot directory diff generated by snapshot deletion
HUE-2247 - [Impala] Support pass-through LDAP authentication
HUE-2273 - [desktop] Blacklisting apps with existing document will break home page
HUE-2295 - [librdbms] External oracle DB connection is broken due to a typo
HUE-2318 - [desktop] Documents shared with write group permissions are not editable
HIVE-5087 - Rename npath UDF to matchpath
HIVE-6820 - HiveServer(2) ignores HIVE_OPTS
HIVE-7635 - Query having same aggregate functions but different case throws IndexOutOfBoundsException
IMPALA-958 - Excessively long query plan serialization time in FE when querying huge tables
IMPALA-1091 - Improve TScanRangeLocation struct and associated code
OOZIE-1989 - NPE during a rerun with forks
YARN-1458 - FairScheduler: Zero weight can lead to livelock
Cloudera Manager
Adding and upgrading hosts allows users to skip installing default JDK that ships with Cloudera Manager.
Improved speed and heap usage when deleting hosts on cluster with long history.
When there are multiple clusters, each cluster's topology files and validation for legal topology is limited to hosts in that cluster. Most commands will now fail up front if the cluster's topology is invalid.
For users using Oracle databases, the size of the statement cache has been reduced, to help with memory consumption.
Improvements to memory usage of "cluster diagnostics collection" for large clusters.
Cloudera Navigator
HBase auditing initialization failure can prevent region opening indefinitely.
We look forward to you trying it out using the information below:
Download Cloudera Enterprise from: http://www.cloudera.com/content/support/en/downloads.html
View the documentation:
CDH 5 Release Notes
CDH 5 Documentation
Cloudera Manager Release Notes
Cloudera Manager 5 Documentation
Cloudera Navigator Documentation
As always, we are happy to hear your feedback. Please send your comments and suggestions to the user group or through our community forums. You can also file bugs through our external jira projects on issues.cloudera.org.
... View more