Member since 02-19-2018
99 Posts
29 Kudos Received
32 Solutions
My Accepted Solutions
| Views | Posted |
|---|---|
| 1799 | 07-28-2020 07:46 AM |
| 1657 | 07-28-2020 07:45 AM |
| 3067 | 06-23-2020 11:15 PM |
| 4138 | 06-23-2020 11:12 PM |
| 2331 | 05-25-2020 02:41 AM |
02-21-2020
06:14 AM
Hi @jhcruvinel Cloudera has updated its licensing policy. Yes, we are an open-source company, as outlined in Our Commitment to Open Source Software. However, the Cloudera Licensing Policy FAQs state that, going forward, you need a subscription agreement with Cloudera to access distributions of our software: "Starting September 2019, a subscription agreement will be required to access supported Cloudera products and maintenance releases." This page describes what is required (i.e. a username and password supplied by Cloudera) to download Apache Ambari 2.7.5.0: https://docs.cloudera.com/HDPDocuments/Ambari-2.7.5.0/bk_ambari-installation/content/ch03s02s01.html Regards, Steve
02-19-2020
12:38 AM
Hi @pky Yes, migrating existing workloads from a CDH cluster to CDP in the public cloud is supported. You can use the Cloudera Replication Manager to help you move data and metadata from CDH to CDP public cloud: "Replication Manager is a service for copying and migrating data between environments within the enterprise data cloud. It is a simple, easy-to-use, and feature-rich data movement capability to move existing data and metadata to the cloud to fuel new workloads." If you only want to move specific workloads from your on-premises CDH cluster to CDP public cloud, you can use Cloudera Workload Manager to analyze those on-premises workloads and then build a replication policy to migrate them to the cloud. To help you define a migration strategy from CDH to CDP, my advice would be to reach out to your Cloudera account team. One of our Solution Engineers would be happy to give you some guidance on this topic. Regards, Steve
02-18-2020
11:06 AM
2 Kudos
Hi @MaxPeters Thanks for the interesting question on the cdswctl CLI. Currently, it is not possible to use the cdswctl CLI to create a connection to an existing session. The ability to do this is a feature request for a future version. Regards, Steve
02-17-2020
03:24 AM
1 Kudo
Hi @sharathkumar13 Cloudera Manager comes with an API. Here is a link to the Cloudera Manager API guide: http://cloudera.github.io/cm_api/docs/quick-start/ Does that help? Steve
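As a rough illustration of what calling the Cloudera Manager REST API can look like, here is a minimal Python sketch that uses only the standard library. The host name, the API version (`v19`), and the helper names are assumptions made for this example and are not taken from the post; check `/api/version` on your own server to find the version it supports.

```python
import base64
import json
import urllib.request

# Assumptions (not from the post): host, API version, and helper names are placeholders.
CM_HOST = "cm-host.example.com"   # hypothetical Cloudera Manager host
CM_PORT = 7180                    # default Cloudera Manager HTTP port
API_VERSION = "v19"               # assumed; confirm with GET /api/version on your server


def build_url(path, host=CM_HOST, port=CM_PORT, version=API_VERSION):
    """Build a REST endpoint URL such as http://host:7180/api/v19/clusters."""
    return f"http://{host}:{port}/api/{version}/{path.lstrip('/')}"


def basic_auth_header(username, password):
    """Return the value of an HTTP Basic Authorization header."""
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    return f"Basic {token}"


def list_clusters(username, password):
    """Fetch the clusters known to this Cloudera Manager instance."""
    request = urllib.request.Request(build_url("clusters"))
    request.add_header("Authorization", basic_auth_header(username, password))
    with urllib.request.urlopen(request, timeout=30) as response:
        # List endpoints wrap their results in an "items" array.
        return json.load(response)["items"]
```

Calling `list_clusters("your-user", "your-password")` against a reachable Cloudera Manager server should return a list of cluster descriptions; the same pattern applies to the other endpoints described in the API guide.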
02-14-2020
01:26 AM
1 Kudo
Hi @srik123 Apache Druid is not part of the CDH distribution and is not supported by Cloudera on the CDH platform. You can see the underlying Apache projects that are included and supported by Cloudera (in CDH) here: https://docs.cloudera.com/documentation/enterprise/6/release-notes/topics/rg_cdh_63_packaging.html#cdh_630_packaging As with the answer given to one of your other questions here, you can install and configure third-party components on top of CDH, but they won't be supported by Cloudera. Here is an example of someone who has done this previously with CDH and Apache Druid: https://github.com/knoguchi/cm-druid Regards, Steve
02-11-2020
05:53 AM
Hi @CaptainJa Are you able to upgrade your environment to CDH 6? In CDH 6 Tomcat is replaced by Jetty. Please take a look at this post: https://blog.cloudera.com/third-party-libraries-in-c6/ Regards, Steve
02-05-2020
04:34 AM
Hi @ahmedalsaidi Apache NiFi does not have a generic CDC processor as such. One way to achieve a CDC-like approach is to use the QueryDatabaseTable processor. Please take a look at this article on how to use the QueryDatabaseTable processor to do an incremental fetch of new rows in the source database: https://community.cloudera.com/t5/Community-Articles/Incremental-Fetch-in-NiFi-with-QueryDatabaseTable/ta-p/247073 Alternatively, you can use a vendor that specializes in CDC. Please accept this answer as a solution if it helps you. Steve
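To make the incremental-fetch idea concrete, here is a small Python sketch of the general technique: remember the largest value seen in a tracking column and, on each run, ask only for rows above it. This is an illustration of the concept, not NiFi's own implementation; the `orders` table, the `fetch_new_rows` helper, and the in-memory SQLite database are all made up for this example.

```python
import sqlite3


def fetch_new_rows(conn, table, max_value_column, last_max):
    """Return rows whose tracking column exceeds last_max, plus the new maximum."""
    # Identifiers cannot be bound as SQL parameters; this sketch assumes trusted names.
    query = (
        f"SELECT * FROM {table} "
        f"WHERE {max_value_column} > ? ORDER BY {max_value_column}"
    )
    cursor = conn.execute(query, (last_max,))
    columns = [description[0] for description in cursor.description]
    rows = cursor.fetchall()
    if not rows:
        # Nothing new: keep the stored state unchanged.
        return rows, last_max
    tracking_index = columns.index(max_value_column)
    return rows, rows[-1][tracking_index]


# Hypothetical source table held in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, item TEXT)")
conn.executemany("INSERT INTO orders (id, item) VALUES (?, ?)", [(1, "a"), (2, "b")])

# First run: no state yet, so start from 0 and pick up everything.
first_batch, state = fetch_new_rows(conn, "orders", "id", 0)

# A new row arrives; the second run returns only that row.
conn.execute("INSERT INTO orders (id, item) VALUES (3, 'c')")
second_batch, state = fetch_new_rows(conn, "orders", "id", state)
```

Note that an approach like this only sees rows whose tracking column grows, so deletes in the source table go unnoticed, which is one reason a dedicated CDC tool may still be the better fit.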
02-05-2020
02:32 AM
2 Kudos
Hi @carlaurrea The Cloudera Data Platform (CDP) has a number of form factors, including a PaaS model in the cloud and a data center edition for on-premises deployments. I'm going to assume that your HDP 2.6.5 deployment is on-premises. The Cloudera Data Platform Data Center edition (CDP-DC) is already available for download and installation: https://www.cloudera.com/downloads.html However, it is not possible right now to do an in-place upgrade with this version, but we are planning to support this shortly with the release of CDP-DC 7.1. The plan is that when CDP-DC 7.1 is released you will be able to perform an in-place upgrade from HDP 2.6.5. Regards, Steve
02-04-2020
04:11 AM
Hi @piosobc, The CDH pricing is under the 'Enterprise Data Hub' column on this page: https://www.cloudera.com/products/pricing.html So yes, it's the same as CDP. But as @cjervis mentioned in an earlier post, it is best to discuss this with someone in our sales organization via the Contact Us page on Cloudera.com. I think Cloudera Express will work with more than 100 nodes for CDH / CM 6.0, as this limit was introduced in 6.1. However, I don't recommend this approach, as you would surely benefit from the enterprise-class features on a cluster of more than 100 nodes. Also, as I mentioned in my earlier post, Cloudera is moving away from shipping Cloudera Express editions. Your best option is to consider a supported version of CDH / CDP. Regards, Steve
02-03-2020
02:31 AM
2 Kudos
In part 1 of this series we talked about the growing relevance of streaming technologies and covered the need for existing Cloudera customers who currently use Apache Flume to consider moving over to Cloudera DataFlow (CDF).
Cloudera DataFlow is an umbrella term that covers the streaming technologies from Cloudera. CDF is supported on CDH 5 / CDH 6 and HDP 2 / HDP 3, so there is nothing stopping customers from adopting Cloudera DataFlow right now and being in a supported configuration when they upgrade to the new Cloudera Data Platform (CDP).
CDF includes the technology to address a number of areas:
- Edge Flow Management
- Core Flow Management
- Stream Processing
- Streaming Analytics
- Enterprise Services
A good summary of these components can be found in the blog post Introducing Cloudera Data Flow.
Cloudera DataFlow - Data-In-Motion Platform
If you are a traditional Cloudera customer using the Cloudera Distribution of Apache Kafka, there are a number of new and exciting management technologies available via CDF. For example, the Cloudera Streams Management component includes:
- Cloudera Streams Messaging Manager, which provides a visual and interactive user interface for managing topics in Apache Kafka.
- Cloudera Streams Replication Manager, for managing replication between Kafka clusters, based on MirrorMaker 2.
However, Apache Flume has been replaced in CDF by Apache NiFi and MiNiFi. There are a number of benefits of using Apache NiFi / MiNiFi over Apache Flume:
- It is very simple to use, with an intuitive user interface. A drag-and-drop approach to designing data pipelines improves user productivity compared with writing many lines of code and configuration files.
- There are 290+ pre-built processors for data source connectivity, ingestion, transformation, and content routing.
- NiFi supports NiFi Registry for version-controlling dataflows, and supports the software development lifecycle (SDLC) when promoting flows from one environment to another, e.g. development to production.
- Point-in-time capability: you can go back to a previous point in time, inspect the data as it was at that point, and replay it downstream.
- Scale-out architecture: adding more nodes increases the network and disk bandwidth for ingestion and transformation.
- Data lineage and provenance are built-in features of Apache NiFi, with graphical information and metrics that describe data on its journey from source to target.
- Cloudera Edge Management (CEM) provides a management user interface for deploying and managing MiNiFi agents on edge devices.
Continuous data delivery, streaming applications and real-time analysis are becoming increasingly important and more widely adopted as part of a data architecture strategy. However, so is the need to comply with data regulation and protection laws such as GDPR in the EU and CCPA in California. This is why technologies such as Apache NiFi, with graphical data pipelines and built-in support for data lineage and provenance, provide a strong framework for working towards regulatory compliance.
One of the reasons that customers adopt Cloudera technology is the portfolio of technology that we offer, all under a governed, secure and integrated data and analytics platform. This means that we can integrate and build different streaming applications to address a variety of use cases. For example, Cloudera supports Apache HBase and Apache Kudu as backend storage for real-time applications. In addition, Cloudera Machine Learning means that we can build predictive models, then manage and deploy them into streaming applications. This is why we describe Cloudera as an end-to-end Edge2AI platform.