Member since 02-19-2018
99 Posts
29 Kudos Received
32 Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1109 | 07-28-2020 07:46 AM |
| | 1014 | 07-28-2020 07:45 AM |
| | 1960 | 06-23-2020 11:15 PM |
| | 3064 | 06-23-2020 11:12 PM |
| | 1426 | 05-25-2020 02:41 AM |
04-19-2020 11:25 PM
Hi @denys_tyshetsky , I don't understand your question. You want to trial CDP in an AWS public cloud environment, but the fact that the CDP Management Console is in the public cloud is an issue? Regards, Steve
04-19-2020 11:22 PM
Hi @muslihuddin , The cluster templates are only available in the CDP public cloud form factor at the moment. So for CDP-DC you can install NiFi using a parcel / CSD as you say. It's pretty easy to do. Regards, Steve
04-19-2020 11:19 PM
Hi @ebeb , Thanks for your question. There are a number of ways to get to CDP. If the public cloud is an option for you then I strongly recommend you explore doing that because there are so many advantages to this approach. If you are staying on-premises then you can either build a new CDP-DC cluster and migrate your data and content to it using the inbuilt tools, or you can do an upgrade in place of your existing CDH cluster. To do the upgrade in place, CDH needs to be at 5.13 or above. The upgrade in place approach will be available when we release CDP-DC 7.1. Regards, Steve
04-17-2020 05:15 AM
Hi @muslihuddin , Currently, the CDP Management Console is only available in the public cloud. However, we are also planning to launch a CDP Private Cloud edition which would run on-premises including the Management Console. If you are looking to run CDP on-premises today, you can do that with the CDP Data Center (DC) edition. CDP-DC is managed by Cloudera Manager and does not use the Management Console. I hope that helps. Steve
04-16-2020 01:20 PM
Hi @BaluSaiD , Cloudera Manager does not support SQL Server as a database for its backend services. The databases that are supported are listed here: https://docs.cloudera.com/documentation/enterprise/6/release-notes/topics/rg_database_requirements.html#cdh_cm_supported_db Regards, Steve
04-16-2020 10:02 AM
Hi @Ashik ,

A good rule of thumb for the amount of HDFS storage required is 4 x the raw data volume. HDFS replicates data three times by default, and the system also needs some headroom, which is why it is 4 x rather than 3 x. This formula is just a rough guide and can change, for example if you compress the data on HDFS. You also need to factor any other data processing that you might do into this calculation. For example, if you build data marts on top of the raw data, that is additional data volume, and then you have organic data growth over a period of time.

Regarding cluster topology, there are some guidelines here: https://docs.cloudera.com/documentation/enterprise/5/latest/topics/cm_ig_host_allocations.html

Regarding best practice for cluster sizing, those are here: https://docs.cloudera.com/documentation/other/reference-architecture/topics/ra_bare_metal_deployment.html#concept_lzl_vkl_f2b

Regarding hardware recommendations, those are given here: https://docs.cloudera.com/documentation/enterprise/release-notes/topics/hardware_requirements_guide.html

I would recommend that you have:

3 x Master Nodes (for high availability)
N x Data Nodes (where N depends on the storage you need and the storage capacity of each data node). You need a minimum of N=3 for triple replication of data and I would recommend N >= 5 for a production system. The more data nodes you have, and the more disks there are in each of those data nodes, the higher the performance of your system will be because of the distributed throughput and higher disk I/O.
1 x Utility Node / Management Node
1 x Gateway Node

Regards, Steve
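
The rule of thumb above can be sketched as a quick calculation. This is only an illustration, not a Cloudera sizing tool: the function names, the usable capacity per data node, and the example volumes are all assumptions made for the sketch, and the 4 x multiplier should be adjusted for compression, derived data sets, and expected growth.

```python
import math


def required_hdfs_storage_tb(raw_data_tb: float, multiplier: float = 4.0) -> float:
    """Rough HDFS capacity estimate: raw volume times the rule-of-thumb
    multiplier (3 x for replication plus roughly 1 x headroom)."""
    return raw_data_tb * multiplier


def data_nodes_needed(
    raw_data_tb: float,
    usable_tb_per_node: float,   # hypothetical: usable disk per data node
    multiplier: float = 4.0,
    minimum_nodes: int = 5,      # N >= 5 suggested for production
) -> int:
    """Number of data nodes needed to hold the estimated capacity,
    never going below the recommended minimum."""
    required_tb = required_hdfs_storage_tb(raw_data_tb, multiplier)
    return max(minimum_nodes, math.ceil(required_tb / usable_tb_per_node))


if __name__ == "__main__":
    # Example with made-up numbers: 100 TB of raw data, 48 TB usable per node.
    print(required_hdfs_storage_tb(100))   # 400.0
    print(data_nodes_needed(100, 48))      # 9
```

For small data volumes the node count is driven by the minimum rather than by capacity, so a 10 TB data set on the same hypothetical hardware would still come out at 5 data nodes.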
04-16-2020 09:45 AM
In your example you cannot just give me a Cloudera license because I would not have a commercial relationship with Cloudera - I wouldn't be able to raise support tickets for example because Cloudera would have no record of me. You need to speak to your Cloudera sales account team to discuss your situation. Steve
04-16-2020 04:13 AM
1 Kudo
Hi @Cloudsupport , Thanks for clarifying the situation. If you are happy to message me privately via the Cloud Community messaging capability and tell me who company X and company Y are, I will see if I can get someone to help you. Regards, Steve
04-16-2020 02:20 AM
Hi @Cloudsupport , I don't completely follow the scenario that you described - are you saying the support department is handing over to another department? What do you mean by 'organization' - another company? That said, to discuss the Cloudera license and commercial arrangements you should reach out to the Cloudera account team for your organization. Regards, Steve
04-16-2020 12:42 AM
Hi @DataMike , I had a chat with some of my colleagues about this and it seems there is no easy way of stopping HDFS from starting when the cluster restarts. You might be able to do something via the Cloudera Manager API, but that is probably quite complicated. If it's any consolation, this is fixed in the next generation of technology from Cloudera, i.e. CDP. When you upgrade to CDP, Sentry is replaced with Ranger and this HDFS dependency for Kafka no longer exists. Regards, Steve