In this series, I will showcase how to harness the true power of Cloudera Data Platform (CDP) Hybrid cloud capabilities. Throughout the series you will learn how to use CDP Private Cloud Base, Replication Manager, CDP Public Cloud, Nifi, Kafka on data hub, Cloudera Data Warehouse, and Cloudera Viz.
Reminder: CDP Vision
CDP is designed to seamlessly enable you to deploy any data workloads (data collection, streaming, enrichment, engineering, serving, and AI/ML), on any infrastructure, with the latest engines while maintaining a coherent layer of security and governance (SDX).
Case Study: Worldwide Bank
For the purpose of this article, I will use an example of a fake bank (Worldwide Bank).
Worldwide Bank is a large international bank that leverages a traditional big data architecture on-premises (CDP PvC Base) for data engineering and data warehousing over petabytes of data.
With COVID-19 taking the world through unprecedented times, competition is at its highest, accelerating its data organization through their adoption of the latest technologies and architectures, especially cloud infrastructures.
Their first use case on this new technology platform is to create a visual report assessing the risk of every one of its branches as the virus spreads.
The implementation of this first use case has the following critical considerations:
Speed of implementation/cloud adoption
Maintenance of data privacy/security standards
Re-use of current team skillset (i.e. portability)
Implementation Architecture
After carefully considering options, the bank selected CDP as their hybrid architecture as it satisfies all their needs. Specifically, here is their implementation design:
This article series will guide you through these four steps: