I have a background in traditional data warehousing (SQL, 3NF Inmon, Star schema Kimball). I used to design data models then create it in a SQL RDMS and then populate it with ETL data flows. Paying a particular attention to slowly changing dimensions.
With Cloudera, how to build the equivalent of a data warehouse? and which feeding treatments? What are the components to use?
Thanks for being a part of the Cloudera community,
You can leverage the above options provided by Cloudera for the Public and Private clouds.
Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future.
Thanks for the reply. It looks to me interesting but does not tell about best practices of architecture and data modeling. For example, it would be great to add a diagram in your documentation illustrating architecture: components used from data ingestion to data consumption with related technologies.
If you'd like more details about data model design and implementing them using Cloudera Data Warehouse over and above the documentation @shehbazk provided links to, I'd recommend you watch the recording of a webinar Cloudera held on the topic quite some time ago, Data Modeling for Hadoop and the subsequent blog post Common Questions on Data Modeling for Big Data.
@Christ Thanks for your feedback on the documentation, I will highlight this to our document team. About your query, we need to open a support ticket because it requires the involvement of our product team.
Please let us know if you have any other queries.
@Christ, Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future.