Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Data Lake Architecture

avatar
Rising Star

Hi all,

Can anyone advise me on how to organize data in my data lake? For instance, split data into categories, like Archived Data, that probably won't be used but it's needed, another division for raw data, and the last one for transformed data.

I'm using Hbase and Hive for now.

Thanks

1 ACCEPTED SOLUTION

avatar
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login
3 REPLIES 3

avatar

@Francisco Pires

Typically, for data warehousing, we recommend logically organizing your data into tiers for processing.

14051-screen-shot-2017-03-26-at-123125-pm.png

The physical organization is a little different for everyone, but here is an example for Hive:

14052-physical.jpg

avatar
Rising Star

thanks, this will help.

avatar
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login