From the course: End-to-End Real-World Data Engineering Project with Databricks
Data lakehouse: High-level solution
- [Instructor] In order to get into our project, we need to make sure that we have at least a high-level understanding of the data lakehouse. A data lakehouse combines the best features of data lakes and data warehouses. It offers the flexibility and scalability of a data lake, and alongside that it offers the data management capabilities and ACID transactions of a data warehouse. So in short, you keep your data in the data lake, but you can perform all the operations that you can perform in a data warehouse. That makes it suitable for a variety of data workloads, and lets you work with the data either the SQL way or the ML way. Our solution for global retail has a three-layer architecture: the bronze layer for raw ingestion, the silver layer for clean and conformed data, and the gold layer for business-level aggregates. Let's take a little deep dive into each layer. The bronze layer is going to be our raw layer. Here, we are going to keep the data exactly as we received it. We are not going…
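As a rough illustration of the three layers described above, here is a minimal sketch in plain Python. It is not the course's Databricks implementation: the record fields (`order_id`, `store`, `amount`), the function names, and the specific cleaning rules (dropping rows with a missing amount, removing duplicate order IDs, normalizing store names) are all assumptions made up for this example.

```python
from collections import defaultdict

# Hypothetical raw records as they might arrive from a source system.
raw_orders = [
    {"order_id": "1", "store": " London ", "amount": "20.50"},
    {"order_id": "2", "store": "paris", "amount": "10.00"},
    {"order_id": "2", "store": "paris", "amount": "10.00"},  # duplicate row
    {"order_id": "3", "store": "London", "amount": None},    # missing amount
    {"order_id": "4", "store": "LONDON", "amount": "5.25"},
]


def to_bronze(records):
    """Bronze: keep the data exactly as received (copied, not modified)."""
    return [dict(r) for r in records]


def to_silver(bronze):
    """Silver: clean and conform (assumed rules: drop bad rows, dedupe, fix types)."""
    seen, silver = set(), []
    for r in bronze:
        if r["amount"] is None or r["order_id"] in seen:
            continue
        seen.add(r["order_id"])
        silver.append({
            "order_id": int(r["order_id"]),
            "store": r["store"].strip().title(),
            "amount": float(r["amount"]),
        })
    return silver


def to_gold(silver):
    """Gold: a business-level aggregate, here total sales per store."""
    totals = defaultdict(float)
    for r in silver:
        totals[r["store"]] += r["amount"]
    return dict(totals)


bronze = to_bronze(raw_orders)
silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)
```

Each function reads only from the layer before it, so the raw bronze copy stays untouched and the silver and gold layers can be rebuilt from it at any time.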