Press "Enter" to skip to content

Mini Datalake – 2 step mini data-lake (silver, bronze, -gold)

A now infamous pattern to build data pipelines is to split them into several stages. Depending on the quality of data, data engineer can decide how to classify the data to give data analyst a convenient catalogue to use. The classification is also helpful to provide fine grained control over the data, as well as meeting the compliance and government rules.

In this post we are going to strip down version of a datalake we call mini-lake. This provides an apparatus for those who wants to learn about datalakes and data-pipelines.

Be First to Comment

Leave a Reply

Your email address will not be published. Required fields are marked *

en_USEnglish