loader
AA_Collage_02 (2)

engineering

Thanks for reading. Here you will find a huge range of information in text, audio and video on topics such as Data Science, Data Engineering, Machine Learning Engineering, DataOps and much more. The show notes for “Data Science in Production” are also collated here.

pattern
Vector (44)
border-traingle

Recent Blogs

Azure Synapse Serverless Lake Access Patterns
I've been working with a client to build a large Data Lakehouse platform predominantly using...
DevOps for Databricks: Getting Started
As a consultant and Data Engineer I’ve lost count of the number of times I’ve been asked “how...
Databricks Execution Plans
The execution plans in Databricks allows you to understand how code will actually get executed...
Databricks Delta Cache and Spark Cache
As data sizes and demand increases as time goes on, you often see slowness on Databricks this...
Slowly Changing Dimensions (SCD Type 1) with Delta and Databricks
From Warehouse to Lakehouse Pt.1 SCD Type 1 in SQL and Python Introduction With the move to...
Terraform Databricks Labs
In late 2020, Databricks introduced Databricks Labs a collection of Terraform Providers that...
An Introduction to Serverless Computing and Function-as-a-Service
What is Serverless Computing? The term serverless computing can be misleading; how can you...
Azure Data Lake ACL Introduction
Introduction to ACL’s Access Control Lists (ACLs) offer low-level control of access to the...
Azure Synapse Analytics - Microsoft's Flagship Lakehouse Now in...
Today’s the day! There’s much buzz & excitement as we FINALLY get to see Azure Synapse...
Scala For Big Data Engineering – Why should you care?
The thought of learning Scala fills many with fear, its very name often causes feelings of...
Vector (45)
Vector (46) dise