loader
AA_Collage_02 (2)

engineering

Thanks for reading. Here you will find a huge range of information in text, audio and video on topics such as Data Science, Data Engineering, Machine Learning Engineering, DataOps and much more. The show notes for “Data Science in Production” are also collated here.

pattern
Vector (44)
border-traingle

Recent Blogs

Fake It Till You Make It: Generate Fake Data Quickly with Faker
Why Fake Data is Useful for Data Engineers One thing that is sometimes really difficult as a...
Liquid Clustering 101 - How you should be storing & optimising your...
Liquid Clustering (LC) was designed to replace table partitioning and the ZORDER command, to...
Up your CI/CD game with Databricks Asset Bundles and Automated Testing
Why should I care about automated testing? Automated testing is one of the most impactful...
When is an Aggregate not an Aggregate?
I am irked. As a father of two high school aged children, this is not uncommon, but for once...
Using Databricks SQL in VSCode
Recently, I had the opportunity to explore the Databricks SQL extension for VSCode, and I was...
Introduction to Databricks SQL
Databricks SQL is a powerful tool for querying and analysing data in Databricks Lakehouse. It...
Has the Data Engineer replaced the Business Intelligence Developer?
It seems these days that every person I talk to is either a scientist, engineer or architect,...
Tips for the Databricks Certified Associate Developer for Apache...
After diving in to (and passing!) the Associate Developer for Apache Spark 3.0 exam...
Tips for the Databricks Certified Associate Developer for Apache...
Following on from my previous post I wanted to cover off some more key topics that can really...
Writing a Single JSON File in Databricks
When writing to a JSON destination using the DataFrameWriter the dataset is split into...
Vector (45)
Vector (46) dise