loader
AA_Collage_02 (2)

Blogs

Thanks for reading. Here you will find a huge range of information in text, audio and video on topics such as Data Science, Data Engineering, Machine Learning Engineering, DataOps and much more. The show notes for “Data Science in Production” are also collated here.

pattern
Vector (44)
border-traingle

Recent Blogs

Design an Azure Data Platform that InfoSec will love - Azure Storage
This is the third in a series where I look at all of the resources common to a Data Lakehouse...
Design an Azure Data Platform that InfoSec will love – Azure...
In the first post of this series, I set out the foundation of our secure Data Lakehouse. It...
Design an Azure Data Platform that InfoSec will love
Reference architectures are great! You've got all of the key components in there, nice and...
Writing a Single JSON File in Databricks
When writing to a JSON destination using the DataFrameWriter the dataset is split into...
Azure Synapse Serverless Lake Access Patterns
I've been working with a client to build a large Data Lakehouse platform predominantly using...
Why Data Quality is Important
Data is among the most valuable assets for any organisation. Without data, the ability to make...
Using GPT-3 Model to Generate Romantic Comedy Movie Summaries
Valentines Day is approaching and what better way to celebrate than to play around with AI...
Querying Movie Data on Wikipedia using DBpedia and SPARQL
Getting data from Wikipedia is no easy feat. At first glance, it seems like the easiest way...
Advancing Analytics at SQLBits 2022
SQLBits 2022 is just around the corner, taking place between March 8th and 12th at the ExCel...
Recommender Systems — Using Snorkel for Relevance Labelling
Introduction to Snorkel Created in Stanford in 2016, Snorkel is a system for programmatically...
Vector (45)
Vector (46) dise