Blogs

Welcome to the Advancing Analytics blog. We’re a bunch of data and AI specialists who build modern platforms and make messy data behave.

You'll find a trove of posts and videos here, focused on AI and machine learning, data engineering, and analytics. Our blogs are all based on what we’re seeing in the real world, with lessons learned from the sharp end of delivery.

Recent Blogs

Design an Azure Data Platform that InfoSec will love – Azure...
In the first post of this series, I set out the foundation of our secure Data Lakehouse. It...
Design an Azure Data Platform that InfoSec will love
Reference architectures are great! You've got all of the key components in there, nice and...
Writing a Single JSON File in Databricks
When writing to a JSON destination using the DataFrameWriter the dataset is split into...
Azure Synapse Serverless Lake Access Patterns
I've been working with a client to build a large Data Lakehouse platform predominantly using...
Why Data Quality is Important
Data is among the most valuable assets for any organisation. Without data, the ability to make...
Using GPT-3 Model to Generate Romantic Comedy Movie Summaries
Valentine's Day is approaching and what better way to celebrate than to play around with AI...
Querying Movie Data on Wikipedia using DBpedia and SPARQL
Getting data from Wikipedia is no easy feat. At first glance, it seems like the easiest way...
Advancing Analytics at SQLBits 2022
SQLBits 2022 is just around the corner, taking place between March 8th and 12th at the ExCel...
Recommender Systems — Using Snorkel for Relevance Labelling
Introduction to Snorkel: Created in Stanford in 2016, Snorkel is a system for programmatically...
On-Premise Self Hosted Integration Runtime for Azure Data Factory:...
Introduction: In this post I will discuss the solution to a problem that plagued me on a client...