Advancing Analytics
Data Science | AI | DataOps | Engineering


Data Science & Data Engineering blogs


Thanks for reading. Here you will find a huge range of information in text, audio and video on topics such as Data Science, Data Engineering, Machine Learning Engineering, DataOps and much more. The show notes for “Data Science in Production” are also collated here.

How do You Size Your Azure Databricks Clusters?

Every day this month we will be releasing a new video on Azure Databricks.

We have done the introductions, now we are getting in to the meat. Today we are tackling "How do You Size Your Azure Databricks Clusters?”.

In this video Simon takes you through how to size a cluster. We look at what happens when you take 3GB of data and cache it on a 2 node cluster. How do you see the distribution of data? How many partitions are there on each node?. How do we achieve workload isolation? All these questions are answered.

If you watch the video on YouTube, remember to Like and Subscribe, so you never miss a video.

Getting started with Azure Databricks is difficult and can be expensive. Making the wrong decisions early has a huge detrimental impact on the success of your project. We work with businesses to sure their analytics project succeed and we would love to help accelerate your project in to production. Click the link below to get in touch.


Advancing Analytics have developed a course based on the needs of our customers. It is designed to take a data professional from Zero to Hero in just 3 days. You will leave this course with all the skills you need to get started on your Big Data Journey.

Take a look at our Azure Databricks training for more details.