loader
AA_Collage_02 (2) pattents-img pattents-img

advancing-spark

Thanks for reading. Here you will find a huge range of information in text, audio and video on topics such as Data Science, Data Engineering, Machine Learning Engineering, DataOps and much more. The show notes for “Data Science in Production” are also collated here.

pattern
Vector (44)
border-traingle

Recent Blogs

Liquid Clustering 101 - How you should be storing & optimising your...
Liquid Clustering (LC) was designed to replace table partitioning and the ZORDER command, to...
Databricks Delta Cache and Spark Cache
As data sizes and demand increases as time goes on, you often see slowness on Databricks this...
Spark 3.0 Questions and answers from the Data AI Summit
At the Data + AI Summit, Simon delivered a session on “Achieving Lakehouse Models with Spark...
Identifying Data Outliers in Apache Spark 3.0
The secret to getting machine learning to work effectively is in ensuring that the data we are...
Will Koalas replace PySpark?
One of the first of many big announcements at the 2020 Spark and AI Summit was the official...
Vector (45)
Vector (46) dise