Data Science In Production Episode 3: Version control for Data Science
Hello Productionaneers (that might need work).
Is this podcast for you? Is all you code in source control? No? I thought not, this is for you. In this podcast I take a look at Version control / Source control for your data science projects. I investigate the various options you have, which artifacts to version control, how to version notebooks, data and projects. I discuss more about DataOps and how it is reliant on a solid version control.
There were a lot of topics mentioned on this episode.
Azure DevOps - Azure DevOps is a Cloud DevOps tool which is free to use for up to 5 people. Byond that you need to start paying for it. You can get started here: https://azure.microsoft.com/en-gb/services/devops/
Git / Git Tools - https://git-scm.com/
Bit Bucket - https://bitbucket.org/
MLFlow.org - https://mlflow.org/
DVC - https://dvc.org/
Thanks for listening. If you’re enjoying the podcast, please like and subscribe.