Why data scientists love and hate notebooks: velocity and validation
Skills:
ML Pipelines80%
Key Takeaways
The video discusses the importance of velocity, validation, and versioning in operationalizing and deploying ML models, with a focus on the trade-offs between these aspects and how they relate to the use of notebooks such as Jupyter Notebooks.
Full Transcript
totally so this is this is great now we're getting back to the three most important aspects um that determine the success of operationalizing and deploying ml velocity validation and versioning you make it very clear that I'm going to actually quote you um High Velocity um means creating many versions in other words having high velocity means drowning in a sea of versions of experiments right so I'm envisaging there some Pareto front where there's attention and trade-off between velocity and uh versioning but then you also mentioned there are synergies between velocity and validating early so if ideas can be invalidated in earlier stages of deployment and overall velocity um is increased one more thing you mentioned is creating similar development and production environments exposes attention between velocity and validating so the development Cycles are more experimental move faster than production Cycles however if the development environment is significantly different from prod it's hard to validate ideas early so we have this oh there's some sort of triangle with synergies as well so I've mentioned a few but can can you just speak a bit more to the relationships and correlations and causations between these three incredibly important things yeah uh uh um I think the the knife the anecdote there is on Jupiter notebooks um for a long time we had all been seeing each other some people so excited a bunch of different notebooks some people absolutely hate to Bruno books everybody has strong opinions everybody wants to give a monologue on their opinions on Jupiter notebooks and I think for me it's been like so many years of hearing this kind of over and over again why people love it or hate it and I wanted to know why it was so polarizing um and it was very satisfying to me to hear this uh or to kind of frame it as this um kind of where do people lie on the velocity and validating Spectrum some people want to move fast and break things in the Facebook speak and they're okay with that if they can fix it some people do not want to move fast they want to make sure that there's no buggy models they want to make sure that everybody can review each other's work yes that hinders velocity um and kind of like it's really hard right some some people want Jupiter notebooks because they can go fast some people don't want data scientists to have they don't want data scientists to go too fast because then maybe certain scientific principles are disregarded maybe things are irreproducible I don't really know but it was nice to frame it this way because there's no right answer right it's like where do you personally lie on the Spectrum and when you run a company or you run your team like what is the ethos that you want to create um or like where do you guys want to lie um amazing that was interesting to me this is incredibly useful I I mean this framework so what I'm hearing is in this framework of velocity validation um and and versioning we can look at people who uh prefer Jupiter notebooks and in this framework they're essentially prioritizing velocity whereas people who are strongly opinionated against Jupiter notebooks uh prioritizing validation and versioning or mostly validation or how do you how do you think mostly validation um I think of it as validation more because it's like how do you uh what like how do you make sure the development and production environments are as similar as possible um whenever they're there there's a discrepancy there is chance for books um so you can remove the need to validate a lot when promoting from Dev to prod if there is no real like environment change from Death to product one great example is like sometimes people will iterate locally and then deploy to the prod service in Cloud that it's a huge environment mismatch so you need to do some sort of big validation I don't even know I don't think people have solved this problem of like making sure there aren't bugs in business and as much of environments um but again right like why why is it so separate like a completely different Hardware not even in the same like Cloud um crazy I I think this also exposed to me that like I am now feeling somewhat opinionated it's not the corrective there's no correct opinion right it's like what is the opinion you want to hold and you want to prescribe for the team that you are running yeah absolutely and a question around notebooks for example is if they don't have affordances which we'd like them to have maybe as tall Builders we build them into them as well right it isn't yes of course how do we want to build the future as well right yeah yeah as you point out
Original Description
A clip from our fireside chat "Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners" with Shreya Shankar. You can find the full conversation here: https://youtu.be/7zB6ESFto_U
Find out more about how we think about MLOps, OSS, and human-centric data science tools here: https://outerbounds.com/
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Playlist UU5h8Ji6Lm1RyAZopnCpDq7Q · Outerbounds · 18 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
▶
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Metaflow GUI for monitoring machine learning workflows
Outerbounds
Metaflow Cards [no sound]
Outerbounds
Fireside chat #1: How to Produce Sustainable Business Value with Machine Learning
Outerbounds
Fireside chat #2: MadeWithML.com -- Teaching Practical Machine Learning
Outerbounds
Metaflow on Kubernetes and Argo Workflows [no sound]
Outerbounds
Fireside chat #3: Reasonable Scale Machine Learning -- You're not Google and it's totally OK
Outerbounds
Metaflow Tags: Programmatic Tagging
Outerbounds
Metaflow Tags: Basic Tagging
Outerbounds
Metaflow Tags: Tags in CI/CD
Outerbounds
Metaflow Tags: Tags and Namespaces
Outerbounds
Metaflow Tags: Tags and Continuous Training
Outerbounds
Fireside chat #4: Machine Learning and User Experience -- Building ML Products for People
Outerbounds
Fireside Chat #5: Machine Learning + Infrastructure for Humans
Outerbounds
Metaflow Sandbox Demo: Free Data Science Infrastructure In the Browser
Outerbounds
Metaflow on Azure
Outerbounds
Fireside Chat #6: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners
Outerbounds
ML engineering vs traditional software engineering: similarities and differences
Outerbounds
Why data scientists love and hate notebooks: velocity and validation
Outerbounds
What even is a 10x ML engineer?
Outerbounds
The 4 main tasks in the production ML lifecycle
Outerbounds
Is the premise of data-centric AI flawed?
Outerbounds
The 3 factors that Determine the success of ML projects
Outerbounds
Fireside Chat #7: How to Build an Enterprise Machine Learning Platform from Scratch
Outerbounds
Run Metaflow on any cloud: Google Cloud, Azure, or AWS [no sound]
Outerbounds
Metaflow on GCP
Outerbounds
Fireside Chat #8: Navigating the Full Stack of Machine Learning
Outerbounds
How to Build a Full-Stack Recommender System
Outerbounds
Modernize your Airflow deployments with Metaflow - zero-cost migration [no sound]
Outerbounds
Easy Airflow DAGs for ML and data science with Metaflow [no sound]
Outerbounds
Fireside chat #9: Language Processing: From Prototype to Production
Outerbounds
How to build end-to-end recommender systems at reasonable scale
Outerbounds
Full-Stack Machine Learning with Metaflow on CoRise
Outerbounds
Natural Language Processing meets MLOps
Outerbounds
Fireside Chat #10: Large Language Models: Beyond Proofs of Concept
Outerbounds
What even are Large Language Models?
Outerbounds
How to get started with LLMs today
Outerbounds
LLMs in production
Outerbounds
Accessing secrets securely in Metaflow [no audio]
Outerbounds
Fireside Chat #11: The Open-Source Modern Data Stack
Outerbounds
Fireside chat #12: Kubernetes for Data Scientists
Outerbounds
Behind the Screen: How Amazon Prime Video ships RecSys models 4x faster
Outerbounds
Fireside chat #13: Supply Chain Security in Machine Learning
Outerbounds
Quick Delivery, Quicker ML: DeliveryHero's Metaflow Story
Outerbounds
Crafting General Intelligence: LLM Fine-tuning with Metaflow at Adept.ai
Outerbounds
Fuelling Decisions: How DTN Powers Gas Pricing and Data Science Collaboration
Outerbounds
From Kitchen to Doorstep: Optimizing Data Science Velocity at Deliveroo
Outerbounds
Building a GenAI Ready ML Platform with Metaflow at Autodesk
Outerbounds
Media Transcoding for 10 Million users and beyond with Metaflow at Epignosis
Outerbounds
Telematics with Metaflow: How Nirvana Insurance built a large-scale Risk Estimation platform
Outerbounds
Fireside chat #14: Generative AI and Machine Learning for Film, TV, and Gaming
Outerbounds
The Past, Present, and Future of Generative AI
Outerbounds
Building Production Systems with Generative AI, Machine Learning, and Data
Outerbounds
A Custom Fine-Tuned LLM in Action (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 5)
Outerbounds
Building Live Production Systems with RAG (LLMs & RAG: An Interactive Guided Tour Part 4)
Outerbounds
Better Relevancy with RAG (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 3)
Outerbounds
Working with OSS LLMs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 2)
Outerbounds
Hitting OpenAI and Other Vendor APIs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 1)
Outerbounds
Production Systems with Generative AI (LLMs, RAG, & Fine-Tuning: An Interactive Guided Tour Part 0)
Outerbounds
LLMs in Practice: A Guide to Recent Trends and Techniques
Outerbounds
Metaflow for distributed high-performance computing and large-scale AI training
Outerbounds
More on: ML Pipelines
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
DevOps Took 10 Years to Mature.
Medium · DevOps
Praesto: A Kubernetes Operator for Node-Local ML Model Caching with CSI
Medium · DevOps
Beyond `ollama run`: Production-Ready DeepSeek R1 Deployment with vLLM and Nginx
Dev.to · Shannon Dias
MCP Health Check: Building Production Monitoring for Your MCP Server — What I Learned After 84 Production Outages
Dev.to AI
🎓
Tutor Explanation
DeepCamp AI