Why data scientists love and hate notebooks: velocity and validation

Outerbounds · Intermediate ·🏭 MLOps & LLMOps ·3y ago
Skills: ML Pipelines80%

Key Takeaways

The video discusses the importance of velocity, validation, and versioning in operationalizing and deploying ML models, with a focus on the trade-offs between these aspects and how they relate to the use of notebooks such as Jupyter Notebooks.

Full Transcript

totally so this is this is great now we're getting back to the three most important aspects um that determine the success of operationalizing and deploying ml velocity validation and versioning you make it very clear that I'm going to actually quote you um High Velocity um means creating many versions in other words having high velocity means drowning in a sea of versions of experiments right so I'm envisaging there some Pareto front where there's attention and trade-off between velocity and uh versioning but then you also mentioned there are synergies between velocity and validating early so if ideas can be invalidated in earlier stages of deployment and overall velocity um is increased one more thing you mentioned is creating similar development and production environments exposes attention between velocity and validating so the development Cycles are more experimental move faster than production Cycles however if the development environment is significantly different from prod it's hard to validate ideas early so we have this oh there's some sort of triangle with synergies as well so I've mentioned a few but can can you just speak a bit more to the relationships and correlations and causations between these three incredibly important things yeah uh uh um I think the the knife the anecdote there is on Jupiter notebooks um for a long time we had all been seeing each other some people so excited a bunch of different notebooks some people absolutely hate to Bruno books everybody has strong opinions everybody wants to give a monologue on their opinions on Jupiter notebooks and I think for me it's been like so many years of hearing this kind of over and over again why people love it or hate it and I wanted to know why it was so polarizing um and it was very satisfying to me to hear this uh or to kind of frame it as this um kind of where do people lie on the velocity and validating Spectrum some people want to move fast and break things in the Facebook speak and they're okay with that if they can fix it some people do not want to move fast they want to make sure that there's no buggy models they want to make sure that everybody can review each other's work yes that hinders velocity um and kind of like it's really hard right some some people want Jupiter notebooks because they can go fast some people don't want data scientists to have they don't want data scientists to go too fast because then maybe certain scientific principles are disregarded maybe things are irreproducible I don't really know but it was nice to frame it this way because there's no right answer right it's like where do you personally lie on the Spectrum and when you run a company or you run your team like what is the ethos that you want to create um or like where do you guys want to lie um amazing that was interesting to me this is incredibly useful I I mean this framework so what I'm hearing is in this framework of velocity validation um and and versioning we can look at people who uh prefer Jupiter notebooks and in this framework they're essentially prioritizing velocity whereas people who are strongly opinionated against Jupiter notebooks uh prioritizing validation and versioning or mostly validation or how do you how do you think mostly validation um I think of it as validation more because it's like how do you uh what like how do you make sure the development and production environments are as similar as possible um whenever they're there there's a discrepancy there is chance for books um so you can remove the need to validate a lot when promoting from Dev to prod if there is no real like environment change from Death to product one great example is like sometimes people will iterate locally and then deploy to the prod service in Cloud that it's a huge environment mismatch so you need to do some sort of big validation I don't even know I don't think people have solved this problem of like making sure there aren't bugs in business and as much of environments um but again right like why why is it so separate like a completely different Hardware not even in the same like Cloud um crazy I I think this also exposed to me that like I am now feeling somewhat opinionated it's not the corrective there's no correct opinion right it's like what is the opinion you want to hold and you want to prescribe for the team that you are running yeah absolutely and a question around notebooks for example is if they don't have affordances which we'd like them to have maybe as tall Builders we build them into them as well right it isn't yes of course how do we want to build the future as well right yeah yeah as you point out

Original Description

A clip from our fireside chat "Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners" with Shreya Shankar. You can find the full conversation here: https://youtu.be/7zB6ESFto_U Find out more about how we think about MLOps, OSS, and human-centric data science tools here: https://outerbounds.com/
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Playlist UU5h8Ji6Lm1RyAZopnCpDq7Q · Outerbounds · 18 of 60

1 Metaflow GUI for monitoring machine learning workflows
Metaflow GUI for monitoring machine learning workflows
Outerbounds
2 Metaflow Cards [no sound]
Metaflow Cards [no sound]
Outerbounds
3 Fireside chat #1: How to Produce Sustainable Business Value with Machine Learning
Fireside chat #1: How to Produce Sustainable Business Value with Machine Learning
Outerbounds
4 Fireside chat #2: MadeWithML.com -- Teaching Practical Machine Learning
Fireside chat #2: MadeWithML.com -- Teaching Practical Machine Learning
Outerbounds
5 Metaflow on Kubernetes and Argo Workflows [no sound]
Metaflow on Kubernetes and Argo Workflows [no sound]
Outerbounds
6 Fireside chat #3: Reasonable Scale Machine Learning -- You're not Google and it's totally OK
Fireside chat #3: Reasonable Scale Machine Learning -- You're not Google and it's totally OK
Outerbounds
7 Metaflow Tags: Programmatic Tagging
Metaflow Tags: Programmatic Tagging
Outerbounds
8 Metaflow Tags: Basic Tagging
Metaflow Tags: Basic Tagging
Outerbounds
9 Metaflow Tags: Tags in CI/CD
Metaflow Tags: Tags in CI/CD
Outerbounds
10 Metaflow Tags: Tags and Namespaces
Metaflow Tags: Tags and Namespaces
Outerbounds
11 Metaflow Tags: Tags and Continuous Training
Metaflow Tags: Tags and Continuous Training
Outerbounds
12 Fireside chat #4: Machine Learning and User Experience -- Building ML Products for People
Fireside chat #4: Machine Learning and User Experience -- Building ML Products for People
Outerbounds
13 Fireside Chat #5: Machine Learning + Infrastructure for Humans
Fireside Chat #5: Machine Learning + Infrastructure for Humans
Outerbounds
14 Metaflow Sandbox Demo: Free Data Science Infrastructure In the Browser
Metaflow Sandbox Demo: Free Data Science Infrastructure In the Browser
Outerbounds
15 Metaflow on Azure
Metaflow on Azure
Outerbounds
16 Fireside Chat #6: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners
Fireside Chat #6: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners
Outerbounds
17 ML engineering vs traditional software engineering: similarities and differences
ML engineering vs traditional software engineering: similarities and differences
Outerbounds
Why data scientists love and hate notebooks: velocity and validation
Why data scientists love and hate notebooks: velocity and validation
Outerbounds
19 What even is a 10x ML engineer?
What even is a 10x ML engineer?
Outerbounds
20 The 4 main tasks in the production ML lifecycle
The 4 main tasks in the production ML lifecycle
Outerbounds
21 Is the premise of data-centric AI flawed?
Is the premise of data-centric AI flawed?
Outerbounds
22 The 3 factors that Determine the success of ML projects
The 3 factors that Determine the success of ML projects
Outerbounds
23 Fireside Chat #7: How to Build an Enterprise Machine Learning Platform from Scratch
Fireside Chat #7: How to Build an Enterprise Machine Learning Platform from Scratch
Outerbounds
24 Run Metaflow on any cloud: Google Cloud, Azure, or AWS [no sound]
Run Metaflow on any cloud: Google Cloud, Azure, or AWS [no sound]
Outerbounds
25 Metaflow on GCP
Metaflow on GCP
Outerbounds
26 Fireside Chat #8: Navigating the Full Stack of Machine Learning
Fireside Chat #8: Navigating the Full Stack of Machine Learning
Outerbounds
27 How to Build a Full-Stack Recommender System
How to Build a Full-Stack Recommender System
Outerbounds
28 Modernize your Airflow deployments with Metaflow - zero-cost migration [no sound]
Modernize your Airflow deployments with Metaflow - zero-cost migration [no sound]
Outerbounds
29 Easy Airflow DAGs for ML and data science with Metaflow [no sound]
Easy Airflow DAGs for ML and data science with Metaflow [no sound]
Outerbounds
30 Fireside chat #9:  Language Processing: From Prototype to Production
Fireside chat #9: Language Processing: From Prototype to Production
Outerbounds
31 How to build end-to-end recommender systems at reasonable scale
How to build end-to-end recommender systems at reasonable scale
Outerbounds
32 Full-Stack Machine Learning with Metaflow on CoRise
Full-Stack Machine Learning with Metaflow on CoRise
Outerbounds
33 Natural Language Processing meets MLOps
Natural Language Processing meets MLOps
Outerbounds
34 Fireside Chat #10: Large Language Models: Beyond Proofs of Concept
Fireside Chat #10: Large Language Models: Beyond Proofs of Concept
Outerbounds
35 What even are Large Language Models?
What even are Large Language Models?
Outerbounds
36 How to get started with LLMs today
How to get started with LLMs today
Outerbounds
37 LLMs in production
LLMs in production
Outerbounds
38 Accessing secrets securely in Metaflow [no audio]
Accessing secrets securely in Metaflow [no audio]
Outerbounds
39 Fireside Chat #11: The Open-Source Modern Data Stack
Fireside Chat #11: The Open-Source Modern Data Stack
Outerbounds
40 Fireside chat #12: Kubernetes for Data Scientists
Fireside chat #12: Kubernetes for Data Scientists
Outerbounds
41 Behind the Screen: How Amazon Prime Video ships RecSys models 4x faster
Behind the Screen: How Amazon Prime Video ships RecSys models 4x faster
Outerbounds
42 Fireside chat #13: Supply Chain Security in Machine Learning
Fireside chat #13: Supply Chain Security in Machine Learning
Outerbounds
43 Quick Delivery, Quicker ML: DeliveryHero's Metaflow Story
Quick Delivery, Quicker ML: DeliveryHero's Metaflow Story
Outerbounds
44 Crafting General Intelligence: LLM Fine-tuning with Metaflow at Adept.ai
Crafting General Intelligence: LLM Fine-tuning with Metaflow at Adept.ai
Outerbounds
45 Fuelling Decisions: How DTN Powers Gas Pricing and Data Science Collaboration
Fuelling Decisions: How DTN Powers Gas Pricing and Data Science Collaboration
Outerbounds
46 From Kitchen to Doorstep: Optimizing Data Science Velocity at Deliveroo
From Kitchen to Doorstep: Optimizing Data Science Velocity at Deliveroo
Outerbounds
47 Building a GenAI Ready ML Platform with Metaflow at Autodesk
Building a GenAI Ready ML Platform with Metaflow at Autodesk
Outerbounds
48 Media Transcoding for 10 Million users and beyond with Metaflow at Epignosis
Media Transcoding for 10 Million users and beyond with Metaflow at Epignosis
Outerbounds
49 Telematics with Metaflow: How Nirvana Insurance built a large-scale Risk Estimation platform
Telematics with Metaflow: How Nirvana Insurance built a large-scale Risk Estimation platform
Outerbounds
50 Fireside chat #14: Generative AI and Machine Learning for Film, TV, and Gaming
Fireside chat #14: Generative AI and Machine Learning for Film, TV, and Gaming
Outerbounds
51 The Past, Present, and Future of Generative AI
The Past, Present, and Future of Generative AI
Outerbounds
52 Building Production Systems with Generative AI, Machine Learning, and Data
Building Production Systems with Generative AI, Machine Learning, and Data
Outerbounds
53 A Custom Fine-Tuned LLM in Action (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 5)
A Custom Fine-Tuned LLM in Action (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 5)
Outerbounds
54 Building Live Production Systems with RAG (LLMs & RAG: An Interactive Guided Tour Part 4)
Building Live Production Systems with RAG (LLMs & RAG: An Interactive Guided Tour Part 4)
Outerbounds
55 Better Relevancy with RAG (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 3)
Better Relevancy with RAG (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 3)
Outerbounds
56 Working with OSS LLMs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 2)
Working with OSS LLMs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 2)
Outerbounds
57 Hitting OpenAI and Other Vendor APIs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 1)
Hitting OpenAI and Other Vendor APIs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 1)
Outerbounds
58 Production Systems with Generative AI (LLMs, RAG, & Fine-Tuning: An Interactive Guided Tour Part 0)
Production Systems with Generative AI (LLMs, RAG, & Fine-Tuning: An Interactive Guided Tour Part 0)
Outerbounds
59 LLMs in Practice: A Guide to Recent Trends and Techniques
LLMs in Practice: A Guide to Recent Trends and Techniques
Outerbounds
60 Metaflow for distributed high-performance computing and large-scale AI training
Metaflow for distributed high-performance computing and large-scale AI training
Outerbounds

The video discusses the trade-offs between velocity, validation, and versioning in ML deployment, and how notebooks like Jupyter Notebooks can be used to prioritize velocity or validation. It highlights the importance of creating similar development and production environments to increase velocity and validation.

Key Takeaways
  1. Identify the trade-offs between velocity, validation, and versioning in ML deployment
  2. Determine the importance of each aspect for your project
  3. Choose a notebook or tool that prioritizes your desired aspect
  4. Create similar development and production environments to increase velocity and validation
  5. Validate ML experiments and models
  6. Deploy ML models to production
💡 The use of notebooks like Jupyter Notebooks can prioritize either velocity or validation, and creating similar development and production environments is crucial for increasing both velocity and validation.

Related AI Lessons

DevOps Took 10 Years to Mature.
MLOps is distinct from DevOps and solves unique problems, requiring a different approach
Medium · DevOps
Praesto: A Kubernetes Operator for Node-Local ML Model Caching with CSI
Learn how Praesto, a Kubernetes Operator, optimizes ML model caching for Node-Local storage with CSI, reducing costs and improving performance
Medium · DevOps
Beyond `ollama run`: Production-Ready DeepSeek R1 Deployment with vLLM and Nginx
Learn to deploy DeepSeek R1 with vLLM and Nginx for production-ready environments, moving beyond local development
Dev.to · Shannon Dias
MCP Health Check: Building Production Monitoring for Your MCP Server — What I Learned After 84 Production Outages
Learn to build production monitoring for your MCP server to minimize outages and ensure smooth operation
Dev.to AI
Up next
Pole Pruner How A Rope Lever Shears High Branches
Innoforge Studio
Watch →