The 4 main tasks in the production ML lifecycle

Outerbounds · Intermediate ·📐 ML Fundamentals ·3y ago

Skills: ML Pipelines90%

Key Takeaways

The 4 main tasks in the production ML lifecycle, including MLOps and human-centric data science tools, are discussed by Outerbounds with Shreya Shankar

Full Transcript

in in the chat so I'm interested in what you discovered in terms of I mean at a at a basic level of packing but pattern recognition just what what the main tasks that people do in the production machine learning life cycle are what did you what did you find out I think a good answer to this is what the textbook says and they found that it's different from what the textbook says and what the textbook will tell you for machine learning it's first you collect data step one step two is you train a model step three is you evaluate that model on a holdout data set make sure there's no overfitting and then step four is you deploy um and the what we found is that we can still categorize into four steps um and maybe the data collection part is similar except for it's more of a look like every I don't know a week or so we want to collect new data we want to make sure there's some QA on that data or something but the rest of the steps the last three steps are totally different um the Second Step what I said before model training is actually experimentation in general whether it be training new models whether it be trying to Source new data or adding new features there's a lot of ways you can think about improving a model um and a lot of the participants actually preferred to look into finding new data gave you signal or making features more fresh instead of stale features that they had before so that's kind of step two stage three in the process was we call evaluation and deployment so evaluation is not a one and done thing what happens is evaluation is kind of done maybe on a holdout data set at first and then it's deployed to a small fraction of users and then when the model shows a little promise there increasingly it's deployed to more and more users as we learn more about what it can do what it can't do what failure modes exist how do we go and Patch problems until we've kind of gotten to the full population um so key takeaway is evaluation is not a one-time thing it is a loop on evaluation and deployment a multi-stage deployment and then the latest the step four we found was this overall monitoring and response stage which was when you do have these models in production what is their what is their live performance um if you see the performance dropping what are the bugs where are the bugs how do we respond to them quickly whether that be actually trying to go do root cause analysis or simply retraining the model there is a stage around making sure that there's little downtime for these services so we do we have those four stages shown in the first figure in the paper and and it was interesting to several of us authors that they don't match the textbook I think that's like kind of a narrative that we want people to take away absolutely and let me ask are these um different steps I mean there's an iterative Loop happening there but are they kind of coupled in someone because you could imagine monitoring and validation yeah aren't always separable right totally um I'll say that monitoring is kind of done across the state or kind of in all of the stages um people monitoring their training jobs people monitoring data collection a lot of the times there's human in the loop processes to collect and label data to verify some quality whenever you see a failure um on the ground to go and collect examples that look like that failure so you can go back and augment your validation sets so in that sense there's monitoring all over the place but the one stage that we found was that was super iterative in itself was um evaluation and deployment um evaluations data sets never stay the same they're always changing they're always growing especially in tasks or domains where failures have such a high cost like autonomous vehicles are a great example of a failure is a really high cost when we observe one we need to make sure that we have no more failures like that so how do we go and invest efforts into making sure our foldout data sets whenever you evaluate your bottles new models in the future they're also robust to the problem yeah and I suppose drilling down even a bit more into data collection and I think this is one thing that that you're getting at in textbook data validation you're given a holdout set right as opposed to be actively getting that data and validating it and making sure it's the right data then using it as validation set there are feedback delays there all of these totally totally um I feel like I was never taught this in a machine learning class nobody taught me how to actually evaluate a model it's not like I need a big checklist um I just want to know if I were to be a machine learning engineer for a month not just one time how do I what do I do about my validation set do I keep it the same do I grow it when do I add to it what do I add to it I think this in itself super interesting problems for people to consider

Original Description

A clip from our fireside chat "Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners" with Shreya Shankar. You can find the full conversation here: https://youtu.be/7zB6ESFto_U Find out more about how we think about MLOps, OSS, and human-centric data science tools here: https://outerbounds.com/

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Playlist UU5h8Ji6Lm1RyAZopnCpDq7Q · Outerbounds · 20 of 60

← Previous Next →

Metaflow GUI for monitoring machine learning workflows

Metaflow GUI for monitoring machine learning workflows

Metaflow Cards [no sound]

Metaflow Cards [no sound]

Fireside chat #1: How to Produce Sustainable Business Value with Machine Learning

Fireside chat #1: How to Produce Sustainable Business Value with Machine Learning

Fireside chat #2: MadeWithML.com -- Teaching Practical Machine Learning

Fireside chat #2: MadeWithML.com -- Teaching Practical Machine Learning

Metaflow on Kubernetes and Argo Workflows [no sound]

Metaflow on Kubernetes and Argo Workflows [no sound]

Fireside chat #3: Reasonable Scale Machine Learning -- You're not Google and it's totally OK

Fireside chat #3: Reasonable Scale Machine Learning -- You're not Google and it's totally OK

Metaflow Tags: Programmatic Tagging

Metaflow Tags: Programmatic Tagging

Metaflow Tags: Basic Tagging

Metaflow Tags: Basic Tagging

Metaflow Tags: Tags in CI/CD

Metaflow Tags: Tags in CI/CD

Metaflow Tags: Tags and Namespaces

Metaflow Tags: Tags and Namespaces

Metaflow Tags: Tags and Continuous Training

Metaflow Tags: Tags and Continuous Training

Fireside chat #4: Machine Learning and User Experience -- Building ML Products for People

Fireside chat #4: Machine Learning and User Experience -- Building ML Products for People

Fireside Chat #5: Machine Learning + Infrastructure for Humans

Fireside Chat #5: Machine Learning + Infrastructure for Humans

Metaflow Sandbox Demo: Free Data Science Infrastructure In the Browser

Metaflow Sandbox Demo: Free Data Science Infrastructure In the Browser

Metaflow on Azure

Metaflow on Azure

Fireside Chat #6: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners

Fireside Chat #6: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners

ML engineering vs traditional software engineering: similarities and differences

ML engineering vs traditional software engineering: similarities and differences

Why data scientists love and hate notebooks: velocity and validation

Why data scientists love and hate notebooks: velocity and validation

What even is a 10x ML engineer?

What even is a 10x ML engineer?

The 4 main tasks in the production ML lifecycle

The 4 main tasks in the production ML lifecycle

Is the premise of data-centric AI flawed?

Is the premise of data-centric AI flawed?

The 3 factors that Determine the success of ML projects

The 3 factors that Determine the success of ML projects

Fireside Chat #7: How to Build an Enterprise Machine Learning Platform from Scratch

Fireside Chat #7: How to Build an Enterprise Machine Learning Platform from Scratch

Run Metaflow on any cloud: Google Cloud, Azure, or AWS [no sound]

Run Metaflow on any cloud: Google Cloud, Azure, or AWS [no sound]

Metaflow on GCP

Metaflow on GCP

Fireside Chat #8: Navigating the Full Stack of Machine Learning

Fireside Chat #8: Navigating the Full Stack of Machine Learning

How to Build a Full-Stack Recommender System

How to Build a Full-Stack Recommender System

Modernize your Airflow deployments with Metaflow - zero-cost migration [no sound]

Modernize your Airflow deployments with Metaflow - zero-cost migration [no sound]

Easy Airflow DAGs for ML and data science with Metaflow [no sound]

Easy Airflow DAGs for ML and data science with Metaflow [no sound]

Fireside chat #9: Language Processing: From Prototype to Production

Fireside chat #9: Language Processing: From Prototype to Production

How to build end-to-end recommender systems at reasonable scale

How to build end-to-end recommender systems at reasonable scale

Full-Stack Machine Learning with Metaflow on CoRise

Full-Stack Machine Learning with Metaflow on CoRise

Natural Language Processing meets MLOps

Natural Language Processing meets MLOps

Fireside Chat #10: Large Language Models: Beyond Proofs of Concept

Fireside Chat #10: Large Language Models: Beyond Proofs of Concept

What even are Large Language Models?

What even are Large Language Models?

How to get started with LLMs today

How to get started with LLMs today

LLMs in production

LLMs in production

Accessing secrets securely in Metaflow [no audio]

Accessing secrets securely in Metaflow [no audio]

Fireside Chat #11: The Open-Source Modern Data Stack

Fireside Chat #11: The Open-Source Modern Data Stack

Fireside chat #12: Kubernetes for Data Scientists

Fireside chat #12: Kubernetes for Data Scientists

Behind the Screen: How Amazon Prime Video ships RecSys models 4x faster

Behind the Screen: How Amazon Prime Video ships RecSys models 4x faster

Fireside chat #13: Supply Chain Security in Machine Learning

Fireside chat #13: Supply Chain Security in Machine Learning

Quick Delivery, Quicker ML: DeliveryHero's Metaflow Story

Quick Delivery, Quicker ML: DeliveryHero's Metaflow Story

Crafting General Intelligence: LLM Fine-tuning with Metaflow at Adept.ai

Crafting General Intelligence: LLM Fine-tuning with Metaflow at Adept.ai

Fuelling Decisions: How DTN Powers Gas Pricing and Data Science Collaboration

Fuelling Decisions: How DTN Powers Gas Pricing and Data Science Collaboration

From Kitchen to Doorstep: Optimizing Data Science Velocity at Deliveroo

From Kitchen to Doorstep: Optimizing Data Science Velocity at Deliveroo

Building a GenAI Ready ML Platform with Metaflow at Autodesk

Building a GenAI Ready ML Platform with Metaflow at Autodesk

Media Transcoding for 10 Million users and beyond with Metaflow at Epignosis

Media Transcoding for 10 Million users and beyond with Metaflow at Epignosis

Telematics with Metaflow: How Nirvana Insurance built a large-scale Risk Estimation platform

Telematics with Metaflow: How Nirvana Insurance built a large-scale Risk Estimation platform

Fireside chat #14: Generative AI and Machine Learning for Film, TV, and Gaming

Fireside chat #14: Generative AI and Machine Learning for Film, TV, and Gaming

The Past, Present, and Future of Generative AI

The Past, Present, and Future of Generative AI

Building Production Systems with Generative AI, Machine Learning, and Data

Building Production Systems with Generative AI, Machine Learning, and Data

A Custom Fine-Tuned LLM in Action (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 5)

A Custom Fine-Tuned LLM in Action (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 5)

Building Live Production Systems with RAG (LLMs & RAG: An Interactive Guided Tour Part 4)

Building Live Production Systems with RAG (LLMs & RAG: An Interactive Guided Tour Part 4)

Better Relevancy with RAG (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 3)

Better Relevancy with RAG (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 3)

Working with OSS LLMs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 2)

Working with OSS LLMs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 2)

Hitting OpenAI and Other Vendor APIs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 1)

Hitting OpenAI and Other Vendor APIs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 1)

Production Systems with Generative AI (LLMs, RAG, & Fine-Tuning: An Interactive Guided Tour Part 0)

Production Systems with Generative AI (LLMs, RAG, & Fine-Tuning: An Interactive Guided Tour Part 0)

LLMs in Practice: A Guide to Recent Trends and Techniques

LLMs in Practice: A Guide to Recent Trends and Techniques

Metaflow for distributed high-performance computing and large-scale AI training

Metaflow for distributed high-performance computing and large-scale AI training

The video discusses the 4 main tasks in the production ML lifecycle, highlighting the importance of MLOps and human-centric data science tools in operationalizing ML models. By understanding these tasks, viewers can improve their ML pipeline implementation and model deployment. The conversation with Shreya Shankar provides valuable insights into the challenges and best practices of MLOps.

Key Takeaways

Identify the 4 main tasks in the production ML lifecycle
Understand the role of MLOps in operationalizing ML models
Explore human-centric data science tools for ML development
Implement ML pipelines using MLOps principles
Deploy and monitor ML models in production

💡 The 4 main tasks in the production ML lifecycle are critical to successful ML model deployment and maintenance, and MLOps plays a key role in bridging the gap between data science and engineering.

🔒 Pro feature: Ask AI to explain this lesson →

More on: ML Pipelines

View skill →

Building a Dog Breed Identifier App from scratch - DogNet

Building a Dog Breed Identifier App from scratch - DogNet

Aladdin Persson

Complete Dockers For Data Science Tutorial In One Shot

Complete Dockers For Data Science Tutorial In One Shot

Part 6 | Deploy ML Model on Kubernetes | Auto-Scaling with HPA and Monitoring with Prometheus

Part 6 | Deploy ML Model on Kubernetes | Auto-Scaling with HPA and Monitoring with Prometheus

Abonia Sojasingarayar

Vertex Pipelines: Qwik Start

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Automate R scripts with GitHub Actions: Deploy a model

Related AI Lessons

Stop Overfitting With Basically One Line of Code

Learn to prevent overfitting with a simple code tweak and understand the difference between Ridge and Lasso regression

Stop Overfitting With Basically One Line of Code

Learn to prevent overfitting in machine learning models with a simple code tweak and understand the difference between Ridge and Lasso regression

Medium · Machine Learning

Stop Overfitting With Basically One Line of Code

Prevent overfitting in models with a simple code tweak, understanding the difference between Ridge and Lasso regression

Medium · Data Science

Stop Overfitting With Basically One Line of Code

Learn to prevent overfitting in machine learning models with a simple code tweak, comparing Ridge and Lasso regression techniques

Medium · Python

Learn Deep Learning by Hand (Beginner's Guide - Part 1)