Metaflow Tags: Programmatic Tagging

Outerbounds · Beginner ·🏭 MLOps & LLMOps ·4y ago

Skills: ML Pipelines80%

Key Takeaways

This video demonstrates how to use Metaflow tags programmatically in scripts and notebooks for experiment tracking and organizing results, showcasing the use of the Metaflow client API to assign and remove tags based on model accuracy.

Full Transcript

in some sense this example is an inverse of the previous one so in the previous example we ran an execution then we inspected the result using the metaflow card and the ui and then if the result was promising enough we assigned a tag to it in this example we assign the tags automatically so here i just added this one snippet of code here in the existing flow that looks at the accuracy of the model automatically here in the code and if the accuracy is high enough it retrieves the currently executing run using the meta flow client api and then like using the programmatic api for tags it adds a tag called promising model to the currently executing run so the tagging happens automatically in this case and now you might wonder that what's the point of assigning tags instead of using artifacts well the simple reason is that artifacts are immutable they are basically in some sense a kind of a immutable representation of the history which can't be changed history is what it is but you can change the interpretation of the words so for instance although the runs and the accuracies and like whatever it might be they are but they are but you can interpret the results in a different way and add and remove tags based on like post talk analysis so let's see how it works so now like we can execute the code like three candidates run and in this case the the accuracy is 0.84 and now imagine that you are a data scientist working on this project so maybe there are many things that you are iterating like maybe maybe you want to try different type of parameters like here we can try something crazy let's change that and see what happens or maybe maybe it was actually the max features that we want to be changing i think that there's an option for log two and maybe as a sanity check we can also just see what happens if we change the random seed maybe the results are just random the point here is that data sciences is fundamentally an iterative process that the development of of these projects is an iterative process and we keep like trying different things we develop code and it's nice that metaflow keeps track of everything automatically so after a while like we have bunch of these iterations and now what we can do is that let's say we can open a notebook and i have this notebook here that i have prepared for this example where we import like the metaflow client objects and now metaflow allows you to filter executions based on a tag so we can say that okay instead of looking at all the runs of this flow let's only look at the ones that have the tag promising model and now we can see that in this case we have four executions that have the tag here so and they are listed right here now of course like it's kind of hard to say like what's going on here so what we can do is that we can actually look at the artifacts now that are stored inside these runs so we can look at the parameters like we can look at the accuracy values we just um collect all that information in a few lists and like create a data frame out of it so we can actually see what's going on so in this case now we can see all those values all the parameterizations that we were testing uh max depth the features the accuracies and and so forth and obviously you can see that this is in effect experiment tracking what we are doing here and now we can visualize the results using standard pandas tools and uh well now looking at these results looking at the accuracy numbers it's kind of obvious that there's that one one experiment like one execution that actually like doesn't look that promising and now thanks to the fact that tags are immutable in contrast to the artifacts we really can't change the the artifacts and the fact that like let's say the max depth didn't yield good results well it's a fact of life like we shouldn't change that but we can actually change the interpretation that this is a promising model so what we can do is that we can instantiate the corresponding run object eight one um two one seven so that's the run right and now what we can do is that we can actually say that this is not that promising after all so we can remove the tag promising model and now if we go back here we can see that the list is shorter like we have actually removed that one execution that like didn't show good results and now a couple of things worth noticing here first we have the programmatic api that we can use to assign and remove tags and also thanks to the fact that we have both the artifacts and the tags available you can use metaflow quite handily for experiment tracking

Original Description

Learn how to use Metaflow tags programmatically in your scripts and notebooks for experiment tracking and to organize results Find out more about how we think about MLOps, OSS, and human-centric data science tools here: https://outerbounds.com/

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Playlist UU5h8Ji6Lm1RyAZopnCpDq7Q · Outerbounds · 7 of 60

← Previous Next →

Metaflow GUI for monitoring machine learning workflows

Metaflow GUI for monitoring machine learning workflows

Metaflow Cards [no sound]

Metaflow Cards [no sound]

Fireside chat #1: How to Produce Sustainable Business Value with Machine Learning

Fireside chat #1: How to Produce Sustainable Business Value with Machine Learning

Fireside chat #2: MadeWithML.com -- Teaching Practical Machine Learning

Fireside chat #2: MadeWithML.com -- Teaching Practical Machine Learning

Metaflow on Kubernetes and Argo Workflows [no sound]

Metaflow on Kubernetes and Argo Workflows [no sound]

Fireside chat #3: Reasonable Scale Machine Learning -- You're not Google and it's totally OK

Fireside chat #3: Reasonable Scale Machine Learning -- You're not Google and it's totally OK

Metaflow Tags: Programmatic Tagging

Metaflow Tags: Programmatic Tagging

Metaflow Tags: Basic Tagging

Metaflow Tags: Basic Tagging

Metaflow Tags: Tags in CI/CD

Metaflow Tags: Tags in CI/CD

Metaflow Tags: Tags and Namespaces

Metaflow Tags: Tags and Namespaces

Metaflow Tags: Tags and Continuous Training

Metaflow Tags: Tags and Continuous Training

Fireside chat #4: Machine Learning and User Experience -- Building ML Products for People

Fireside chat #4: Machine Learning and User Experience -- Building ML Products for People

Fireside Chat #5: Machine Learning + Infrastructure for Humans

Fireside Chat #5: Machine Learning + Infrastructure for Humans

Metaflow Sandbox Demo: Free Data Science Infrastructure In the Browser

Metaflow Sandbox Demo: Free Data Science Infrastructure In the Browser

Metaflow on Azure

Metaflow on Azure

Fireside Chat #6: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners

Fireside Chat #6: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners

ML engineering vs traditional software engineering: similarities and differences

ML engineering vs traditional software engineering: similarities and differences

Why data scientists love and hate notebooks: velocity and validation

Why data scientists love and hate notebooks: velocity and validation

What even is a 10x ML engineer?

What even is a 10x ML engineer?

The 4 main tasks in the production ML lifecycle

The 4 main tasks in the production ML lifecycle

Is the premise of data-centric AI flawed?

Is the premise of data-centric AI flawed?

The 3 factors that Determine the success of ML projects

The 3 factors that Determine the success of ML projects

Fireside Chat #7: How to Build an Enterprise Machine Learning Platform from Scratch

Fireside Chat #7: How to Build an Enterprise Machine Learning Platform from Scratch

Run Metaflow on any cloud: Google Cloud, Azure, or AWS [no sound]

Run Metaflow on any cloud: Google Cloud, Azure, or AWS [no sound]

Metaflow on GCP

Metaflow on GCP

Fireside Chat #8: Navigating the Full Stack of Machine Learning

Fireside Chat #8: Navigating the Full Stack of Machine Learning

How to Build a Full-Stack Recommender System

How to Build a Full-Stack Recommender System

Modernize your Airflow deployments with Metaflow - zero-cost migration [no sound]

Modernize your Airflow deployments with Metaflow - zero-cost migration [no sound]

Easy Airflow DAGs for ML and data science with Metaflow [no sound]

Easy Airflow DAGs for ML and data science with Metaflow [no sound]

Fireside chat #9: Language Processing: From Prototype to Production

Fireside chat #9: Language Processing: From Prototype to Production

How to build end-to-end recommender systems at reasonable scale

How to build end-to-end recommender systems at reasonable scale

Full-Stack Machine Learning with Metaflow on CoRise

Full-Stack Machine Learning with Metaflow on CoRise

Natural Language Processing meets MLOps

Natural Language Processing meets MLOps

Fireside Chat #10: Large Language Models: Beyond Proofs of Concept

Fireside Chat #10: Large Language Models: Beyond Proofs of Concept

What even are Large Language Models?

What even are Large Language Models?

How to get started with LLMs today

How to get started with LLMs today

LLMs in production

LLMs in production

Accessing secrets securely in Metaflow [no audio]

Accessing secrets securely in Metaflow [no audio]

Fireside Chat #11: The Open-Source Modern Data Stack

Fireside Chat #11: The Open-Source Modern Data Stack

Fireside chat #12: Kubernetes for Data Scientists

Fireside chat #12: Kubernetes for Data Scientists

Behind the Screen: How Amazon Prime Video ships RecSys models 4x faster

Behind the Screen: How Amazon Prime Video ships RecSys models 4x faster

Fireside chat #13: Supply Chain Security in Machine Learning

Fireside chat #13: Supply Chain Security in Machine Learning

Quick Delivery, Quicker ML: DeliveryHero's Metaflow Story

Quick Delivery, Quicker ML: DeliveryHero's Metaflow Story

Crafting General Intelligence: LLM Fine-tuning with Metaflow at Adept.ai

Crafting General Intelligence: LLM Fine-tuning with Metaflow at Adept.ai

Fuelling Decisions: How DTN Powers Gas Pricing and Data Science Collaboration

Fuelling Decisions: How DTN Powers Gas Pricing and Data Science Collaboration

From Kitchen to Doorstep: Optimizing Data Science Velocity at Deliveroo

From Kitchen to Doorstep: Optimizing Data Science Velocity at Deliveroo

Building a GenAI Ready ML Platform with Metaflow at Autodesk

Building a GenAI Ready ML Platform with Metaflow at Autodesk

Media Transcoding for 10 Million users and beyond with Metaflow at Epignosis

Media Transcoding for 10 Million users and beyond with Metaflow at Epignosis

Telematics with Metaflow: How Nirvana Insurance built a large-scale Risk Estimation platform

Telematics with Metaflow: How Nirvana Insurance built a large-scale Risk Estimation platform

Fireside chat #14: Generative AI and Machine Learning for Film, TV, and Gaming

Fireside chat #14: Generative AI and Machine Learning for Film, TV, and Gaming

The Past, Present, and Future of Generative AI

The Past, Present, and Future of Generative AI

Building Production Systems with Generative AI, Machine Learning, and Data

Building Production Systems with Generative AI, Machine Learning, and Data

A Custom Fine-Tuned LLM in Action (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 5)

A Custom Fine-Tuned LLM in Action (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 5)

Building Live Production Systems with RAG (LLMs & RAG: An Interactive Guided Tour Part 4)

Building Live Production Systems with RAG (LLMs & RAG: An Interactive Guided Tour Part 4)

Better Relevancy with RAG (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 3)

Better Relevancy with RAG (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 3)

Working with OSS LLMs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 2)

Working with OSS LLMs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 2)

Hitting OpenAI and Other Vendor APIs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 1)

Hitting OpenAI and Other Vendor APIs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 1)

Production Systems with Generative AI (LLMs, RAG, & Fine-Tuning: An Interactive Guided Tour Part 0)

Production Systems with Generative AI (LLMs, RAG, & Fine-Tuning: An Interactive Guided Tour Part 0)

LLMs in Practice: A Guide to Recent Trends and Techniques

LLMs in Practice: A Guide to Recent Trends and Techniques

Metaflow for distributed high-performance computing and large-scale AI training

Metaflow for distributed high-performance computing and large-scale AI training

This video teaches how to use Metaflow tags programmatically to track experiments and organize results, and how to use the Metaflow client API to assign and remove tags based on model accuracy. The video demonstrates the use of tags in an iterative data science process.

Key Takeaways

Add code to existing flow to assign tags automatically based on model accuracy
Use Metaflow client API to retrieve currently executing run and add tag
Filter executions based on tag using Metaflow client objects
Collect artifact information and create data frame
Visualize results using standard pandas tools
Remove tag from execution if results are not promising

💡 Tags are mutable and can be used to interpret results in a different way, whereas artifacts are immutable and represent a fixed history

🔒 Pro feature: Ask AI to explain this lesson →

More on: ML Pipelines

View skill →

Building a Dog Breed Identifier App from scratch - DogNet

Building a Dog Breed Identifier App from scratch - DogNet

Aladdin Persson

Complete Dockers For Data Science Tutorial In One Shot

Complete Dockers For Data Science Tutorial In One Shot

Part 6 | Deploy ML Model on Kubernetes | Auto-Scaling with HPA and Monitoring with Prometheus

Part 6 | Deploy ML Model on Kubernetes | Auto-Scaling with HPA and Monitoring with Prometheus

Abonia Sojasingarayar

Vertex Pipelines: Qwik Start

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Automate R scripts with GitHub Actions: Deploy a model

Related Reads

A Phased Blueprint for Migrating From Google Workspace to Microsoft 365

Learn a step-by-step approach to migrate from Google Workspace to Microsoft 365 with minimal downtime and zero data loss, understanding it as an infrastructure engineering challenge

Feature Freshness: The Forgotten Problem of MLOps

Learn how outdated features can cause production models to fail and why feature freshness is crucial in MLOps, to improve model performance and reliability

Day 19 of the 100 Days of MLOps Challenge

Learn to build a complete DVC ML pipeline with remote storage and experiments to streamline your machine learning workflow and improve collaboration

Medium · DevOps

From Critical Infrastructure to AI Factories: Building an AI Operations Copilot on Nebius…

Learn how to build an AI operations copilot by leveraging experience in critical infrastructure and AI-assisted engineering, and why it matters for efficient AI deployment

Pole Pruner How A Rope Lever Shears High Branches

Innoforge Studio