Metaflow Tags: Programmatic Tagging
Skills:
ML Pipelines80%
Key Takeaways
This video demonstrates how to use Metaflow tags programmatically in scripts and notebooks for experiment tracking and organizing results, showcasing the use of the Metaflow client API to assign and remove tags based on model accuracy.
Full Transcript
in some sense this example is an inverse of the previous one so in the previous example we ran an execution then we inspected the result using the metaflow card and the ui and then if the result was promising enough we assigned a tag to it in this example we assign the tags automatically so here i just added this one snippet of code here in the existing flow that looks at the accuracy of the model automatically here in the code and if the accuracy is high enough it retrieves the currently executing run using the meta flow client api and then like using the programmatic api for tags it adds a tag called promising model to the currently executing run so the tagging happens automatically in this case and now you might wonder that what's the point of assigning tags instead of using artifacts well the simple reason is that artifacts are immutable they are basically in some sense a kind of a immutable representation of the history which can't be changed history is what it is but you can change the interpretation of the words so for instance although the runs and the accuracies and like whatever it might be they are but they are but you can interpret the results in a different way and add and remove tags based on like post talk analysis so let's see how it works so now like we can execute the code like three candidates run and in this case the the accuracy is 0.84 and now imagine that you are a data scientist working on this project so maybe there are many things that you are iterating like maybe maybe you want to try different type of parameters like here we can try something crazy let's change that and see what happens or maybe maybe it was actually the max features that we want to be changing i think that there's an option for log two and maybe as a sanity check we can also just see what happens if we change the random seed maybe the results are just random the point here is that data sciences is fundamentally an iterative process that the development of of these projects is an iterative process and we keep like trying different things we develop code and it's nice that metaflow keeps track of everything automatically so after a while like we have bunch of these iterations and now what we can do is that let's say we can open a notebook and i have this notebook here that i have prepared for this example where we import like the metaflow client objects and now metaflow allows you to filter executions based on a tag so we can say that okay instead of looking at all the runs of this flow let's only look at the ones that have the tag promising model and now we can see that in this case we have four executions that have the tag here so and they are listed right here now of course like it's kind of hard to say like what's going on here so what we can do is that we can actually look at the artifacts now that are stored inside these runs so we can look at the parameters like we can look at the accuracy values we just um collect all that information in a few lists and like create a data frame out of it so we can actually see what's going on so in this case now we can see all those values all the parameterizations that we were testing uh max depth the features the accuracies and and so forth and obviously you can see that this is in effect experiment tracking what we are doing here and now we can visualize the results using standard pandas tools and uh well now looking at these results looking at the accuracy numbers it's kind of obvious that there's that one one experiment like one execution that actually like doesn't look that promising and now thanks to the fact that tags are immutable in contrast to the artifacts we really can't change the the artifacts and the fact that like let's say the max depth didn't yield good results well it's a fact of life like we shouldn't change that but we can actually change the interpretation that this is a promising model so what we can do is that we can instantiate the corresponding run object eight one um two one seven so that's the run right and now what we can do is that we can actually say that this is not that promising after all so we can remove the tag promising model and now if we go back here we can see that the list is shorter like we have actually removed that one execution that like didn't show good results and now a couple of things worth noticing here first we have the programmatic api that we can use to assign and remove tags and also thanks to the fact that we have both the artifacts and the tags available you can use metaflow quite handily for experiment tracking
Original Description
Learn how to use Metaflow tags programmatically in your scripts and notebooks for experiment tracking and to organize results
Find out more about how we think about MLOps, OSS, and human-centric data science tools here: https://outerbounds.com/
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Playlist UU5h8Ji6Lm1RyAZopnCpDq7Q · Outerbounds · 7 of 60
1
2
3
4
5
6
▶
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Metaflow GUI for monitoring machine learning workflows
Outerbounds
Metaflow Cards [no sound]
Outerbounds
Fireside chat #1: How to Produce Sustainable Business Value with Machine Learning
Outerbounds
Fireside chat #2: MadeWithML.com -- Teaching Practical Machine Learning
Outerbounds
Metaflow on Kubernetes and Argo Workflows [no sound]
Outerbounds
Fireside chat #3: Reasonable Scale Machine Learning -- You're not Google and it's totally OK
Outerbounds
Metaflow Tags: Programmatic Tagging
Outerbounds
Metaflow Tags: Basic Tagging
Outerbounds
Metaflow Tags: Tags in CI/CD
Outerbounds
Metaflow Tags: Tags and Namespaces
Outerbounds
Metaflow Tags: Tags and Continuous Training
Outerbounds
Fireside chat #4: Machine Learning and User Experience -- Building ML Products for People
Outerbounds
Fireside Chat #5: Machine Learning + Infrastructure for Humans
Outerbounds
Metaflow Sandbox Demo: Free Data Science Infrastructure In the Browser
Outerbounds
Metaflow on Azure
Outerbounds
Fireside Chat #6: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners
Outerbounds
ML engineering vs traditional software engineering: similarities and differences
Outerbounds
Why data scientists love and hate notebooks: velocity and validation
Outerbounds
What even is a 10x ML engineer?
Outerbounds
The 4 main tasks in the production ML lifecycle
Outerbounds
Is the premise of data-centric AI flawed?
Outerbounds
The 3 factors that Determine the success of ML projects
Outerbounds
Fireside Chat #7: How to Build an Enterprise Machine Learning Platform from Scratch
Outerbounds
Run Metaflow on any cloud: Google Cloud, Azure, or AWS [no sound]
Outerbounds
Metaflow on GCP
Outerbounds
Fireside Chat #8: Navigating the Full Stack of Machine Learning
Outerbounds
How to Build a Full-Stack Recommender System
Outerbounds
Modernize your Airflow deployments with Metaflow - zero-cost migration [no sound]
Outerbounds
Easy Airflow DAGs for ML and data science with Metaflow [no sound]
Outerbounds
Fireside chat #9: Language Processing: From Prototype to Production
Outerbounds
How to build end-to-end recommender systems at reasonable scale
Outerbounds
Full-Stack Machine Learning with Metaflow on CoRise
Outerbounds
Natural Language Processing meets MLOps
Outerbounds
Fireside Chat #10: Large Language Models: Beyond Proofs of Concept
Outerbounds
What even are Large Language Models?
Outerbounds
How to get started with LLMs today
Outerbounds
LLMs in production
Outerbounds
Accessing secrets securely in Metaflow [no audio]
Outerbounds
Fireside Chat #11: The Open-Source Modern Data Stack
Outerbounds
Fireside chat #12: Kubernetes for Data Scientists
Outerbounds
Behind the Screen: How Amazon Prime Video ships RecSys models 4x faster
Outerbounds
Fireside chat #13: Supply Chain Security in Machine Learning
Outerbounds
Quick Delivery, Quicker ML: DeliveryHero's Metaflow Story
Outerbounds
Crafting General Intelligence: LLM Fine-tuning with Metaflow at Adept.ai
Outerbounds
Fuelling Decisions: How DTN Powers Gas Pricing and Data Science Collaboration
Outerbounds
From Kitchen to Doorstep: Optimizing Data Science Velocity at Deliveroo
Outerbounds
Building a GenAI Ready ML Platform with Metaflow at Autodesk
Outerbounds
Media Transcoding for 10 Million users and beyond with Metaflow at Epignosis
Outerbounds
Telematics with Metaflow: How Nirvana Insurance built a large-scale Risk Estimation platform
Outerbounds
Fireside chat #14: Generative AI and Machine Learning for Film, TV, and Gaming
Outerbounds
The Past, Present, and Future of Generative AI
Outerbounds
Building Production Systems with Generative AI, Machine Learning, and Data
Outerbounds
A Custom Fine-Tuned LLM in Action (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 5)
Outerbounds
Building Live Production Systems with RAG (LLMs & RAG: An Interactive Guided Tour Part 4)
Outerbounds
Better Relevancy with RAG (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 3)
Outerbounds
Working with OSS LLMs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 2)
Outerbounds
Hitting OpenAI and Other Vendor APIs (LLMs, RAG, and Fine-Tuning: An Interactive Guided Tour Part 1)
Outerbounds
Production Systems with Generative AI (LLMs, RAG, & Fine-Tuning: An Interactive Guided Tour Part 0)
Outerbounds
LLMs in Practice: A Guide to Recent Trends and Techniques
Outerbounds
Metaflow for distributed high-performance computing and large-scale AI training
Outerbounds
More on: ML Pipelines
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
DevOps Took 10 Years to Mature.
Medium · DevOps
Praesto: A Kubernetes Operator for Node-Local ML Model Caching with CSI
Medium · DevOps
Beyond `ollama run`: Production-Ready DeepSeek R1 Deployment with vLLM and Nginx
Dev.to · Shannon Dias
MCP Health Check: Building Production Monitoring for Your MCP Server — What I Learned After 84 Production Outages
Dev.to AI
🎓
Tutor Explanation
DeepCamp AI