Applied AI

MLOps & LLMOps

Model deployment, experiment tracking, monitoring, inference optimisation and AI pipelines

887
lessons
Skills in this topic
View full skill map →
Experiment Tracking
beginner
Log experiments with MLflow or Weights & Biases
Model Deployment
intermediate
Wrap a model in a FastAPI endpoint
Model Monitoring
intermediate
Set up drift detection with Evidently AI
Feature Stores
advanced
Define feature views in Feast
LLMOps
advanced
Set up LangSmith or Langfuse for LLM tracing
All Reads (379) Articles (242)Blog Posts (112)Tutorials (24)News (1)
A Phased Blueprint for Migrating From Google Workspace to Microsoft 365
Hackernoon 🏭 MLOps & LLMOps ⚡ AI Lesson 2d ago
A Phased Blueprint for Migrating From Google Workspace to Microsoft 365
This article presents a phased blueprint for migrating from Google Workspace to Microsoft 365 with zero data loss and minimal downtime. It covers tenant hardeni
Feature Freshness: The Forgotten Problem of MLOps
Medium · LLM 🏭 MLOps & LLMOps ⚡ AI Lesson 3d ago
Feature Freshness: The Forgotten Problem of MLOps
Why Most Production Models Fail Because Their Features Are Old, Not Because Their Models Are Bad Continue reading on Medium »
Day 19 of the 100 Days of MLOps Challenge
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 3d ago
Day 19 of the 100 Days of MLOps Challenge
Today’s task was to build a complete DVC ML Pipeline with Remote Storage and Experiments. Continue reading on Medium »
From Critical Infrastructure to AI Factories: Building an AI Operations Copilot on Nebius…
Medium · LLM 🏭 MLOps & LLMOps ⚡ AI Lesson 4d ago
From Critical Infrastructure to AI Factories: Building an AI Operations Copilot on Nebius…
How experience in critical infrastructure, conversations with HPC professionals, and AI-assisted engineering inspired the design of an… Continue reading on Medi
DevOps Took 10 Years to Mature.
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 5d ago
DevOps Took 10 Years to Mature.
MLOps is not DevOps with a machine learning flavour. It solves a different class of problems — and the organisations that treat it like… Continue reading on Med
Praesto: A Kubernetes Operator for Node-Local ML Model Caching with CSI
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 5d ago
Praesto: A Kubernetes Operator for Node-Local ML Model Caching with CSI
Running ML and LLM workloads on Kubernetes often starts with a surprisingly expensive step: getting the model onto the node. Continue reading on Medium »
RocoMart: Building an End-to-End MLOps Pipeline Orchestration for E-Commerce
Medium · Python 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
RocoMart: Building an End-to-End MLOps Pipeline Orchestration for E-Commerce
Architect a robust MLOps pipeline from scratch using Python, Prefect, MLflow, and Flask to power real-time e-commerce tech. Continue reading on Analytics Vidhya
Medium · Machine Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
MLOps Patterns for Enterprise AI: Beyond Model Training to Autonomous Production Systems
Five battle-tested patterns for building scalable, governed autonomous AI systems at enterprise scale Continue reading on Medium »
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
MLOps Patterns for Enterprise AI: Beyond Model Training to Autonomous Production Systems
Five battle-tested patterns for building scalable, governed autonomous AI systems at enterprise scale Continue reading on Medium »
Streamlining Data-Driven Operations: A Comparative Analysis of DevOps, DataOps, MLOps, and ModelOps
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
Streamlining Data-Driven Operations: A Comparative Analysis of DevOps, DataOps, MLOps, and ModelOps
Compare DevOps, DataOps, MLOps, and ModelOps. Master enterprise AI governance, CI/CT, and scalable platform engineering strategies. Continue reading on Medium »
MLOps Pillar #1: How to Structure Data Workflows for Scalable Machine Learning
Medium · Data Science 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
MLOps Pillar #1: How to Structure Data Workflows for Scalable Machine Learning
Why strong data workflows, reusable features, and traceable lineage are the foundation of scalable ML systems Continue reading on Medium »
I Built a Full MLOps Pipeline to Sort Trash - and You Can Too
Medium · Machine Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
I Built a Full MLOps Pipeline to Sort Trash - and You Can Too
Most ML tutorials stop the second the model says “87% accuracy.” This one keeps going -through tracking, evaluation, testing, logging, and… Continue reading on
Is a Career as an ML Operations Engineer Worth Considering in 2026?-IABAC
Medium · Data Science 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
Is a Career as an ML Operations Engineer Worth Considering in 2026?-IABAC
Let’s be brutally honest for a second. Continue reading on Medium »
Escaping the Managed AI Tax: Migrating from Pinecone to Qdrant on Bare Metal
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
Escaping the Managed AI Tax: Migrating from Pinecone to Qdrant on Bare Metal
How to slash your vector database billing, eliminate memory map crashes, and master scalar quantization. Continue reading on Medium »
Architecting Resilient MLOps Pipelines: A Multi-AZ Deployment Strategy on AWS using Kubernetes…
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
Architecting Resilient MLOps Pipelines: A Multi-AZ Deployment Strategy on AWS using Kubernetes…
Project Architecture (AWS Infrastructure Overview) Continue reading on Medium »
What is MLOps?
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1w ago
What is MLOps?
MLOps (Machine Learning Operations) is a set of practices that combines Machine Learning (ML), DevOps, and Data Engineering to develop… Continue reading on Medi
3 AM and the LLM is Down: Why Your AI Infrastructure Needs an Emergency Operations Runbook(AI…
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 2w ago
3 AM and the LLM is Down: Why Your AI Infrastructure Needs an Emergency Operations Runbook(AI…
Most teams focus on building models. Elite teams focus on keeping them alive in production. Here is the operational framework you are… Continue reading on Mediu
Self-Hosting Airflow at Home: Automating Stock Price Data Collection
Medium · Machine Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 2w ago
Self-Hosting Airflow at Home: Automating Stock Price Data Collection
I ran Airflow at home to pull live stock data. Here’s how I did it and what I learned. Continue reading on Analytics Vidhya »
Day 2/100 MLOps: When Configuration Files Weaponize Against Your Team
Medium · Python 🏭 MLOps & LLMOps ⚡ AI Lesson 2w ago
Day 2/100 MLOps: When Configuration Files Weaponize Against Your Team
Yesterday, we built a pristine, isolated Python environment. It was clean. It was standard. Continue reading on Medium »
Day 2/100 MLOps: When Configuration Files Weaponize Against Your Team
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 2w ago
Day 2/100 MLOps: When Configuration Files Weaponize Against Your Team
Yesterday, we built a pristine, isolated Python environment. It was clean. It was standard. Continue reading on Medium »
InfoQ AI/ML 🏭 MLOps & LLMOps ⚡ AI Lesson 2w ago
Slack Eliminates SSH in EMR Pipelines, Migrates 700+ Jobs to Rest-Based Architecture
Slack modernized its data platform by replacing SSH based execution in Amazon EMR pipelines with a REST driven orchestration layer called Quarry. The migration
CI/CD for Machine Learning Projects: The Complete MLOps Guide
Medium · Machine Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 3w ago
CI/CD for Machine Learning Projects: The Complete MLOps Guide
A comprehensive, battle-tested guide to implementing CI/CD pipelines for Machine Learning projects — and why they’re fundamentally… Continue reading on Medium »
CI/CD for Machine Learning Projects: The Complete MLOps Guide
Medium · Python 🏭 MLOps & LLMOps ⚡ AI Lesson 3w ago
CI/CD for Machine Learning Projects: The Complete MLOps Guide
A comprehensive, battle-tested guide to implementing CI/CD pipelines for Machine Learning projects — and why they’re fundamentally… Continue reading on Medium »
The Missing 90% Between “Install Ollama” and Running AI in Production
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 4w ago
The Missing 90% Between “Install Ollama” and Running AI in Production
Every week I see the same pattern: someone posts a 10-minute tutorial installing Ollama, runs a model, asks it a question, and says “done… Continue reading on M
Mlops as a GIF.
Medium · Data Science 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Mlops as a GIF.
Everyone wants to build AI, but only the chosen ones want to scale it. Continue reading on Medium »
Canary Testing ML Migrations with Docker Sandboxes: A Production Pattern for Fearless Pipelines
Medium · Machine Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Canary Testing ML Migrations with Docker Sandboxes: A Production Pattern for Fearless Pipelines
How an MLOps team used ephemeral Docker environments to safely test a car price prediction pipeline — stage by stage — before touching… Continue reading on Medi
EvalForge: The Quality Gate Between AI Output and Production Trust
Medium · LLM 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
EvalForge: The Quality Gate Between AI Output and Production Trust
What if every AI-generated test case, agent response, tool call, and automation decision had to pass a measurable quality gate before… Continue reading on Mediu
Running a PyTorch Model on Triton (alongside onnx) — MLOPs Part 2
Medium · Deep Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Running a PyTorch Model on Triton (alongside onnx) — MLOPs Part 2
In Part 1, We got NVIDIA’s Triton Inference Server running on our local (Mac) with no GPU, and served a DenseNet ONNX model that correctly… Continue reading on
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Day 13: Restoring DVC Data on a Fresh Clone
Welcome back to the 100 Days of MLOps challenge! Continue reading on Medium »
Docker Layer Caching Is Broken in Your ML Project
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Docker Layer Caching Is Broken in Your ML Project
Every time you push a model update, your CI rebuilds from scratch. Fifteen minutes wasted. Again. Docker layer caching is supposed to fix… Continue reading on M
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Mastering MLOps: How to Build Continuous Delivery and Automation Pipelines for Machine Learning
In today’s fast-moving tech landscape, machine learning has moved from experimental projects to core business capabilities. Companies… Continue reading on Mediu
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Day 10: Versioning Data with DVC
Welcome to Day 10 of the 100 Days of MLOps challenge! Continue reading on Medium »
Medium · Machine Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Things I Learned Building an End-to-End ML Pipeline on Kubernetes: From Validated Data to Live…
Part 2 of an MLOps End-to-End series — 60 models, fully automated, one Airflow DAG Continue reading on Medium »
AI Adoption To AI Operations
Medium · LLM 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
AI Adoption To AI Operations
This article covers our team’s recent experiences building observability, evaluation, and operational workflows for production AI systems… Continue reading on I
Beyond Code: Why DevOps Isn’t Enough for Machine Learning in 2026
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Beyond Code: Why DevOps Isn’t Enough for Machine Learning in 2026
The AI Deployment Paradox Continue reading on Medium »
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
MLOps for Beginners: What is MLOps and How It Works in Real-World AI
Published by Nixace Technologies Continue reading on Medium »
CI/CD for GenAI: Shipping Code, Config, and Prompts Together — 6/6
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
CI/CD for GenAI: Shipping Code, Config, and Prompts Together — 6/6
Your pipeline knows how to test code. Here’s what it takes to ship an LLM-powered system with the same confidence. Continue reading on Medium »
CI/CD for Machine Learning: Automating Model Testing, Evaluation, and Deployment
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
CI/CD for Machine Learning: Automating Model Testing, Evaluation, and Deployment
Shipping an ML model feels very different from shipping normal software. You deploy successfully, then spend days watching dashboards… Continue reading on Mediu
Day 2: Set Up and Configure Jupyter Notebook Server | KodeKloud MLOps Journey
Medium · Machine Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Day 2: Set Up and Configure Jupyter Notebook Server | KodeKloud MLOps Journey
As part of my KodeKloud MLOps learning journey, Day 2 focused on setting up and troubleshooting a JupyterLab server configuration for a… Continue reading on Med
Day 2: Set Up and Configure Jupyter Notebook Server | KodeKloud MLOps Journey
Medium · Data Science 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Day 2: Set Up and Configure Jupyter Notebook Server | KodeKloud MLOps Journey
As part of my KodeKloud MLOps learning journey, Day 2 focused on setting up and troubleshooting a JupyterLab server configuration for a… Continue reading on Med
Day 2: Set Up and Configure Jupyter Notebook Server | KodeKloud MLOps Journey
Medium · Python 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Day 2: Set Up and Configure Jupyter Notebook Server | KodeKloud MLOps Journey
As part of my KodeKloud MLOps learning journey, Day 2 focused on setting up and troubleshooting a JupyterLab server configuration for a… Continue reading on Med
Day 2: Set Up and Configure Jupyter Notebook Server | KodeKloud MLOps Journey
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Day 2: Set Up and Configure Jupyter Notebook Server | KodeKloud MLOps Journey
As part of my KodeKloud MLOps learning journey, Day 2 focused on setting up and troubleshooting a JupyterLab server configuration for a… Continue reading on Med
Who Supports Your AI Code When It Crashes at 2 AM?
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Who Supports Your AI Code When It Crashes at 2 AM?
In my early days as a developer at Duke University Medical Center, we had a motto — crude, blunt, and unforgettable: If this code breaks… Continue reading on Da
Day 137-Kyverno apply CLI Command: Finding Failures Early and Saving Cost in AI/MLOps Workloads
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Day 137-Kyverno apply CLI Command: Finding Failures Early and Saving Cost in AI/MLOps Workloads
16th May 2026, Netherlands — In Kubernetes, policies are like safety rules. Continue reading on Medium »
Deploy Once, Host 100 Models with TGI & AI-Aware Load Balancer
Medium · LLM 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Deploy Once, Host 100 Models with TGI & AI-Aware Load Balancer
We live in a world of fast-evolving AI models. There are dozens of them — open-sourced, licensed, huge models with more than 175B… Continue reading on Medium »
Deployment Is Not a Pipeline:
Alias-Driven Local MLOps with MLflow
Medium · Data Science 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Deployment Is Not a Pipeline: Alias-Driven Local MLOps with MLflow
How a small team simplified local MLOps using MLflow aliases, autoserve deployment, and observability. Continue reading on Medium »
Deployment Is Not a Pipeline:
Alias-Driven Local MLOps with MLflow
Medium · DevOps 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
Deployment Is Not a Pipeline: Alias-Driven Local MLOps with MLflow
How a small team simplified local MLOps using MLflow aliases, autoserve deployment, and observability. Continue reading on Medium »
MLOps in production: what nobody tells you before shipping your model to prod.
Medium · Machine Learning 🏭 MLOps & LLMOps ⚡ AI Lesson 1mo ago
MLOps in production: what nobody tells you before shipping your model to prod.
Training a machine learning model is, relatively speaking, the easy part. What comes next — deploying it to production, maintaining it… Continue reading on Medi