Tech Skills

DevOps & Cloud

CI/CD, Docker, Kubernetes, AWS, GCP, Azure, monitoring and infrastructure as code

1,487
lessons
Skills in this topic
View full skill map →
Linux & CLI
beginner
Navigate the filesystem, manage permissions, and use pipes
Docker & Containers
beginner
Write a production-ready Dockerfile
Cloud Fundamentals
intermediate
Deploy a web app on AWS EC2 or App Engine
Kubernetes
intermediate
Deploy a multi-container app on a k8s cluster
CI/CD Pipelines
intermediate
Build a CI pipeline that runs tests on every PR
Infrastructure as Code
advanced
Provision a full VPC with Terraform
All Reads (1,026) Articles (548)Blog Posts (340)Tutorials (133)News (5)
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Need help troubleshooting understanding a GitHub Actions cache miss pattern in a monorepo
Need help troubleshooting understanding a GitHub Actions cache miss pattern in a monorepo Quest Best Tech-Category Response Original AgentHansa Help Thread Requ
How a Tiny Go Cache Cut Our Redis Bill by $4,000/Month
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
How a Tiny Go Cache Cut Our Redis Bill by $4,000/Month
The Night Redis Started Breaking Everything Continue reading on Medium »
SSH Login Taking Forever? Check Your DNS Settings
Dev.to · Schiff Heimlich ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
SSH Login Taking Forever? Check Your DNS Settings
A simple fix for slow SSH connections caused by DNS lookups
Payment Events at Scale: Building a Robust Kafka Event Bus  for a B2B Payment Platform
Medium · Python ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Payment Events at Scale: Building a Robust Kafka Event Bus for a B2B Payment Platform
 FREE full access on: LovinData — Simplified Full Stack Data Engineering Continue reading on Medium »
Migration of Intercontinental VM (USA Region > Australia Region) using Storage Snapshot through…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Migration of Intercontinental VM (USA Region > Australia Region) using Storage Snapshot through…
In this real-world based project, I acted as a Cloud Specialist in a project to migrate application and database into an intercontinental… Continue reading on M
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
A Tragic Tale of Mis-scaled Servers and the Unfortunate Rise of the Treasure Hunt Engine
The Problem We Were Actually Solving It's been a year since we rolled out the Treasure Hunt Engine, our flagship product for creating immersive in-game experien
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The Linux Guide for DevOps
You have mastered Git, you understand deployment pipelines, and you can confidently package applications using Docker. But there is still… Continue reading on M
Kubernetes Deployment with GitOps and FluxCD
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Kubernetes Deployment with GitOps and FluxCD
In this workshop, we’ll explore how to deploy and manage a Kubernetes cluster using GitOps and FluxCD. In a previous article, I covered… Continue reading on Med
The Modern DevSecOps Engineering Stack (2026 Edition): From First Commit to Production
Dev.to · Aturo Phil ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
The Modern DevSecOps Engineering Stack (2026 Edition): From First Commit to Production
Here's a hard truth I learnt after watching a production database get wiped by a leaked .env file:...
Docker Is Not What I Thought It Was
Medium · Programming ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Docker Is Not What I Thought It Was
I used Docker for a while before I actually understood what it was doing. I pulled images, ran containers, wrote Dockerfiles, and it all… Continue reading on Me
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
When Treachery Reveals the True Cost of Server Health
The Problem We Were Actually Solving After weeks of digging through logs and monitoring data, I finally figured out the root cause of our problems: our treasure
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Veltrix Operator Nightmare: How I Learned to Stop Worrying and Love the Failures
The Problem We Were Actually Solving I was tasked with integrating the Veltrix treasure hunt engine into our growing server infrastructure, and from the start,
Six Months Ago Kubernetes Retired Ingress NGINX. An 18-Year-Old Bug Just Made That a Crisis.
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Six Months Ago Kubernetes Retired Ingress NGINX. An 18-Year-Old Bug Just Made That a Crisis.
NGINX Rift is critical. Unpatched in the abandoned controller that half of cloud native runs. The only fix you can buy comes from a vendor… Continue reading on
The One Test That Never Fails (But Is Still Worth Writing)
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
The One Test That Never Fails (But Is Still Worth Writing)
On testing configuration, contracts, and startup conditions — not just business logic. Continue reading on Medium »
My CI/CD Architecture
Dev.to · Akash Santra ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
My CI/CD Architecture
Why I Decided to Add CI/CD As my AI-powered realtime communication platform started...
The AWS Service Quotas That Will Take Down Your Production at 3 AM (And You Cannot Raise Them Fast…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
The AWS Service Quotas That Will Take Down Your Production at 3 AM (And You Cannot Raise Them Fast…
Hard limits, scaling lags, and the architectural walls that no support ticket can fix. Continue reading on Medium »
I stopped uploading my files to random websites and built my own tools instead
Dev.to · PureTools ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
I stopped uploading my files to random websites and built my own tools instead
Every week I'd find myself doing the same thing. Googling "compress PDF Every week I'd find myself...
Cortex vs VictoriaMetrics: Why Scalable Prometheus Is Not Always the Best Prometheus
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Cortex vs VictoriaMetrics: Why Scalable Prometheus Is Not Always the Best Prometheus
A few years ago, observability was relatively simple. Most teams had a Prometheus instance scraping metrics from a handful of services… Continue reading on Medi
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Day 12: Linux Network Services | 100 Days of Devops
Guide for 12th task of 100 Days of Devops from KodeKloud Continue reading on Medium »
Navigating the Hidden Dangers of Server Growth with Treasure Hunt Engine
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Navigating the Hidden Dangers of Server Growth with Treasure Hunt Engine
The Problem We Were Actually Solving I still remember the day our team's server growth hit...
Auto versioning + changelog generation using Github Action
Dev.to · Kyle Y. Parsotan ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Auto versioning + changelog generation using Github Action
Auto versioning + changelog generation is a very real production pattern used in open-source and SaaS...
Observability in 2026: Distributed Tracing Replaced Logs, and OpenTelemetry Won
Dev.to · ZNY ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Observability in 2026: Distributed Tracing Replaced Logs, and OpenTelemetry Won
Observability in 2026: Distributed Tracing Replaced Logs, and OpenTelemetry Won The...
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Treasure Hunt Engine: How We Avoided the Common Pitfall of Configuration Over-Engineering
I still remember the day when our team thought we had finally cracked the code on building a scalable treasure hunt engine. We had implemented a shiny new AI mo
What Actually Happens When You Type kubectl apply
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
What Actually Happens When You Type kubectl apply
Kubernetes Internals, API Writes, And Reconciliation Loops Continue reading on Medium »
🌿 Git Mastery: The Complete Developer Guide
Dev.to · Dishon Oketch ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
🌿 Git Mastery: The Complete Developer Guide
From your first commit to advanced branching strategies — everything you need to version control like...
Serverless Server Overhead: A Treasure Hunt to Get Right Before Your Server Scales
Dev.to · pinkie zwane ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Serverless Server Overhead: A Treasure Hunt to Get Right Before Your Server Scales
The Problem We Were Actually Solving Digging deeper, we discovered that our serverless...
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Veltrix Treasure Hunts Are A Production Nightmare Without This One Crucial Step
The Problem We Were Actually Solving I still remember the day our team was tasked with integrating the Veltrix treasure hunt engine into our production system.
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Terraform Locals Changed How I Write Infrastructure Code
The small Terraform concept that quietly made our infrastructure cleaner, safer, and easier to scale Continue reading on Medium »
“Failover Is Not a Strategy.” — An Architect’s Wake-Up Call to Our IROPS Team
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
“Failover Is Not a Strategy.” — An Architect’s Wake-Up Call to Our IROPS Team
Last week I asked a simple question in our architecture review: Continue reading on Medium »
What I Learned From Building a Hytale Server That Didnt Fall Apart
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
What I Learned From Building a Hytale Server That Didnt Fall Apart
The Problem We Were Actually Solving I still remember the first time our Hytale server...
Adventures in Server Burnout: How Our Treasure Hunt Engine Lost Its Cache
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Adventures in Server Burnout: How Our Treasure Hunt Engine Lost Its Cache
The Problem We Were Actually Solving We were trying to use our treasure hunt engine, a...
The contract is the interface: agent-driven Steampipe Stave in one command
Dev.to · Bala Paranj ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
The contract is the interface: agent-driven Steampipe Stave in one command
Wiring a new infrastructure source into a security policy engine used to mean writing a custom extractor. We replaced it with a YAML contract per asset type and
My Server Scaling Nightmare: Why Most People Get Veltrix Configuration Wrong
Dev.to · pinkie zwane ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
My Server Scaling Nightmare: Why Most People Get Veltrix Configuration Wrong
The Problem We Were Actually Solving Last year, our team at Mythic Games launched a highly...
A Beginner’s Guide to CI/CD and Cloud Deployment
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
A Beginner’s Guide to CI/CD and Cloud Deployment
Building a software application locally is just the first step. To make your application truly useful, you must store user data safely in… Continue reading on M
A Practical Guide to Not Losing Your Treasure in a Sea of Configuration Choices
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
A Practical Guide to Not Losing Your Treasure in a Sea of Configuration Choices
The Problem We Were Actually Solving We're building a treasure hunt engine for a massive...
I Deployed Netflix's Web Server in 30 Seconds (And So Can You) - Docker Project 1
Dev.to · PETER Samuel ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
I Deployed Netflix's Web Server in 30 Seconds (And So Can You) - Docker Project 1
Introduction What You Will Build Today By the end of this tutorial, you will have: A live web...
The False Sense of Security in Server Health Monitoring
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
The False Sense of Security in Server Health Monitoring
The Problem We Were Actually Solving In our pursuit of server health, we were focusing on...
Spot instances as GitHub Actions runners
Dev.to · Khachatur Ashotyan ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Spot instances as GitHub Actions runners
Part 1 was Jenkins as code with ephemeral workers. Part 2 was macOS. This one moves a chunk of the CI...
Linux System Administration: The Complete Field Guide
Medium · Cybersecurity ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Linux System Administration: The Complete Field Guide
The server is down. It is 2 AM. Your phone is lighting up with alerts. You SSH in, fingers already moving from muscle memory, and in the… Continue reading on Me
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Building an Auto Kubernetes Deployment Validator with Python ☸️
A Beginner-Friendly DevOps Project with Real-World Examples Continue reading on Medium »
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Claude Leak Exposed Can We Trust Developer Productivity Metrics?
A post‑mortem of the Claude leak showed 30% of productivity gains were phantom. Learn why your dashboards may be overstating output and how to turn that surpris
How to Maximize Your Dedicated Server: Node.js 22 & PM2 Production Setup
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
How to Maximize Your Dedicated Server: Node.js 22 & PM2 Production Setup
Stop wasting your CPU cores. Learn how to deploy a resilient, multi-core Node.js architecture. Continue reading on Medium »
2 A.M., Two Engineers, and a Database on Fire: A Real SRE War Story
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
2 A.M., Two Engineers, and a Database on Fire: A Real SRE War Story
One query. A production database on its knees. Here’s how we survived it and what it taught me about the gap between knowing systems and… Continue reading on Me
When I Finally Realized My Runtime Was Holding Me Back
Dev.to · pretty ncube ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
When I Finally Realized My Runtime Was Holding Me Back
The Problem We Were Actually Solving I was tasked with optimizing the performance of our...
AI-Era Incident Response on AWS
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
AI-Era Incident Response on AWS
It’s 2:17 a.m. PagerDuty says checkout is failing. You have CloudWatch, Logs Insights, the last deploy in GitHub Actions, and Slack open… Continue reading on Me
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
Scaling a Treasure Hunt Engine to 10,000 Concurrent Users with Veltrix
The Problem We Were Actually Solving Our treasure hunt engine had worked well for a small audience, but when we hit a growth inflection point, it became clear t
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
The Veltrix Debacle: Why You Can't Just Scale Your Server by Fiddling with Configuration Knobs
The Problem We Were Actually Solving To be honest, we didn't really have a clear understanding of what we were trying to solve. We knew our server was getting s
The Silent Scalability Bottleneck
Dev.to · ruth mhlanga ☁️ DevOps & Cloud ⚡ AI Lesson 4w ago
The Silent Scalability Bottleneck
The Problem We Were Actually Solving We had just launched a new live scoring feature for...