Tech Skills

DevOps & Cloud

CI/CD, Docker, Kubernetes, AWS, GCP, Azure, monitoring and infrastructure as code

1,410
lessons
Skills in this topic
View full skill map →
Linux & CLI
beginner
Navigate the filesystem, manage permissions, and use pipes
Docker & Containers
beginner
Write a production-ready Dockerfile
Cloud Fundamentals
intermediate
Deploy a web app on AWS EC2 or App Engine
Kubernetes
intermediate
Deploy a multi-container app on a k8s cluster
CI/CD Pipelines
intermediate
Build a CI pipeline that runs tests on every PR
Infrastructure as Code
advanced
Provision a full VPC with Terraform
All Reads (955) Articles (501)Blog Posts (330)Tutorials (119)News (5)
What Actually Happens When You Type kubectl apply
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
What Actually Happens When You Type kubectl apply
Kubernetes Internals, API Writes, And Reconciliation Loops Continue reading on Medium »
🌿 Git Mastery: The Complete Developer Guide
Dev.to · Dishon Oketch ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
🌿 Git Mastery: The Complete Developer Guide
From your first commit to advanced branching strategies — everything you need to version control like...
Serverless Server Overhead: A Treasure Hunt to Get Right Before Your Server Scales
Dev.to · pinkie zwane ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Serverless Server Overhead: A Treasure Hunt to Get Right Before Your Server Scales
The Problem We Were Actually Solving Digging deeper, we discovered that our serverless...
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Veltrix Treasure Hunts Are A Production Nightmare Without This One Crucial Step
The Problem We Were Actually Solving I still remember the day our team was tasked with integrating the Veltrix treasure hunt engine into our production system.
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Terraform Locals Changed How I Write Infrastructure Code
The small Terraform concept that quietly made our infrastructure cleaner, safer, and easier to scale Continue reading on Medium »
“Failover Is Not a Strategy.” — An Architect’s Wake-Up Call to Our IROPS Team
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
“Failover Is Not a Strategy.” — An Architect’s Wake-Up Call to Our IROPS Team
Last week I asked a simple question in our architecture review: Continue reading on Medium »
What I Learned From Building a Hytale Server That Didnt Fall Apart
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
What I Learned From Building a Hytale Server That Didnt Fall Apart
The Problem We Were Actually Solving I still remember the first time our Hytale server...
Adventures in Server Burnout: How Our Treasure Hunt Engine Lost Its Cache
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Adventures in Server Burnout: How Our Treasure Hunt Engine Lost Its Cache
The Problem We Were Actually Solving We were trying to use our treasure hunt engine, a...
The contract is the interface: agent-driven Steampipe Stave in one command
Dev.to · Bala Paranj ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The contract is the interface: agent-driven Steampipe Stave in one command
Wiring a new infrastructure source into a security policy engine used to mean writing a custom extractor. We replaced it with a YAML contract per asset type and
My Server Scaling Nightmare: Why Most People Get Veltrix Configuration Wrong
Dev.to · pinkie zwane ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
My Server Scaling Nightmare: Why Most People Get Veltrix Configuration Wrong
The Problem We Were Actually Solving Last year, our team at Mythic Games launched a highly...
A Beginner’s Guide to CI/CD and Cloud Deployment
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
A Beginner’s Guide to CI/CD and Cloud Deployment
Building a software application locally is just the first step. To make your application truly useful, you must store user data safely in… Continue reading on M
A Practical Guide to Not Losing Your Treasure in a Sea of Configuration Choices
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
A Practical Guide to Not Losing Your Treasure in a Sea of Configuration Choices
The Problem We Were Actually Solving We're building a treasure hunt engine for a massive...
I Deployed Netflix's Web Server in 30 Seconds (And So Can You) - Docker Project 1
Dev.to · PETER Samuel ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
I Deployed Netflix's Web Server in 30 Seconds (And So Can You) - Docker Project 1
Introduction What You Will Build Today By the end of this tutorial, you will have: A live web...
The False Sense of Security in Server Health Monitoring
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The False Sense of Security in Server Health Monitoring
The Problem We Were Actually Solving In our pursuit of server health, we were focusing on...
Spot instances as GitHub Actions runners
Dev.to · Khachatur Ashotyan ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Spot instances as GitHub Actions runners
Part 1 was Jenkins as code with ephemeral workers. Part 2 was macOS. This one moves a chunk of the CI...
Linux System Administration: The Complete Field Guide
Medium · Cybersecurity ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Linux System Administration: The Complete Field Guide
The server is down. It is 2 AM. Your phone is lighting up with alerts. You SSH in, fingers already moving from muscle memory, and in the… Continue reading on Me
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Building an Auto Kubernetes Deployment Validator with Python ☸️
A Beginner-Friendly DevOps Project with Real-World Examples Continue reading on Medium »
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Claude Leak Exposed Can We Trust Developer Productivity Metrics?
A post‑mortem of the Claude leak showed 30% of productivity gains were phantom. Learn why your dashboards may be overstating output and how to turn that surpris
How to Maximize Your Dedicated Server: Node.js 22 & PM2 Production Setup
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
How to Maximize Your Dedicated Server: Node.js 22 & PM2 Production Setup
Stop wasting your CPU cores. Learn how to deploy a resilient, multi-core Node.js architecture. Continue reading on Medium »
2 A.M., Two Engineers, and a Database on Fire: A Real SRE War Story
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
2 A.M., Two Engineers, and a Database on Fire: A Real SRE War Story
One query. A production database on its knees. Here’s how we survived it and what it taught me about the gap between knowing systems and… Continue reading on Me
When I Finally Realized My Runtime Was Holding Me Back
Dev.to · pretty ncube ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
When I Finally Realized My Runtime Was Holding Me Back
The Problem We Were Actually Solving I was tasked with optimizing the performance of our...
AI-Era Incident Response on AWS
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
AI-Era Incident Response on AWS
It’s 2:17 a.m. PagerDuty says checkout is failing. You have CloudWatch, Logs Insights, the last deploy in GitHub Actions, and Slack open… Continue reading on Me
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Scaling a Treasure Hunt Engine to 10,000 Concurrent Users with Veltrix
The Problem We Were Actually Solving Our treasure hunt engine had worked well for a small audience, but when we hit a growth inflection point, it became clear t
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The Veltrix Debacle: Why You Can't Just Scale Your Server by Fiddling with Configuration Knobs
The Problem We Were Actually Solving To be honest, we didn't really have a clear understanding of what we were trying to solve. We knew our server was getting s
The Silent Scalability Bottleneck
Dev.to · ruth mhlanga ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The Silent Scalability Bottleneck
The Problem We Were Actually Solving We had just launched a new live scoring feature for...
Medium · Python ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Codes for Linux users
Some of the codes are given below. The non user of Linux can avoid this read. Continue reading on Medium »
AWS DevOps Agent: Deep Dive — Autonomous Incident Investigation and Prevention for Production…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
AWS DevOps Agent: Deep Dive — Autonomous Incident Investigation and Prevention for Production…
This article covers AWS DevOps Agent — the ops-focused autonomous incident agent. Continue reading on Medium »
How to Connect a GoDaddy Domain to Your Vercel App
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
How to Connect a GoDaddy Domain to Your Vercel App
Step 1: Buy Domain on GoDaddy Continue reading on Medium »
Building a GCP-Like Environment on a Home Cluster
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Building a GCP-Like Environment on a Home Cluster
I’ve been running a homelab for about 10 years now. I like the convenience of public clouds, but I don’t trust relying entirely on some… Continue reading on Med
Investigation Reports: When Monitors Get Smarter
Dev.to · Patrick Londa ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Investigation Reports: When Monitors Get Smarter
What if your monitor didn't just fire an alert — it also investigated the issue and delivered a diagnosis before you even reached your keyboard? That's what Inv
It Worked When I Closed the Laptop. I Swear.
Dev.to · Game Dev Notes (Korea) ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
It Worked When I Closed the Laptop. I Swear.
Last night the build was clean. This morning, five red errors before the coffee was even warm. git...
Local Platform Engineering on Your Laptop | vind, Sveltos and ArgoCD
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Local Platform Engineering on Your Laptop | vind, Sveltos and ArgoCD
I’m not new to Kubernetes. Certified my way through most of what the CNCF has to offer. And yet this specific setup, a proper local… Continue reading on Medium
Treasure Hunt Engine: How We Survived the Horror of Default Config
Dev.to · ruth mhlanga ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Treasure Hunt Engine: How We Survived the Horror of Default Config
The Problem We Were Actually Solving At the early stages of our deployment, we focused on...
The Infrastructure Team Is the Real Single Point of Failure
Dev.to · NTCTech ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The Infrastructure Team Is the Real Single Point of Failure
Every serious infrastructure investment goes into redundant hardware, distributed systems, and...
Why we moved from LVM to Ceph for container storage 📦
Dev.to · Flora Brandão ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Why we moved from LVM to Ceph for container storage 📦
Managing storage that is glued to your compute nodes makes scaling a nightmare. We found that LVM...
GitOps Needs Break Glass
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
GitOps Needs Break Glass
Declarative delivery is great. Production incidents do not give rats ass about your philosophy. Continue reading on Medium »
Vibe Coding vs Real Full-Stack Development: The Production Gap
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Vibe Coding vs Real Full-Stack Development: The Production Gap
“Frontend + Backend” is no longer enough to call yourself full-stack. Continue reading on Medium »
Your cron jobs are probably failing silently and you have no idea
Dev.to · SamReid ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Your cron jobs are probably failing silently and you have no idea
Background jobs die quietly. Heartbeat monitoring is the pattern that catches them - how it works, why inverted checks beat ping monitors for scheduled tasks, a
How an expired SSL cert took down our checkout for six hours (and what I should have had watching)
Dev.to · SamReid ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
How an expired SSL cert took down our checkout for six hours (and what I should have had watching)
A post-mortem on a cert expiry outage that an uptime monitor marked green for four hours. What Let's Encrypt auto-renewal actually fails on, and how to properly
The 5 things traditional uptime monitors miss (and how to catch them)
Dev.to · SamReid ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The 5 things traditional uptime monitors miss (and how to catch them)
JavaScript failures, CDN stale cache, hydration errors, visual regressions, and silent cron deaths - the production incidents that return HTTP 200 and fool your
How to build a visual uptime monitor with Go and headless Chrome
Dev.to · SamReid ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
How to build a visual uptime monitor with Go and headless Chrome
Most uptime monitors work by making an HTTP request and checking the response code. It's fast, cheap,...
Understanding Kubernetes RBAC Structure: Roles, Permissions, API Verbs, PATCH Access, and Special…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Understanding Kubernetes RBAC Structure: Roles, Permissions, API Verbs, PATCH Access, and Special…
One of the first things engineers learn in Kubernetes is: Continue reading on Medium »
Is the JVM Dead in the Cloud Era? Migrating Spring Boot to GraalVM Native Images
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Is the JVM Dead in the Cloud Era? Migrating Spring Boot to GraalVM Native Images
I asked myself this question seriously for the first time while staring at a Kubernetes node utilisation dashboard at eleven o’clock on a… Continue reading on M
Stop Learning These DevOps Tools in 2026. They Are Already Dead.
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Stop Learning These DevOps Tools in 2026. They Are Already Dead.
Most DevOps roadmaps on the internet are outdated. The courses are old, the tool recommendations are stale, and nobody tells you which… Continue reading on Medi
The Context Window Is RAM — Why Your Agent's SLIs Are Telling You It's Full
Dev.to · Ajay Devineni ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The Context Window Is RAM — Why Your Agent's SLIs Are Telling You It's Full
The Microsoft team that built the Azure SRE Agent published something in January that I keep coming...
The Day an RDS Database Refused Only One Server
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The Day an RDS Database Refused Only One Server
There are few things more frustrating in infrastructure debugging than a problem that makes absolutely no sense. Continue reading on Medium »
Enterprise Monitoring: Upgrading Zabbix Infrastructure and Integrating Cisco 9800 WLC SNMP…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Enterprise Monitoring: Upgrading Zabbix Infrastructure and Integrating Cisco 9800 WLC SNMP…
Managing an enterprise network infrastructure requires absolute visibility. When you are responsible for monitoring hundreds of switches… Continue reading on Me
NIS2 Article 21 in Azure: Implementing Network Security Controls with Terraform
Dev.to · david ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
NIS2 Article 21 in Azure: Implementing Network Security Controls with Terraform
NIS2 Article 21 in Azure: Implementing Network Security Controls with Terraform Tags: terraform,...