Tech Skills

DevOps & Cloud

CI/CD, Docker, Kubernetes, AWS, GCP, Azure, monitoring and infrastructure as code

1,346 lessons
Skills in this topic
Linux & CLI
beginner
Navigate the filesystem, manage permissions, and use pipes
Docker & Containers
beginner
Write a production-ready Dockerfile
Cloud Fundamentals
intermediate
Deploy a web app on AWS EC2 or App Engine
Kubernetes
intermediate
Deploy a multi-container app on a k8s cluster
CI/CD Pipelines
intermediate
Build a CI pipeline that runs tests on every PR
Infrastructure as Code
advanced
Provision a full VPC with Terraform
All Reads (895) · Articles (464) · Blog Posts (313) · Tutorials (114) · News (4)
I Deployed Netflix's Web Server in 30 Seconds (And So Can You) - Docker Project 1
Dev.to · PETER Samuel ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Introduction What You Will Build Today By the end of this tutorial, you will have: A live web...
The False Sense of Security in Server Health Monitoring
Dev.to · theresa moyo ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
The Problem We Were Actually Solving In our pursuit of server health, we were focusing on...
Spot instances as GitHub Actions runners
Dev.to · Khachatur Ashotyan ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Part 1 was Jenkins as code with ephemeral workers. Part 2 was macOS. This one moves a chunk of the CI...
Linux System Administration: The Complete Field Guide
Medium · Cybersecurity ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
The server is down. It is 2 AM. Your phone is lighting up with alerts. You SSH in, fingers already moving from muscle memory, and in the…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Building an Auto Kubernetes Deployment Validator with Python ☸️
A Beginner-Friendly DevOps Project with Real-World Examples
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Claude Leak Exposed Can We Trust Developer Productivity Metrics?
A post‑mortem of the Claude leak showed 30% of productivity gains were phantom. Learn why your dashboards may be overstating output and how to turn that surpris
How to Maximize Your Dedicated Server: Node.js 22 & PM2 Production Setup
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Stop wasting your CPU cores. Learn how to deploy a resilient, multi-core Node.js architecture.
2 A.M., Two Engineers, and a Database on Fire: A Real SRE War Story
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
One query. A production database on its knees. Here’s how we survived it and what it taught me about the gap between knowing systems and…
When I Finally Realized My Runtime Was Holding Me Back
Dev.to · pretty ncube ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
The Problem We Were Actually Solving I was tasked with optimizing the performance of our...
AI-Era Incident Response on AWS
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
It’s 2:17 a.m. PagerDuty says checkout is failing. You have CloudWatch, Logs Insights, the last deploy in GitHub Actions, and Slack open…
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Scaling a Treasure Hunt Engine to 10,000 Concurrent Users with Veltrix
The Problem We Were Actually Solving Our treasure hunt engine had worked well for a small audience, but when we hit a growth inflection point, it became clear t
Dev.to AI ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
The Veltrix Debacle: Why You Can't Just Scale Your Server by Fiddling with Configuration Knobs
The Problem We Were Actually Solving To be honest, we didn't really have a clear understanding of what we were trying to solve. We knew our server was getting s
The Silent Scalability Bottleneck
Dev.to · ruth mhlanga ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
The Problem We Were Actually Solving We had just launched a new live scoring feature for...
Medium · Python ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Codes for Linux users
Some of the codes are given below. The non user of Linux can avoid this read.
AWS DevOps Agent: Deep Dive — Autonomous Incident Investigation and Prevention for Production…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
This article covers AWS DevOps Agent — the ops-focused autonomous incident agent.
How to Connect a GoDaddy Domain to Your Vercel App
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Step 1: Buy Domain on GoDaddy
Building a GCP-Like Environment on a Home Cluster
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
I’ve been running a homelab for about 10 years now. I like the convenience of public clouds, but I don’t trust relying entirely on some…
Investigation Reports: When Monitors Get Smarter
Dev.to · Patrick Londa ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
What if your monitor didn't just fire an alert — it also investigated the issue and delivered a diagnosis before you even reached your keyboard? That's what Inv
It Worked When I Closed the Laptop. I Swear.
Dev.to · Game Dev Notes (Korea) ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Last night the build was clean. This morning, five red errors before the coffee was even warm. git...
Local Platform Engineering on Your Laptop | vind, Sveltos and ArgoCD
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
I’m not new to Kubernetes. Certified my way through most of what the CNCF has to offer. And yet this specific setup, a proper local…
Treasure Hunt Engine: How We Survived the Horror of Default Config
Dev.to · ruth mhlanga ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
The Problem We Were Actually Solving At the early stages of our deployment, we focused on...
The Infrastructure Team Is the Real Single Point of Failure
Dev.to · NTCTech ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Every serious infrastructure investment goes into redundant hardware, distributed systems, and...
Why we moved from LVM to Ceph for container storage 📦
Dev.to · Flora Brandão ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Managing storage that is glued to your compute nodes makes scaling a nightmare. We found that LVM...
GitOps Needs Break Glass
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Declarative delivery is great. Production incidents do not give rats ass about your philosophy.
Vibe Coding vs Real Full-Stack Development: The Production Gap
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
“Frontend + Backend” is no longer enough to call yourself full-stack.
Your cron jobs are probably failing silently and you have no idea
Dev.to · SamReid ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Background jobs die quietly. Heartbeat monitoring is the pattern that catches them - how it works, why inverted checks beat ping monitors for scheduled tasks, a
How an expired SSL cert took down our checkout for six hours (and what I should have had watching)
Dev.to · SamReid ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
A post-mortem on a cert expiry outage that an uptime monitor marked green for four hours. What Let's Encrypt auto-renewal actually fails on, and how to properly
The 5 things traditional uptime monitors miss (and how to catch them)
Dev.to · SamReid ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
JavaScript failures, CDN stale cache, hydration errors, visual regressions, and silent cron deaths - the production incidents that return HTTP 200 and fool your
How to build a visual uptime monitor with Go and headless Chrome
Dev.to · SamReid ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
Most uptime monitors work by making an HTTP request and checking the response code. It's fast, cheap,...
Understanding Kubernetes RBAC Structure: Roles, Permissions, API Verbs, PATCH Access, and Special…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
One of the first things engineers learn in Kubernetes is:
Is the JVM Dead in the Cloud Era? Migrating Spring Boot to GraalVM Native Images
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 2w ago
I asked myself this question seriously for the first time while staring at a Kubernetes node utilisation dashboard at eleven o’clock on a…
Stop Learning These DevOps Tools in 2026. They Are Already Dead.
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Most DevOps roadmaps on the internet are outdated. The courses are old, the tool recommendations are stale, and nobody tells you which…
The Context Window Is RAM — Why Your Agent's SLIs Are Telling You It's Full
Dev.to · Ajay Devineni ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The Microsoft team that built the Azure SRE Agent published something in January that I keep coming...
The Day an RDS Database Refused Only One Server
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
There are few things more frustrating in infrastructure debugging than a problem that makes absolutely no sense.
Enterprise Monitoring: Upgrading Zabbix Infrastructure and Integrating Cisco 9800 WLC SNMP…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Managing an enterprise network infrastructure requires absolute visibility. When you are responsible for monitoring hundreds of switches…
NIS2 Article 21 in Azure: Implementing Network Security Controls with Terraform
Dev.to · david ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
NIS2 Article 21 in Azure: Implementing Network Security Controls with Terraform Tags: terraform,...
What did our VPC look like at 22:14 (Tue, 2026-05-20) — building clew, a CLI for navigating AWS Config snapshots
Dev.to · nishikawaakira ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
It's 2am. Production is on fire. Someone in the war room asks, "what did the VPC actually look like...
A Data Scientist & AI Engineer's First Contact with Docker and Networking
Medium · Data Science ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
I have always gotten along well with data. But when it came time to ship the product into the real world, I ran into a world I knew nothing about: containers, ports…
The Complete Guide to Running a Midnight Node: Setup, Sync & Monitoring
Dev.to · Uroy Nwankwo ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
If you've been building on Midnight or want to participate in the network without trusting...
HashiCorp built an MCP server for writing Terraform. I built one for reviewing it
Dev.to · Sanjeev Kumar ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
HashiCorp built an MCP server for writing Terraform. I built one for reviewing it. A few...
I built a CLI that eliminates README reading forever
Dev.to · rexrun ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
rex detects your project stack and runs the right command. Zero config. One binary. 12 ecosystems.
When Your Automations Start Growing Up, Your Infrastructure Has to Grow Too
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Most automation projects start small.
Deploying a Scalable Web Architecture on AWS: ALB + EC2 + Nginx
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
A hands-on walkthrough of building a production-ready load-balanced infrastructure from scratch
When a port is already in use, there is no interactive way to find it — so I built `port-peek`
Dev.to · Mu Micro ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
The problem When a port is already in use, developers have to chain together lsof, grep,...
Complete Guide to Diploma in Linux System Administration Courses in Delhi
Medium · AI ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
What is Linux System Administration?
Terraform 1.15 — Everything That Changed and Why It Actually Matters
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
Terraform 1.15 is not a single-headline release. It ships six distinct features that collectively solve problems engineers have complained…
Medium · DevOps ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
I Built a Script That Hardens a Linux Server Automatically — Here’s How
When you spin up a fresh Ubuntu server, it comes insecure by default. Root login is enabled. SSH accepts passwords. There is no firewall…
How a Silent Upstream Dependency Upgrade Broke My Kubernetes Microservice
Medium · Python ☁️ DevOps & Cloud ⚡ AI Lesson 3w ago
It is an experience every software engineer knows all too well: you make a completely unrelated minor enhancement, trigger a standard…