Observability Engineering: Metrics, Logs, and Traces

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Observability Engineering: Metrics, Logs, and Traces

Coursera · Intermediate ·📊 Data Analytics & Business Intelligence ·1mo ago
This program explores how observability enables engineers to understand, monitor, and troubleshoot modern distributed systems by using metrics, logs, and traces. You’ll begin by learning the foundational principles of observability, understanding how it differs from traditional monitoring, and exploring the three pillars of observability. Through hands-on demonstrations with Prometheus and Node Exporter, you will learn how system telemetry is collected and how metrics provide visibility into infrastructure and application behavior. You’ll then design reliability-focused metrics strategies using concepts such as Golden Signals, Service-Level Indicators (SLIs), Service-Level Objectives (SLOs), and error budgets. Practical demonstrations show how to collect application metrics, write PromQL queries, and analyze latency and error patterns. You will also explore metrics visualization and alerting by building Grafana dashboards, configuring thresholds, and creating alert rules with Prometheus and Alertmanager to detect operational incidents quickly. Next, you’ll examine centralized logging and distributed tracing, learning how logs and traces provide deeper insight into system behavior. Using Loki, Fluent Bit, OpenTelemetry, and Jaeger, you will explore how logs are aggregated, how requests are traced across microservices, and how engineers analyze service dependencies and request latency. You will also learn how modern observability platforms use AI-powered anomaly detection in Grafana to identify unusual system behavior and support proactive monitoring. By the end of this program, you will be able to: -Explain the principles of observability and differentiate it from monitoring. -Collect and analyze system metrics using Prometheus and PromQL. -Design dashboards and visualizations using Grafana. -Configure alerts and incident notifications using Prometheus and Alertmanager. -Implement centralized logging pipelines using Loki and Fluent Bit. -Instrument distributed sy
Watch on Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Roblox Data Engineering Interview Questions: Full DE Prep Guide
Prepare for Roblox data engineering interviews with a focus on text-heavy product telemetry and search-related questions
Dev.to · Gowtham Potureddi
Tesla Data Engineering Interview Questions: Full DE Prep Guide
Prepare for Tesla data engineering interviews with this comprehensive guide, covering key concepts and practice questions to help you succeed
Dev.to · Gowtham Potureddi
Exodus Point Data Engineering Interview Questions: Full DE Prep Guide
Prepare for Exodus Point data engineering interviews with this comprehensive guide, covering key concepts and practice questions to help you succeed
Dev.to · Gowtham Potureddi
What I learned scraping Website Contact: schema, gotchas and the tooling that worked
Learn how to scrape Website Contact schema and overcome common obstacles with the right tooling
Dev.to · Can Yılmaz
Up next
Build Interactive Sales Dashboards in Excel
Coursera
Watch →