Observability on Kubernetes β OpenTelemetry, Grafana Stack
Welcome to the Observability on Kubernetes workshop! This training covers end-to-end observability using OpenTelemetry, Prometheus, Grafana, Loki, Tempo, Mimir, and alerting on Azure AKS.
Materials
OpenTelemetry & Instrumentation
- Observability vs Monitoring β Why classic monitoring is no longer enough
- Czym jest Observability β Observability, OpenTelemetry overview
- Logi β Log formats, best practices, architecture
- Metryki β Metric types, PromQL, naming conventions
- Traceβy β Distributed tracing, spans, sampling
- Profile (eBPF) β Continuous profiling, flame graphs
- Architektura OpenTelemetry β Collector, instrumentation, Grafana Stack
- Cost Optimization β Data volume, sampling, retention, storage costs
Prometheus (separate sidebar group)
- Overview β Architecture, data flow, pull model
- Service Discovery β Kubernetes SD, relabeling, annotations
- Federation β Hierarchical monitoring, cross-datacenter
- Remote Write β Long-term storage, queue tuning, HA
- Native Histograms β Sparse buckets, migration, PromQL
- Naming Conventions β Metric names, labels, base units
- Recording Rules β Pre-aggregation, hierarchical rules
- Internal Mechanisms β TSDB, WAL, compaction, security
- Metric Cardinality β High cardinality, refactoring, analysis
- Push Gateway β When to use, encoding, API
- Exporters β Writing exporters, naming, deployment
Loki (separate sidebar group)
- Overview β Components, deployment modes, chunk format, labels
- LogQL β Index filtering, content filtering, exercises
Tempo (separate sidebar group)
- Overview β Architecture, components, protocols
- Integrations & TraceQL β Metrics generator, traces to logs/profiles, MCP
- Configuration & Deployment β Helm values, Grafana, retention
Exercises
- Loki β Log querying exercises
- Tempo β Distributed tracing exercises
- Prometheus β Metrics and PromQL exercises