Flame Graphs
Reading Flame Graphs
Flame graphs are the primary visualization for profile data. Understanding how to read them is essential for effective profiling.
Structure
┌──────────────────────────────────────────────────────────┐
│ main.handleRequest()                                     │ ← root (widest)
├──────────────────────────────┬───────────────────────────┤
│ db.QueryContext()            │ json.Marshal()            │
├──────────────┬───────────────┤                           │
│ net.Read()   │ sql.Prepare() │                           │
└──────────────┴───────────────┴───────────────────────────┘
               ↑ top (self time)
- Width = proportion of total samples (time/resources)
- Y-axis = call stack depth (root at the bottom in the classic layout; icicle layouts like the diagram above draw the root at the top)
- Color = typically indicates package/module (varies by tool)
How to Read
- Wide blocks at the top → functions that themselves consume many resources (hot spots)
- Wide blocks at the bottom → functions that call expensive subtrees
- Narrow blocks → functions that contribute little to total resource usage
- Self time vs total time: A function may have high total time (it calls expensive children) but low self time (it doesn't do much work itself)
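The self/total distinction shows up even in a toy Go program (function names here are invented for illustration):

```go
package main

import "fmt"

// hotChild does the real work; in a CPU profile, nearly all samples
// land here, so it carries the self time.
func hotChild(n int) int {
	sum := 0
	for i := 0; i < n; i++ {
		sum += i % 7
	}
	return sum
}

// parent has high *total* time (its entire subtree is hotChild) but
// negligible *self* time: it does almost no work of its own.
func parent(n int) int {
	return hotChild(n)
}

func main() {
	fmt.Println(parent(10)) // prints 24
}
```

In a flame graph of this program, the parent frame would be just as wide as hotChild, but the samples themselves sit in the hotChild block above it.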
Key Patterns
CPU Hot Spot
main.ServeHTTP()                 ← 100% total, 2% self
└── handler.ProcessRequest()     ← 95% total, 5% self
    ├── json.Marshal()           ← 60% total, 60% self  ← HOT SPOT
    └── db.Query()               ← 30% total, 3% self
        └── net.Read()           ← 27% total, 27% self
Action: Optimize json.Marshal() β perhaps use a faster serializer or reduce payload size.
Memory Leak Pattern
heap profile (growing over time):
main.handleRequest()
└── cache.Store()
    └── make([]byte, largeSize)   ← allocations never freed
Action: Check if the cache has eviction logic.
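A minimal size-bounded (LRU) version of such a cache, sketched with container/list; the names mirror the example above but the implementation is hypothetical:

```go
package main

import (
	"container/list"
	"fmt"
)

// entry is what each list element holds.
type entry struct {
	key string
	val []byte
}

// Cache is a size-bounded LRU cache: once maxEntries is reached, the
// least-recently-used entry is evicted, so old allocations become
// garbage-collectable instead of accumulating forever.
type Cache struct {
	maxEntries int
	ll         *list.List               // front = most recently used
	items      map[string]*list.Element // key -> element in ll
}

func NewCache(maxEntries int) *Cache {
	return &Cache{
		maxEntries: maxEntries,
		ll:         list.New(),
		items:      make(map[string]*list.Element),
	}
}

func (c *Cache) Store(key string, val []byte) {
	if el, ok := c.items[key]; ok {
		c.ll.MoveToFront(el)
		el.Value.(*entry).val = val
		return
	}
	c.items[key] = c.ll.PushFront(&entry{key, val})
	if c.ll.Len() > c.maxEntries {
		oldest := c.ll.Back()
		c.ll.Remove(oldest)
		delete(c.items, oldest.Value.(*entry).key) // backing []byte is now collectable
	}
}

func (c *Cache) Get(key string) ([]byte, bool) {
	if el, ok := c.items[key]; ok {
		c.ll.MoveToFront(el)
		return el.Value.(*entry).val, true
	}
	return nil, false
}

func main() {
	c := NewCache(2)
	c.Store("a", make([]byte, 1024))
	c.Store("b", make([]byte, 1024))
	c.Store("c", make([]byte, 1024)) // evicts "a"
	_, ok := c.Get("a")
	fmt.Println(ok) // prints false
}
```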
Lock Contention
mutex profile:
main.handleRequest()
└── cache.(*Cache).Get()      ← shared cache with single lock
    └── sync.(*Mutex).Lock()  ← 80% of mutex wait time
Action: Use a sharded cache or sync.RWMutex.
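A sharded cache along those lines might look like the following sketch (shard count, names, and value types are illustrative):

```go
package main

import (
	"fmt"
	"hash/fnv"
	"sync"
)

const numShards = 16

// shard pairs one map with its own RWMutex.
type shard struct {
	mu sync.RWMutex
	m  map[string][]byte
}

// ShardedCache spreads keys across independent shards, so goroutines
// touching different keys no longer contend on a single lock.
type ShardedCache struct {
	shards [numShards]*shard
}

func NewShardedCache() *ShardedCache {
	c := &ShardedCache{}
	for i := range c.shards {
		c.shards[i] = &shard{m: make(map[string][]byte)}
	}
	return c
}

// shardFor hashes the key to pick its shard deterministically.
func (c *ShardedCache) shardFor(key string) *shard {
	h := fnv.New32a()
	h.Write([]byte(key))
	return c.shards[h.Sum32()%numShards]
}

func (c *ShardedCache) Set(key string, val []byte) {
	s := c.shardFor(key)
	s.mu.Lock()
	defer s.mu.Unlock()
	s.m[key] = val
}

func (c *ShardedCache) Get(key string) ([]byte, bool) {
	s := c.shardFor(key)
	s.mu.RLock() // RWMutex: concurrent readers don't block each other
	defer s.mu.RUnlock()
	v, ok := s.m[key]
	return v, ok
}

func main() {
	c := NewShardedCache()
	c.Set("user:1", []byte("alice"))
	v, _ := c.Get("user:1")
	fmt.Println(string(v)) // prints alice
}
```

Sharding and sync.RWMutex are complementary: sharding reduces contention between writers, while the RWMutex lets reads within one shard proceed in parallel.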
Diff Flame Graphs
Compare two profiles (e.g., before and after a deployment) to find regressions:
- Red = functions that got slower (more samples in the new profile)
- Green/Blue = functions that got faster (fewer samples)
- Grey = unchanged
This is essential for:
- Detecting performance regressions after deployments
- Validating optimization efforts
- Understanding the impact of code changes
Tips
- Start with CPU profiles (most common performance issues)
- Look for unexpectedly wide blocks; they indicate where time is actually spent
- Compare profiles before and after changes using diff view
- Use time range selection in Grafana to focus on specific incidents
- Filter by service name to isolate individual services
- eBPF profiles include both kernel and user-space frames; kernel time (e.g., sys_write) often reveals I/O bottlenecks