Database Observability: Metrics, Logs, Traces, and Prometheus Alerting

Database observability goes beyond monitoring. It means having enough context to understand why the database is slow — not just that it is slow. Here is how to build a complete observability stack.

The Three Pillars

Metrics: aggregated numbers over time (TPS, latency, cache hit rate)
Logs: timestamped records of individual events (slow queries, errors)
Traces: end-to-end request paths showing time spent in each layer

Key Metrics to Track

text

PostgreSQL:
  - transactions per second (tps)
  - cache hit ratio (from pg_stat_bgwriter)
  - replication lag (pg_stat_replication.replay_lag)
  - table/index bloat (n_dead_tup)
  - connection count (pg_stat_activity)

MySQL:
  - Questions/Queries per second
  - InnoDB buffer pool hit ratio
  - Threads_running vs Threads_connected
  - Innodb_row_lock_waits
  - Seconds_Behind_Master

Prometheus + postgres_exporter

bash

# Run postgres_exporter
docker run -e DATA_SOURCE_NAME='postgresql://postgres:pass@localhost:5432/postgres?sslmode=disable' \
  quay.io/prometheuscommunity/postgres-exporter

# Key metrics exposed:
# pg_stat_database_tup_fetched
# pg_stat_bgwriter_buffers_alloc
# pg_replication_lag
# pg_stat_user_tables_n_dead_tup

Prometheus Alert Rules

yaml

groups:
  - name: postgres
    rules:
    - alert: PostgreSQLHighReplicationLag
      expr: pg_replication_lag > 30
      for: 2m
      labels:
        severity: warning
      annotations:
        summary: PostgreSQL replication lag > 30s

    - alert: PostgreSQLLowCacheHitRate
      expr: |
        pg_stat_database_blks_hit /
        (pg_stat_database_blks_hit + pg_stat_database_blks_read) < 0.95
      for: 5m
      labels:
        severity: critical

Distributed Tracing with OpenTelemetry

python

from opentelemetry import trace
from opentelemetry.instrumentation.psycopg2 import Psycopg2Instrumentor

# Auto-instrument database calls
Psycopg2Instrumentor().instrument()

# Every SQL call now appears in your trace with:
# - SQL query text
# - Execution time
# - Row counts
# - Connection attributes

Key Takeaways

Instrument metrics, logs, and traces — each answers different diagnostic questions
Set up Prometheus alerts for replication lag, low cache hit rate, and connection count spikes
OpenTelemetry auto-instrumentation adds database spans to distributed traces with zero code changes
Slow query logs are the most actionable log — ensure they are enabled and shipped to your log aggregator

JusDB Can Help

Building a complete database observability stack is complex. JusDB can design and implement monitoring, alerting, and tracing for your database layer.

Keep reading

Database SRE

Ola Hallengren's SQL Server Maintenance Solution: Production Setup Guide

Production setup of Ola Hallengren's SQL Server Maintenance Solution: the four jobs that matter, FULL/DIFF/LOG backup cadence for your RPO, DBCC CHECKDB scheduling, IndexOptimize tuning, encryption, and CommandLog-based alerting.

SQL Server13 minMay 27, 2026

Read

Database SRE

PostgreSQL Monitoring with Prometheus and postgres_exporter: A Production Guide

Set up PostgreSQL monitoring with Prometheus and postgres_exporter. Includes install steps, critical alert rules, Grafana dashboard panels, and custom query metrics.

PostgreSQL10 minMar 5, 2026

Read

Database SRE

PostgreSQL 16: New Features Every DBA Should Know

PostgreSQL 16 introduced logical replication from standbys, pg_stat_io, SQL/JSON constructors, COPY improvements, and pg_stat_checkpointer. Full DBA upgrade guide.

PostgreSQL10 minMar 5, 2026

Read

Database Observability: Metrics, Logs, Traces, and Prometheus Alerting

The Three Pillars

Key Metrics to Track

Prometheus + postgres_exporter

Prometheus Alert Rules

Distributed Tracing with OpenTelemetry

Key Takeaways

JusDB Can Help

Share this article

JusDB Team

Keep reading

Ola Hallengren's SQL Server Maintenance Solution: Production Setup Guide

PostgreSQL Monitoring with Prometheus and postgres_exporter: A Production Guide

PostgreSQL 16: New Features Every DBA Should Know

Need Expert Help?

PostgreSQL Consulting

PostgreSQL Migration

PostgreSQL Support

PostgreSQL High Availability

PostgreSQL Cloud Migration

PostgreSQL on Kubernetes