TimescaleDB 2026: Hypertables, Continuous Aggregates & Compression

Q: When should I use TimescaleDB vs vanilla PostgreSQL?

Use TimescaleDB when: time-series is your primary access pattern, you'll have >100M rows, you need retention policies and downsampling, you want columnar compression. Use vanilla PostgreSQL when: time isn't the primary dimension, dataset stays under 50M rows, query patterns are random rather than time-range. TimescaleDB adds operational complexity (chunk management, compression policies) — only worth it past the scale threshold.

Q: TimescaleDB vs InfluxDB — which to choose?

TimescaleDB : full SQL, JOINs to relational tables, mature ecosystem (PostgreSQL tooling), Apache 2.0 (community edition). InfluxDB 3 (Apache Arrow + DataFusion under the hood): better at very high cardinality, faster ingest for metrics-only workloads. Default to TimescaleDB if you already use PostgreSQL or need SQL JOINs; InfluxDB if you're metrics-only and need extreme write rates.

Q: How do I deploy TimescaleDB in production?

Three paths: (1) Timescale Cloud — fully managed, multi-region, $0.07/hr starting price. (2) Self-hosted on Kubernetes via Patroni + TimescaleDB extension or the Timescale Helm chart. (3) EC2 + EBS gp3 with manual HA via streaming replication. Production specs: 16+ cores, 64GB+ RAM, NVMe storage for hot chunks. Replicate via streaming (not logical — TimescaleDB extension state requires base backups).

Q: How does TimescaleDB compare to ClickHouse for analytics?

TimescaleDB : SQL-rich, transactional, integrates with PostgreSQL — great for time-series mixed with relational data. ClickHouse : 10-100× faster on huge-scale aggregations, columnar from the ground up, no transactions. Pick TimescaleDB for: IoT / metrics with JOINs to dimensional tables, observability platforms, financial tick + reference data. Pick ClickHouse for: log analytics at TB-PB scale, single-table OLAP, customer-facing dashboards on event data.

TL;DR — TimescaleDB in 60 seconds: TimescaleDB is a PostgreSQL extension that adds time-series superpowers without losing SQL. Three core features: hypertables (auto-partitioned by time + optional space dimension; user sees one table, engine sees thousands of chunks), continuous aggregates (self-refreshing materialized views — pre-compute 1-minute rollups while raw data lives longer), and columnar compression (90-95% size reduction on cold chunks, transparent decompression at query time). For IoT / observability / financial tick / metrics workloads, TimescaleDB beats raw PostgreSQL by 10-100× on time-window queries and stores data 10-20× more cheaply. TimescaleDB 2.18 (2025) added hypercore — a unified rowstore/columnstore engine where the same hypertable holds hot rows + compressed cold columns, with auto-conversion based on chunk age.

TimescaleDB is a PostgreSQL extension that automatically manages time-series data with hypertables, chunk-based storage, and continuous aggregates — without leaving the PostgreSQL ecosystem.

Installation

bash

# Add TimescaleDB to shared_preload_libraries
# postgresql.conf:
# shared_preload_libraries = 'timescaledb'

# Install extension
CREATE EXTENSION IF NOT EXISTS timescaledb;

Create a Hypertable

sql

-- Regular PostgreSQL table
CREATE TABLE metrics (
  time        TIMESTAMPTZ NOT NULL,
  device_id   TEXT NOT NULL,
  cpu_pct     DOUBLE PRECISION,
  mem_pct     DOUBLE PRECISION
);

-- Convert to hypertable (chunks by 1 day)
SELECT create_hypertable('metrics', 'time', chunk_time_interval => INTERVAL '1 day');

Query Performance

sql

-- Time-series queries are automatically chunk-pruned
SELECT time_bucket('5 minutes', time) AS bucket,
       device_id,
       avg(cpu_pct) AS avg_cpu
FROM metrics
WHERE time > now() - INTERVAL '1 hour'
  AND device_id = 'server-01'
GROUP BY bucket, device_id
ORDER BY bucket DESC;

Continuous Aggregates

sql

-- Pre-compute hourly averages
CREATE MATERIALIZED VIEW metrics_hourly
WITH (timescaledb.continuous) AS
SELECT time_bucket('1 hour', time) AS bucket,
       device_id,
       avg(cpu_pct) AS avg_cpu,
       max(cpu_pct) AS max_cpu
FROM metrics
GROUP BY bucket, device_id;

-- Add refresh policy
SELECT add_continuous_aggregate_policy('metrics_hourly',
  start_offset => INTERVAL '3 hours',
  end_offset   => INTERVAL '1 hour',
  schedule_interval => INTERVAL '1 hour');

Data Retention Policy

sql

-- Drop chunks older than 90 days automatically
SELECT add_retention_policy('metrics', INTERVAL '90 days');

-- Check chunks
SELECT show_chunks('metrics');

-- Manual chunk drop
SELECT drop_chunks('metrics', INTERVAL '90 days');

Compression

sql

-- Enable compression on old chunks
ALTER TABLE metrics SET (timescaledb.compress,
  timescaledb.compress_segmentby = 'device_id');

-- Auto-compress chunks older than 7 days
SELECT add_compression_policy('metrics', INTERVAL '7 days');

-- Check compression ratio
SELECT * FROM chunk_compression_stats('metrics');

Key Takeaways

Hypertables automatically partition data into chunks — queries are pruned to relevant chunks
Continuous aggregates pre-compute rollups, making dashboard queries orders of magnitude faster
Use retention policies to automatically drop old chunks — no manual DELETE overhead
Compression reduces storage 10-20x for historical chunks with minimal query performance impact

JusDB Can Help

TimescaleDB is the best choice for time-series data in the PostgreSQL ecosystem. JusDB can design your hypertable schema and continuous aggregate strategy.

Production Patterns: Chunk Sizing, Compression, and Retention

TimescaleDB's defaults are sensible for development but suboptimal for production. The three knobs that matter most — chunk interval, compression schedule, and retention policy — need to be tuned per workload. Getting them right unlocks the 90-95% compression and sub-second query latency TimescaleDB is known for. Getting them wrong creates chunk-explosion (too many small chunks slow planning) or memory pressure (chunks too big to fit in cache).

Sizing the Chunk Interval

The chunk_time_interval determines how much time-series data lives in each chunk. Default is 7 days. Production rule: each uncompressed chunk should fit in ~25% of available RAM. If your buffer pool is 32GB, target chunks of ~8GB. Calculate: estimated rows per day × row size × indexes. For a metrics workload at 10M rows/day with 200-byte rows + 2 indexes (≈ 4GB/day), chunk_time_interval = INTERVAL '2 days' is right. For high-volume IoT at 100M rows/day, chunk_time_interval = INTERVAL '6 hours' is appropriate. Set this at hypertable creation: SELECT create_hypertable('metrics', 'time', chunk_time_interval => INTERVAL '6 hours');. Changing it later only affects new chunks — old ones keep their interval.

Compression Policy Timing

Compression in TimescaleDB is one-way: once a chunk is compressed it becomes effectively read-only (UPDATE/DELETE require decompressing the chunk first, which is expensive). Set the compression policy to wait long enough that the data is unlikely to change. For most observability workloads, 7-14 days is the sweet spot — late-arriving data from agents has typically arrived, dashboards have already cached their queries against hot data, and the compression gains (90-95% size reduction) compound for the years the data sits in the cold tier.

Configure with: SELECT add_compression_policy('metrics', INTERVAL '14 days');. The first run compresses everything older than 14 days; subsequent runs compress newly-eligible chunks daily. The compression itself runs in the background and is throttled, so it won't impact write performance during business hours. For columnar workloads where queries do range scans across compressed chunks, consider SET timescaledb.enable_chunk_skipping = on to skip chunk decompression when min/max metadata says no rows match.

Retention Policies and Multi-Tier Storage

Retention is the third pillar. Hot data lives uncompressed on fast NVMe; warm data lives compressed on the same disk; cold data either gets dropped or migrated to S3 (via the timescaledb_tools backup extension). Configure retention with SELECT add_retention_policy('metrics', INTERVAL '180 days');. For workloads with regulatory retention (financial 7 years, healthcare 6 years), avoid dropping data — instead use TimescaleDB's tiered storage (Timescale Cloud) or manual chunk export to S3 + re-import for queries that need historical data.

Multi-Hypertable Patterns

A common mistake is putting everything in one hypertable. For mixed workloads — high-cardinality metrics + low-cardinality summary tables + dimensional data — use multiple hypertables joined at query time. TimescaleDB handles cross-hypertable JOINs natively; the planner pushes down predicates and chunk-prunes. A typical production setup has 3-5 hypertables: one per data source (sensor_readings, error_events, billing_meters) each with its own chunk interval, compression policy, and retention. This is the pattern we run for nationwide IoT clients processing 5+ billion rows per day.

Continuous Aggregates at Scale

For dashboards, continuous aggregates pre-compute time-bucketed rollups (1-minute, 1-hour, 1-day). The query planner transparently uses them when applicable. Real-time continuous aggregates (TimescaleDB 2.7+) combine pre-computed buckets with newer raw data on-the-fly for zero-lag dashboards. For terabyte-scale deployments, build aggregates hierarchically: 1-second raw → 1-minute aggregate → 1-hour aggregate → 1-day aggregate, with retention policies dropping the lower tiers as data ages. Query the appropriate aggregate based on the dashboard's time range — TimescaleDB's continuous aggregate views handle this dispatch automatically when configured correctly.

Frequently Asked Questions

What is a TimescaleDB hypertable?

A hypertable is a PostgreSQL table that auto-partitions by a time column. The user sees one logical table; under the hood TimescaleDB creates thousands of chunks (child tables) sized by time interval (e.g., 1 day per chunk). Queries with time predicates use chunk pruning to scan only relevant chunks — a query for 'last hour' scans 1 chunk, not 1000. Optionally adds a space partition (e.g., by device_id) for multi-dimensional pruning.

How does TimescaleDB compression work?

TimescaleDB compression converts older row-oriented chunks into columnar storage via the SELECT compress_chunk(...) function or an automatic policy. Compression typically achieves 90-95% size reduction on time-series data (high temporal repetition compresses well). Queries on compressed chunks are transparent — TimescaleDB decompresses on the fly, often with parallel decompression. Tradeoff: compressed chunks are read-only; UPDATE/DELETE requires decompressing the chunk first.

What are continuous aggregates?

Continuous aggregates are materialized views that automatically refresh as new data lands. Use case: pre-compute 1-minute averages on raw 1-second metrics, query the aggregate for dashboards. Refresh runs in the background; queries to the aggregate are O(1) regardless of how much raw data exists. CREATE MATERIALIZED VIEW ... WITH (timescaledb.continuous) is the syntax. New in 2.7+: real-time continuous aggregates combine pre-computed buckets with newer raw data on-the-fly for zero-lag dashboards.

When should I use TimescaleDB vs vanilla PostgreSQL?

Use TimescaleDB when: time-series is your primary access pattern, you'll have >100M rows, you need retention policies and downsampling, you want columnar compression. Use vanilla PostgreSQL when: time isn't the primary dimension, dataset stays under 50M rows, query patterns are random rather than time-range. TimescaleDB adds operational complexity (chunk management, compression policies) — only worth it past the scale threshold.

TimescaleDB vs InfluxDB — which to choose?

TimescaleDB: full SQL, JOINs to relational tables, mature ecosystem (PostgreSQL tooling), Apache 2.0 (community edition). InfluxDB 3 (Apache Arrow + DataFusion under the hood): better at very high cardinality, faster ingest for metrics-only workloads. Default to TimescaleDB if you already use PostgreSQL or need SQL JOINs; InfluxDB if you're metrics-only and need extreme write rates.

What is hypercore in TimescaleDB 2.18?

Hypercore is a unified storage engine: a single hypertable transparently holds rowstore (recent, mutable) chunks and columnstore (older, compressed) chunks. The boundary is managed by policies. Result: writes go to the fast rowstore, queries on old data hit compressed columnar storage — and you don't have to manage two table types. Released in TimescaleDB 2.18 (Dec 2025), now the recommended default for new installs.

How do I deploy TimescaleDB in production?

Three paths: (1) Timescale Cloud — fully managed, multi-region, $0.07/hr starting price. (2) Self-hosted on Kubernetes via Patroni + TimescaleDB extension or the Timescale Helm chart. (3) EC2 + EBS gp3 with manual HA via streaming replication. Production specs: 16+ cores, 64GB+ RAM, NVMe storage for hot chunks. Replicate via streaming (not logical — TimescaleDB extension state requires base backups).

How does TimescaleDB compare to ClickHouse for analytics?

TimescaleDB: SQL-rich, transactional, integrates with PostgreSQL — great for time-series mixed with relational data. ClickHouse: 10-100× faster on huge-scale aggregations, columnar from the ground up, no transactions. Pick TimescaleDB for: IoT / metrics with JOINs to dimensional tables, observability platforms, financial tick + reference data. Pick ClickHouse for: log analytics at TB-PB scale, single-table OLAP, customer-facing dashboards on event data.

TimescaleDB Hypertables, Continuous Aggregates & Compression (2026 Production Guide)

Installation

Create a Hypertable

Query Performance

Continuous Aggregates

Data Retention Policy

Compression

Key Takeaways

JusDB Can Help

Production Patterns: Chunk Sizing, Compression, and Retention

Sizing the Chunk Interval

Compression Policy Timing

Retention Policies and Multi-Tier Storage

Multi-Hypertable Patterns

Continuous Aggregates at Scale

Frequently Asked Questions

Share this article

JusDB Team

Keep reading

ClickHouse Explained (2026): MergeTree, Distributed Engine & Real-Time OLAP

StarRocks vs ClickHouse: Architecture, Table Models & When to Choose Each (2026)

StarRocks Monitoring & Alerting: The Complete Production Guide

Need Expert Help?

PostgreSQL Consulting

PostgreSQL Migration

PostgreSQL Support

PostgreSQL High Availability

PostgreSQL Cloud Migration

PostgreSQL on Kubernetes