InfluxDB Internals

InﬂuxDB Internals
Platform Engineering Team 
@ryanbetts / ryan@inﬂuxdata.com

How great are databases?
• I like making things with
smart, clever, kind
people.

• I’ve been working on
high-throughput, realtime
data for the last 10 years.

• What’s so special about time series

• Time series database designs

• InﬂuxDB internals

RDBMS NoSQL TSDB
Correctness ACID BASE BASE
Schema DDL
DDL /
documents
on-write
Writing data DML POST/PUT line protocol
Reading data SQL GET + ﬁlter
ﬁlter, window,
group, join

TSDB unique combination
• Ingest: thousands to millions of points per second

• Store: fast accumulating, append-mostly data, lots of
repetition, often with time-to-live

• Query: analytic queries with fast ﬁltering, windowing

• Scale: availability, storage, query

Facebook Gorilla
• TTL eviction

• Columnar compression

• Write availability > query
correctness

• Metric-based schema

• Separate query processing
from access-path

Druid
• Roll-up at ingest

• Columnar storage &
time-based segments

• Indexes on dimension for
fast ﬁltering

• Separation of real time
and historical data nodes

Bullet Journals
• Fast event
recording

• Ordered by time

• Indexed by
dimensions

• Weekly / Monthly
roll-up

InﬂuxDB
1.Write Path

2.Storage

3.Query Path

4.Clustering

InﬂuxDB: Adding data (1)
POST ’http://localhost:8086/write?db=mydb' --data-
binary 'cpu_load_short,host=server01,region=us-west
value=0.64 1434055562000000000’

InﬂuxDB: Adding data (2)
fsync( ) batch to
WAL
Add to in-
memory cache
Snapshot
cache to TSM
Add to index

InﬂuxDB: on-disk (ﬁlesystem)
CREATE RETENTION POLICY <retention_policy_name> ON
<database_name> DURATION <duration> REPLICATION <n> [SHARD
DURATION <duration>] [DEFAULT]
Database directory /db
Retention Policy directory /db/rp
Shard Group (time bounded) (Logical)
Shard directory (db/rp/Id#)
TSM0001.tsm (data file)
TSM0002.tsm (data file)

InﬂuxDB: Adding data (DB)
fsync( ) batch to
WAL
Add to in-
memory cache
Snapshot to
TSM
Add to index

InﬂuxDB: Adding data (index)
• Measurement name -> ﬁeld keys

• Measurement name -> series

• Measurement name -> tag keys -> tag value -> series

• Series -> shards

• (Also sketches of series and measurements for fast
cardinality estimation)

InﬂuxDB: TSI
• Roaring-bitmaps to short-
cut series creation on insert

• Iterators for index
mappings

• Index is per-shard; series id
ﬁle is per-database

• Partitioned for lock-splitting

InfluxDB: InfluxQL Queries
1. Parses time range and expressions for filtering data
2. Look-up shards to access using the list of
measurements and the time frame
3. Create the iterators for each shard
4. Merge the shard iterator outputs
select user, system from cpu
where time > now() - 1h and host = 'serverA

InfluxQL: Query with IFQL
1. Stand-alone ìfqld` coordinator nodes

2. Streaming storage iterators that support rate-limits

3. Separation of query planning and query distribution

4. Extensible, functional language

5. Unification of InfluxQL and TICKScript

A brief sidebar on
append-mostly
databases
No one tells you about:
* Wrong data

* Old (back-ﬁlled) data

InfluxDB Clustering
• Strongly consistent meta-cluster (based on RAFT)

• User configured replication factor

• Replication and shard aware query planner

• Hinted-Handoff queues on each data node

• (WIP) Anti-entropy consistency repair

Conclusions
• Time series data has unique storage and query
requirements that impact database design. 
• Evolution of InﬂuxDB:

1. TSI: remove the in-memory size limit on cardinality
2. IFQL: faster feature velocity; safer execution.

3. Anti-entropy repair: easier, more robust scale-out.

InfluxDB Internals

More Related Content

What's hot

Similar to InfluxDB Internals

More from InfluxData

InfluxDB Internals