Time Series Data in a Time Series World

© 2014 MapR Technologies
Agenda
• What is a time series?
• Where does it come from?
• What do we need to do with it?
– theoretically
– practically
• How can we do that?
– basics of time series processing
– advanced time series database

What is a Time Series?
• Stuff with timestamps
– sensor measurements
– system stats
– log files
– configuration files
Yes. Really.
• Well, several general categories
– numerical time series (what most people think of)
– events
– non-numerical time series (the strange cases)

Got Time Series?

What Do We Do With Time Series
• Acquire
– Measurement, transmission, reception
• Store
– Individually, or grouped for some amount of time
• Retrieve
– Ad hoc, flexible, correlate and aggregate
• Analyze and visualize
– We facilitate this via retrieval

Acquisition
Not usually our problem
• Sensors
• Data collection – agents, Raspberry Pi
• Transmission – via LAN/WAN, mobile network, satellite
• Receipt into system – listening daemon or queue, or, depending on use case, writing directly to the database

Storage Choices
• Flat files
– Great for rapid ingest with massive data
– Handles essentially any data type
– Less good for data requiring frequent updates
– Harder to find specific ranges
• Traditional RDBMS
– Ingests up to ~10,000 rows / sec; prefers well-structured (numerical) data; expensive
• NoSQL (such as MapR-DB or HBase)
– Easily handles 10,000 rows / sec / node – true linear scaling
– Handles wide variety of data
– Good for frequent updates
– Easily scanned in a range

Retrieval Requirements
• Retrieve by time-series, time range, tags
– Possibly pull millions of data points at a time
– Possibly do on-the-fly windowed aggregations
• Simple querying
– start period, end period, metrics, tags
– REST API for integration
– CLI for testing
• Graphs
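As an illustration of an on-the-fly windowed aggregation over a time range, here is a minimal sketch in Python. The function name `windowed_mean` and the in-memory list of `(timestamp, value)` pairs are assumptions made for the example; they are not part of any particular time series database API.

```python
from collections import defaultdict

def windowed_mean(points, start, end, window):
    """Average (timestamp, value) points into fixed windows within [start, end)."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for ts, value in points:
        if start <= ts < end:
            # Align each point to the start of the window it falls in.
            bucket = start + ((ts - start) // window) * window
            sums[bucket] += value
            counts[bucket] += 1
    return {b: sums[b] / counts[b] for b in sorted(sums)}

# Hypothetical data: timestamps in seconds, one metric.
points = [(0, 1.0), (10, 3.0), (35, 5.0), (70, 7.0)]
result = windowed_mean(points, start=0, end=60, window=30)
```

The point at timestamp 70 falls outside the requested range and is ignored; the remaining points are averaged per 30-second window.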

Specific Example
• Consider a server farm
• Lots of system metrics
• Typically 100-300 stats / 30 s
• Loads, RPCs, packets, requests/s
• Common to have 100 – 10,000 machines

The General Outline
10 samples / second / machine
x 1,000 machines
= 10,000 samples / second
• This is what OpenTSDB was designed to handle
• Install and go, but don’t test at scale

Will it Scale?

Specific Example
• Consider oil drilling rigs
• When drilling wells, there are *lots* of moving parts
• Typically a drilling rig makes about 10K samples/s
• Temperatures, pressures, magnetics,
machine vibration levels, salinity, voltage,
currents, many others
• Typical project has 100 rigs

The General Outline
10K samples / second / rig
x 100 rigs
= 1M samples / second
• But wait, there’s more
– Suppose you want to test your system
– Perhaps with a year of data
– And you want to load that data in << 1 year
• 100x real-time = 100M samples / second

How Does That Work (OpenTSDB on MapR)?
[Diagram: Samples → Message queue → Collector → MapR table → Web service → Users]

Data Storage
Key                      13   43   73   103  …
…
series-uid.time-window   4.5  5.2  6.1  4.9
…
• Typical time window is one hour
• Column names are offsets in time window
• Find series-uid in separate table
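The row layout described above can be sketched as a mapping from a sample's series and timestamp to a row key and a column offset. This is a simplified illustration: the string key format and the name `locate` are assumptions for readability, not the actual binary key encoding used by OpenTSDB.

```python
WINDOW_SECONDS = 3600  # typical time window: one hour

def locate(series_uid, timestamp):
    """Return (row_key, column_offset) for a sample in a windowed wide-row layout."""
    window_start = timestamp - timestamp % WINDOW_SECONDS
    row_key = f"{series_uid}.{window_start}"   # hypothetical key format
    column = timestamp - window_start          # offset in seconds within the window
    return row_key, column

# A sample 13 seconds into the window that starts at t = 7200.
row_key, column = locate("cpu-load", 7213)
```

Because all samples from one series and one hour share a row key, a time range query becomes a scan over a contiguous range of keys.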

Eventual Compaction
Key                      13   43   73   103  blob
…
series-uid.time-window   4.5  5.2  6.1  4.9  {t:[13,43,73,103], v=[4.5,5.2,6.1,4.9]}
…
• Insertion of data as blob makes original columns redundant
• This is the way that TSD should work, not quite how it does work

Eventual Compaction
Key                      blob
…
series-uid.time-window   {t:[13,43,73,103], v=[4.5,5.2,6.1,4.9]}
…
• Converting old data to blobs allows compact storage, faster retrieval
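The compaction step can be sketched as follows, assuming a row is held as a Python dict from column offset to value and the blob is encoded as JSON. Both choices are assumptions for illustration; the slides do not specify the blob encoding beyond the `t` and `v` arrays.

```python
import json

def compact(row):
    """Collapse per-sample columns {offset: value} into one blob with parallel arrays."""
    offsets = sorted(row)
    return json.dumps({"t": offsets, "v": [row[o] for o in offsets]})

def expand(blob):
    """Inverse of compact: recover {offset: value} from a blob."""
    data = json.loads(blob)
    return dict(zip(data["t"], data["v"]))

# The example row from the slide: four samples in one time window.
row = {13: 4.5, 43: 5.2, 73: 6.1, 103: 4.9}
blob = compact(row)
```

Once the blob is written, the individual columns carry no extra information and can be deleted, leaving one cell per row.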

Single Point Loading
• Each sample requires one insertion, compaction requires
another
• Typical performance on a cluster
– 1 edge node + 4 cluster nodes
– Up to 20k samples per second observed
• Suitable for server monitoring
• Not suitable for large scale history ingestion
• 1000x too slow for industrial work

Small Trick … Buffer Data in Memory
[Diagram: Samples → Message queue → Collector → MapR table → Web service → Users; the collector also writes to a Log]
• Web service queries database and collector
• Buffering data for 1 hour in collector allows >1000x performance gain
• Logging latest hour of data allows clean restart of collector (lambda + epsilon architecture)
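A minimal sketch of the buffering idea follows, under stated assumptions: the class `BufferingCollector` is hypothetical, a plain dict stands in for the MapR table, a plain list stands in for the log, and blobs are JSON strings. It shows only the shape of the technique (log every sample, buffer per window, write one blob per completed window), not the actual collector implementation.

```python
import json
from collections import defaultdict

WINDOW_SECONDS = 3600  # buffer one hour per series

class BufferingCollector:
    def __init__(self, table, log):
        self.table = table   # hypothetical stand-in for the database table (dict)
        self.log = log       # hypothetical stand-in for the recovery log (list)
        self.buffer = defaultdict(lambda: {"t": [], "v": []})

    def add(self, series_uid, timestamp, value):
        # Log first, so unflushed samples can be replayed after a restart.
        self.log.append((series_uid, timestamp, value))
        window_start = timestamp - timestamp % WINDOW_SECONDS
        entry = self.buffer[(series_uid, window_start)]
        entry["t"].append(timestamp - window_start)
        entry["v"].append(value)

    def flush(self, now):
        """Write each window that has ended by `now` as a single blob insertion."""
        done = [key for key in self.buffer if key[1] + WINDOW_SECONDS <= now]
        for series_uid, window_start in done:
            entry = self.buffer.pop((series_uid, window_start))
            self.table[f"{series_uid}.{window_start}"] = json.dumps(entry)
        # Keep only log records whose window is still buffered.
        self.log[:] = [(s, t, v) for s, t, v in self.log
                       if (s, t - t % WINDOW_SECONDS) in self.buffer]
        return len(done)

table, log = {}, []
collector = BufferingCollector(table, log)
for ts, v in [(13, 4.5), (43, 5.2), (73, 6.1), (103, 4.9), (3613, 5.0)]:
    collector.add("cpu-load", ts, v)
flushed = collector.flush(now=3700)
```

In this sketch the first hour is written to the table as one row, while the sample from the still-open second hour stays in the buffer and in the log. A query service would need to consult both the table and the collector to see the most recent data.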

Batch Loading
• 3600 samples require one insertion
– No compactions necessary
• Typical performance on SE cluster
– 1 edge node + 4 cluster nodes
– Up to 30 million samples per second per node observed
– ~700x faster ingestion
• Suitable for large scale history ingestion
• 30 million data points retrieved in 20s (in JSON format)
• Ready for industrial work

When is this All Wrong?
• In some cases, retrieval by series-id + time range is not sufficient
• Log files
– May need very flexible retrieval of events based on text-like criteria
• Search may be better than time-series database
– Can scale Lucene based search to > 1 million events / second
• Geo-temporal storage access patterns

Q & A
Engage with us!
@kingmesal maprtech
jsccot@mapr.com
MapR
maprtech
mapr-technologies
