Key-value databases in practice Redis @ DotNetToscana

Matteo Baglini www.dotnettoscana.org
Software Developer, Freelance
matteo.baglini@gmail.com
http://it.linkedin.com/in/matteobaglini
http://github.com/bmatte

«Advanced key-value store.
It is often referred to as a
data structure server»

Key                 Value
page:index          <html><head>[...]
user:123:session    xDrSdEwd4dSlZkEkj+
user:123:avatar     77u/PD94bWwgdm+

Everything is a «blob».
Commands primarily GET and SET the values.

Key                 Value                                       Type
page:index          <html><head>[...]                           String
events:timeline     { «Joe logged», «File X Uploaded», …}       List
logged:today        { 1, 2, 3, 4, 5 }                           Set
user:123:profile    time => 10927353, username => bmatte        Hash
game:leaderboard    joe ~ 1.3483, smith ~ 293.45, fred ~ 83.22  Sorted Set

Different «data type/structure»
Rich set of specialized commands

 Everything is stored in memory
 Screamingly fast performance
 Persistent via snapshot or append-only log file
 Replication (only Master/Slave)
 Extensible via embedded scripting engine (Lua)
 Rich set of client libraries
 High availability (In progress)
◦ Cluster (Fault tolerance, Multi-Node consistency)
◦ Sentinel (Monitoring, Notification, Automatic failover)

 Created by Salvatore
Sanfilippo (@antirez)
 First «public release»
in March 2009.
 Since 2010 sponsored
by VMware.

Initially written to improve the performance of LLOOGG,
the web analytics product of his startup.

 Written in ANSI C
 No external dependencies
 Single-threaded (asynchronous evented I/O)
 Works on all POSIX-like systems
 Unofficial builds exist for Windows
 Open-source BSD licensed
 Community (list, IRC & wiki)

1. A DSL for Abstract Data Types.
2. Memory storage is #1.
3. Fundamental data structures for a
fundamental API.
4. Code is like a poem.
5. We're against complexity.
6. Two levels of API.
7. We optimize for joy.

Latest stable version (2.6.*)

Latest unstable version (2.9.7)

Any blob will do
(A value can be at most 512 MB)

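The "any blob" idea can be sketched without a running server. The snippet below is only an in-memory model, not a Redis client: the `store` dictionary and the `set_value`/`get_value` helpers are hypothetical names that mimic what SET and GET do with opaque byte strings.

```python
MAX_VALUE_BYTES = 512 * 1024 * 1024  # the 512 MB limit stated on the slide

store = {}  # hypothetical stand-in for the Redis key space


def set_value(key, value):
    """Store an opaque blob under a key, in the spirit of SET."""
    if len(value) > MAX_VALUE_BYTES:
        raise ValueError("value exceeds 512 MB")
    store[key] = value


def get_value(key):
    """Return the blob stored at key, or None if absent, in the spirit of GET."""
    return store.get(key)


# Text and binary payloads are treated the same way: as bytes.
set_value("page:index", b"<html><head>[...]")
set_value("user:123:avatar", bytes([0xEF, 0xBB, 0xBF]))
print(get_value("page:index"))
```

The server never interprets the payload, which is why HTML, session tokens and encoded images can all live under plain string keys.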

Operations on strings holding an
integer

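A rough model of integer operations on strings, assuming the behaviour of INCR and INCRBY: the value stays a string, a missing key counts as 0, and a non-numeric value is an error. The `store` dictionary and the `incrby` helper are illustrative names, and the 64-bit range check of the real server is not modelled.

```python
store = {}  # hypothetical stand-in for the Redis key space


def incrby(key, amount=1):
    """Parse the stored string as an integer, add amount, store it back as a string."""
    current = int(store.get(key, "0"))  # raises ValueError if the value is not an integer
    new_value = current + amount
    store[key] = str(new_value)
    return new_value


incrby("page:index:views")       # missing key starts from 0, so this returns 1
incrby("page:index:views")       # returns 2
incrby("page:index:views", 10)   # returns 12
print(store["page:index:views"])  # the value is still stored as a string
```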

 Sharing state across processes
◦ Distributed lock, Incremental ID, Time series,
User session.
 Web Analytics
◦ User visit (day, week, month), Feature Tracking.
 Caching
◦ String values can hold arbitrary data.
 Rate limiting
◦ Limit the number of API calls per minute.
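The rate-limiting use case is commonly built from a counter per client and per time window. The sketch below models that idea in memory, assuming a fixed one-minute window; the `counters` dictionary stands in for INCR on a key, and `allow_request`, the key layout and the default limit are all hypothetical choices.

```python
import time

counters = {}  # key -> number of calls seen in that window (stands in for INCR)


def allow_request(client_id, limit=3, now=None):
    """Return True while the client is within its per-minute quota."""
    now = time.time() if now is None else now
    window = int(now // 60)                 # index of the current minute
    key = f"rate:{client_id}:{window}"      # one counter per client and minute
    counters[key] = counters.get(key, 0) + 1
    return counters[key] <= limit


# Four calls in the same minute: the fourth one is rejected.
print([allow_request("api-user", now=0) for _ in range(4)])
# A call in the next minute starts a new window.
print(allow_request("api-user", now=60))
```

On a real server the per-window key would also be given an expiry so that old counters disappear on their own; this sketch never prunes them.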

Any key can be made to expire
after a given interval or at a given time.

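Expiration can be pictured as a deadline stored next to each key. The following is a minimal in-memory sketch, assuming keys are checked lazily when they are read; `values`, `deadlines`, `set_with_ttl` and `get` are hypothetical names, and the `now` parameter exists only to make the example deterministic.

```python
import time

values = {}     # key -> value
deadlines = {}  # key -> absolute time after which the key is gone


def set_with_ttl(key, value, ttl_seconds, now=None):
    """Store a value that expires ttl_seconds after now."""
    now = time.time() if now is None else now
    values[key] = value
    deadlines[key] = now + ttl_seconds


def get(key, now=None):
    """Return the value, or None if the key is missing or has expired."""
    now = time.time() if now is None else now
    if key in deadlines and now >= deadlines[key]:
        values.pop(key, None)   # drop the expired key on access
        deadlines.pop(key)
    return values.get(key)


set_with_ttl("user:123:session", "xDrSdEwd4dSlZkEkj+", ttl_seconds=30, now=0)
print(get("user:123:session", now=10))  # still present
print(get("user:123:session", now=30))  # expired
```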

Sequence of string values
(Max length is 2^32 - 1 elements)

Prevent indefinite growth

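Growth is usually bounded by pushing a new element and then trimming the list to a fixed length, which is what the LPUSH and LTRIM commands allow. Here is a small sketch on a plain Python list; `push_capped` is a hypothetical helper, not a client call.

```python
def push_capped(timeline, event, max_len):
    """Add the newest event at the head and drop everything past max_len."""
    timeline.insert(0, event)  # newest first, like LPUSH
    del timeline[max_len:]     # keep only the first max_len items, like LTRIM 0 max_len-1


timeline = []
for n in range(1, 6):
    push_capped(timeline, f"event {n}", max_len=3)

print(timeline)  # ['event 5', 'event 4', 'event 3']
```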

 Events Store or Notification
◦ Logs, Social Network Timelines, Notifications.
 Fixed Data
◦ Last N activities.
 Message Passing
◦ Durable MQ, Job Queue.
 Circular list

Unordered set of unique values
(Max number of members is 2^32 - 1)

You can compute unions, intersections, and
differences of sets very quickly.

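The three set operations map directly onto Python's built-in `set`, which makes for a quick illustration of what SUNION, SINTER and SDIFF return. `logged_today` reuses the values from the earlier slide; `logged_yesterday` is made up for the example.

```python
logged_today = {1, 2, 3, 4, 5}   # user ids, as in the logged:today example
logged_yesterday = {4, 5, 6}     # hypothetical second set

either_day = logged_today | logged_yesterday   # union, like SUNION
both_days = logged_today & logged_yesterday    # intersection, like SINTER
only_today = logged_today - logged_yesterday   # difference, like SDIFF

print(sorted(either_day), sorted(both_days), sorted(only_today))
```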

 Web Analytics
◦ Unique Page View, IP addresses visiting.
 Relations
◦ Friends, Followers, Tags.
 Caching Result
◦ Store result of expensive intersection of data.

Ordered set of unique values

 Web Analytics
◦ Online users, Most visited pages.
 Leaderboard
◦ Show top N.
 Order by data
◦ Maintain a set of ordered data, such as users by age.

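The "show top N" use case can be sketched with the scores from the game:leaderboard example. This is only a model: a sorted set keeps its members ordered as they are added, whereas the hypothetical `top_n` helper below sorts a plain dictionary on every read.

```python
leaderboard = {"joe": 1.3483, "smith": 293.45, "fred": 83.22}  # member -> score


def top_n(board, n):
    """Return the n highest-scoring (member, score) pairs, best first."""
    ranked = sorted(board.items(), key=lambda item: item[1], reverse=True)
    return ranked[:n]


leaderboard["joe"] = 120.0   # updating a score changes the member's rank
print(top_n(leaderboard, 2))
```

Because members are unique, writing a new score for an existing member moves it within the ranking instead of adding a duplicate.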

Key → Value map (as value)

Set attributes
(Store up to 2^32 - 1 field-value pairs)

 Storing Objects
◦ Hashes are maps between string fields and
string values, so they are the perfect data type
to represent objects.

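Storing an object as a hash amounts to keeping a small field-to-value map under one key. The sketch below models that with nested dictionaries; `hashes`, `hset`, `hget` and `hgetall` are hypothetical helpers named after the corresponding commands, and the profile fields come from the user:123:profile example.

```python
hashes = {}  # key -> {field: value}, all values kept as strings


def hset(key, field, value):
    """Set one field of the hash stored at key."""
    hashes.setdefault(key, {})[field] = str(value)


def hget(key, field):
    """Read one field, or None if the key or field is missing."""
    return hashes.get(key, {}).get(field)


def hgetall(key):
    """Return a copy of the whole object."""
    return dict(hashes.get(key, {}))


hset("user:123:profile", "username", "bmatte")
hset("user:123:profile", "time", 10927353)
print(hget("user:123:profile", "username"))
print(hgetall("user:123:profile"))
```

A single attribute can be read or changed without fetching and rewriting the whole object, which is the main advantage over serializing the object into one string.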

Dump data to disk after certain
conditions are met

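The "certain conditions" are typically pairs of a time interval and a minimum number of changes, written in redis.conf as `save <seconds> <changes>`. The sketch below evaluates such rules; `SAVE_RULES` uses 900/1, 300/10 and 60/10000 purely as example values, and `should_snapshot` is a hypothetical helper rather than server code.

```python
# (seconds, changes): snapshot if at least `changes` writes happened
# and at least `seconds` have passed since the last snapshot.
SAVE_RULES = [(900, 1), (300, 10), (60, 10000)]


def should_snapshot(seconds_since_last_save, changes_since_last_save, rules=SAVE_RULES):
    """Return True if any rule is satisfied."""
    return any(
        seconds_since_last_save >= seconds and changes_since_last_save >= changes
        for seconds, changes in rules
    )


print(should_snapshot(60, 10000))  # busy minute
print(should_snapshot(300, 5))     # too few changes for the 300 s rule, too early for 900 s
print(should_snapshot(900, 1))     # quiet quarter of an hour with one change
```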

 Pros:
◦ RDB is a very compact single-file snapshot.
◦ RDB files are perfect for backups.
◦ RDB is very good for disaster recovery.
◦ RDB allows faster restarts with big datasets.
◦ RDB maximizes performance (background I/O via fork(2)).
 Cons:
◦ RDB is NOT good if you need to minimize the chance of
data loss in case Redis stops working (for example after
a power outage).
◦ Fork can be time consuming if the dataset is very big.

Append all write operations to a log

Durability depends on fsync(2) policy

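To make the append-only idea and the role of fsync concrete, here is a toy log writer and replayer. It assumes a simplified one-command-per-line text format (the real AOF uses the Redis protocol format), and only models the difference between syncing on every write and leaving flushing to the operating system; `append_command` and `replay` are hypothetical names.

```python
import os
import tempfile


def append_command(log, command, fsync_policy):
    """Append one write command; with policy 'always', force it to disk right away."""
    log.write(command + "\n")
    if fsync_policy == "always":
        log.flush()
        os.fsync(log.fileno())
    # with 'everysec' or 'no', syncing would happen later, so a crash may lose recent writes


def replay(path):
    """Rebuild the dataset by re-applying every logged command in order."""
    state = {}
    with open(path) as log:
        for line in log:
            op, key, *rest = line.split()
            if op == "SET":
                state[key] = rest[0]
            elif op == "DEL":
                state.pop(key, None)
    return state


path = os.path.join(tempfile.mkdtemp(), "appendonly.aof")
with open(path, "a") as log:
    for cmd in ["SET a 1", "SET b 2", "DEL a", "SET b 3"]:
        append_command(log, cmd, fsync_policy="always")

recovered = replay(path)
print(recovered)
```

The trade-off sits in `append_command`: syncing on every write is the most durable choice and the slowest, while syncing less often is faster but widens the window of writes that a crash can lose.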

 Pros:
◦ AOF is much more durable.
◦ AOF is an append-only log, so there are no seeks and no corruption
problems (for example after a power outage).
◦ AOF contains a log of all the operations, one after the
other, in an easy to understand and parse format.
 Cons:
◦ AOF files are usually bigger than the equivalent RDB files.
◦ AOF can be slower than RDB, depending on the exact
fsync policy.

 Use both persistence methods if you want a degree of
data safety comparable to what an RDBMS can provide.

 If you care a lot about your data, but can still live with a
few minutes of data loss in case of disasters, you can
simply use RDB alone.

 Many users run AOF alone, but we discourage it: taking an
RDB snapshot from time to time is a great idea for database
backups and for faster restarts.

Rich set of clients

Multiple commands executed as one atomic unit (MULTI/EXEC)

 Classic scenario
◦ Multiple commands executed atomically.
 Optimistic locking
◦ Check-and-set (CAS pattern): write only if the value
has not changed.

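The check-and-set pattern can be illustrated with a version number kept beside each value: a write succeeds only if the version is the one the writer originally read. This is an in-memory sketch of the idea, assuming a per-key version counter; with Redis the same effect is usually obtained with WATCH followed by MULTI/EXEC. `store`, `read` and `write_if_unchanged` are hypothetical names.

```python
store = {"balance": ("100", 0)}  # key -> (value, version)


def read(key):
    """Return the current (value, version) pair."""
    return store[key]


def write_if_unchanged(key, seen_version, new_value):
    """Write only if nobody modified the key since seen_version was read."""
    _, current_version = store[key]
    if current_version != seen_version:
        return False                      # lost the race, caller should retry
    store[key] = (new_value, current_version + 1)
    return True


value, version = read("balance")
store["balance"] = ("90", version + 1)    # simulate a concurrent writer sneaking in
first_try = write_if_unchanged("balance", version, str(int(value) + 10))

value, version = read("balance")          # retry with fresh data
second_try = write_if_unchanged("balance", version, str(int(value) + 10))
print(first_try, second_try, store["balance"])
```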

Publish/Subscribe

Provide 1-N messaging

Subscribe to multiple channels, decoupled
from the key space

Publish on some channel

Subscribers get notified

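The subscribe, publish and notify steps can be sketched as a tiny in-process message hub. It models 1-N delivery only: channels are just names in a dictionary, separate from any stored keys, and messages are not kept for subscribers that join later. `subscribers`, `subscribe` and `publish` are hypothetical names.

```python
subscribers = {}  # channel name -> list of callbacks


def subscribe(channel, callback):
    """Register a callback for every message published on the channel."""
    subscribers.setdefault(channel, []).append(callback)


def publish(channel, message):
    """Deliver the message to all current subscribers and return how many received it."""
    receivers = subscribers.get(channel, [])
    for callback in receivers:
        callback(channel, message)
    return len(receivers)


inbox_a, inbox_b = [], []
subscribe("news", lambda channel, message: inbox_a.append(message))
subscribe("news", lambda channel, message: inbox_b.append(message))
subscribe("alerts", lambda channel, message: inbox_b.append(message))  # one client, two channels

delivered = publish("news", "hello")
print(delivered, inbox_a, inbox_b)
```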

 Message Passing
◦ Distributed message-oriented systems, Event-Driven
Architecture, Service Bus.

One master replicates to multiple slaves

A slave sends the SYNC command and the master
transfers the database file to the slave

Slaves can perform only read operations

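The replication flow described on the last few slides (initial transfer of the dataset, then forwarding of writes, with slaves serving reads only) can be sketched with two small classes. This is a toy in-process model under those assumptions, with hypothetical `Master` and `Slave` classes; it involves no networking and no SYNC protocol.

```python
class Slave:
    """Read-only copy of the master's data."""

    def __init__(self):
        self.data = {}

    def load_snapshot(self, snapshot):
        self.data = dict(snapshot)   # initial full copy, like receiving the database file

    def apply(self, key, value):
        self.data[key] = value       # called by the master only, never by clients

    def get(self, key):
        return self.data.get(key)


class Master:
    """Accepts writes and forwards each one to every attached slave."""

    def __init__(self):
        self.data = {}
        self.slaves = []

    def attach(self, slave):
        slave.load_snapshot(self.data)
        self.slaves.append(slave)

    def set(self, key, value):
        self.data[key] = value
        for slave in self.slaves:
            slave.apply(key, value)


master = Master()
master.set("page:index", "<html>")
slave_1, slave_2 = Slave(), Slave()
master.attach(slave_1)               # catches up through the snapshot
master.attach(slave_2)
master.set("user:123:session", "xDrSdEwd4dSlZkEkj+")
print(slave_1.get("page:index"), slave_2.get("user:123:session"))
```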

 Scalability
◦ Multiple slaves for read-only queries.
 Redundancy
◦ Data replication.
 Slave of Slave
◦ Graph-like structure for more scalability and
redundancy.

Screamingly fast performance

 ~50K read/write operations per
second.
 ~100K read/write ops per second on a
regular EC2 instance.

redis-benchmark tool on an Ubuntu
virtual machine: ~36K requests per second

Application Architecture

Application Server

SQL Server    Redis

«I see Redis definitely more as a flexible tool than
as a solution specialized to solve a specific
problem: his mixed soul of cache, store, and
messaging server shows this very well»

Salvatore Sanfilippo

 http://redis.io/
 http://github.com/antirez/redis
 http://groups.google.com/group/redis-db
