Spanner : Google' s Globally Distributed Database

SPANNER
WS2019/2020
Google's Globally-Distributed
Database
BDASEM
Ahmed Amine Mchayaa
Technische Universität
Berlin
Gabor E. Gevay
Supervised by :

Outline
Intro :
What is
Spanner
Architectur
e behind
Spanner
Spanner:
a CA
system
Summary
2

Annual revenue of Google from 2005 to 2018 (in
Billion U.S dollars)
Revenue in billion
U.S. dollars
6.1
Sources: Google; Statista 2019
2005
29.3
2010
74.54
2015
136.22
2018
3
Background

Horizontally
Scaling Database
ADWARDS
ACID
Transactions with
global
consistency
No DownTime
Google needs
4

Globally
distributed
Fully managed,
database service
with global scale
Traditional
relational
semantics:
Schemas, ACID
transaction, SQL
Semi -Relational
Database
Synchronously
replicated
Automatic,
synchronous
replication within
and across regions
for availability
What is Spanner?
5

Architecture OverViewComputeStorage
DB 1
DB n
DB 1
DB n
DB 1
DB n
Zone 1 Zone 2 Zone 3
Regional Instance
6
Sources: Robert K.
Spanner - a fully managed horizontally
scalable relational database...

Architecture Overview
DB 1
DB 2
DB 3
DB 4
DB 5
DB n
Instance
Split 1
Split 2
Split 3
Split 4
Split 5
Split 6
Split 7
Split n
Table 1
Table 2
Table 3
Table 4
Table 5
Table 6
Table 7
Table n
Zone
7
Sources: Robert K.

Architecture OverView
Split 1
Zone 1
ComputeStorage
Split 2
Split 3
Paxos
Group
for Split
1
Split 1
Zone 1
Split 2
Split 3
Split 1
Zone 1
Split 2
Split 3
*TrueTime used for leader leases : There is only one leader for a split at any given time
Regional Instance
8
Sources: Robert K.

• Invented during the creation of Spanner
• Quantifies the « worst » possible error / drift between clocks in all
datacenters arount the world (global clock)
• TrueTime.now() gives you an interval [t1,t2]; t2 = t1+ 2Є
• t1 is guaranteed to be lower than the value of the global clock at
the instant when Now() finishes executing
• T2 is guaranteed to be higher than the value of the global clock at
the instant when Now() starts executing
9
Architecture OverView – True Time

10
Architecture OverView – True Time
Computer NodeComputer NodeComputer Node
Atomic MasterAtomic Master Atomic Master
GPS Master GPS Master GPS Master
Sync every 30 sec
Sync every 30 sec, Synchonization within 50µs, ε guaranteed interval around 2ms

Life of query : Consistent Read
Split 1
Zone 1
Split 2
Split 3
Split 1
Zone 1
Split 2
Split 3
Split 1
Zone 1
Split 2
Split 3
11
Sources: Robert K.
slave leader slave
4. Wait for data /
Response
1. Request
3 No/ Yes
2. Ok
to read

Life of query : Stale Read
Split 1
Zone 1
Split 2
Split 3
Split 1
Zone 1
Split 2
Split 3
Split 1
Zone 1
Split 2
Split 3
12
Sources: Robert K.
slave leader slave
4. Wait for data /
Response
1. Request
(max 15s old )
2. Am I up-
to-date

Life of query : Read
Split 1
Zone 1
Split 2
Split 3
Split 1
Zone 1
Split 2
Split 3
Split 1
Zone 1
Split 2
Split 3
13
Sources: Robert K.
slave
leader slave
3. Query result
4. txn. Bufferwrite
1. txn. Query()
2. acq. locks
5. Write 5. Write
6. ack6. ack 7. Rel. locks

Spanner claims to be consistent and available
CA CP
AP
It is impossible for a
distributed computer
system to simultaneously
provide more than two
out of three of the
following guarantees:
Consistency, Availability,
Partition Tolerance
15
Always able
to read and
Write
Always see the
same data as
others at same
point in time
Works even
in the case
of network
partition
Pick Two !!!

Summary
Read / Write Transactions
What Google Spanner is,
the idea behind and and
what offers as database
The Architecture behind
Google Spanner
16
Spanner a CA/CP
system

David F. Bacon et al. 2017. Spanner: Becoming a SQL System.
In Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD '17). ACM,
New York, NY, USA, 331-343. DOI: https://doi.org/10.1145/3035918.3056103
1
James C. Corbett et al. 2012. Spanner: Google's globally-distributed database
In Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
(OSDI'12). USENIX Association, Berkeley, CA, USA, 251-264.
2
Jeff Shute et al. 2013. F1: a distributed SQL database that scales.
Proc. VLDB Endow. 6, 11 (August 2013), 1068-1079. DOI: http://dx.doi.org/10.14778/2536222.2536232
3
Eric B. 2017. Spanner, TrueTime & The CAP Theorem
VP, Infrastructure, Google. February 14, 2017 :https://ai.google/research/pubs/pub45855
4
Robert K. 2017. Spanner - a fully managed horizontally scalable relational database
DEVOXX Poland (June 2017) [Video], https://www.youtube.com/watch?v=IFbydfGV2lQ
5
References
17

Spanner : Google' s Globally Distributed Database

Recommended

Recommended

More Related Content

Similar to Spanner : Google' s Globally Distributed Database

Similar to Spanner : Google' s Globally Distributed Database (20)

Recently uploaded

Recently uploaded (20)

Spanner : Google' s Globally Distributed Database