Transaction

A question for you?
What isolation level you have in
your (operational) database?

If you are using Oracle and
isolation level is
READ COMMITTED
then the following anomalies
can happen:

In database systems many things can go
wrong:
Q uer y + update Q uer y + update

Can transaction help
to simplify these issues?

rea
dwrite
rea
dwrite
What is transaction?

When dealing with
transactions it is important to
understand exactly what
safety guarantees it can
provide and associated cost.

A C I D
about handling faults (crashes)

Update listing
set buyer = John
Insert into
invoices …
Ok
TimeDB
Alice

Any of these fault can happen:
 Crash/power failure
 Deadlock
 Network fault
 Constraint violation
 …..

Update listing
set buyer = John
Insert into
invoices …
Ok
Time
Alice
DB

Transactions =
Multi-objects atomicity
Roll back writes on abort

Any of these fault can happen:
 Crash/power failure
 Deadlock
 Network fault
 Constraint violation
 …..
Abort

Alice
Bob
Get counter
42
Get counter
42
[ 42 + 1 = 43 ]
[ 42 + 3 = 45 ]
Set counter 43
OK
Set counter 45
OK
Time
DB

Isolation is the trickiest part,
more on this in later slides.

Durability means the promise
that any data is committed
successfully (that it has written)
during the transaction will not be
forgotten even if database
crashes.

Durability
Archive Tape fsync to Disk Replication

Isolation levels
Serializable
Repeatable Read Snapshot Isolation
Read committed
Read uncommitted
“Weaker than”

Serializable
Serializable isolation means that the
database guarantees that transactions
have the same effect as if they ran
serially, i.e. one at a time, without any
concurrency.

Serializability
vs
Linearizability

Linearizability Serializability
It is a guarantee about single
operations on single objects.
It provides a real-time (i.e.,
wall-clock) guarantee on the
behavior of a set of single
operations (often reads and
writes) on a single object
(e.g., distributed register or
data item).
It is a guarantee about
transactions, or groups of one or
more operations over one or
more objects.
It guarantees that the execution of
a set of transactions (usually
containing read and write
operations) over multiple items is
equivalent to some serial
execution (total ordering) of the
transactions.

Serializable isolation has a
performance cost
and many databases don’t want
to pay that price.

HAT, not CAP: Towards Highly Available Transactions
Peter Bailis et al, 2013

READ COMMITTED
Most basic level transaction isolation.
It makes two guarantees:
No dirty reads No dirty writes
Default in Postgres, Oracle, and SQL Server

Dirty reads
Alice
Bob
DB
TimeGet X
2
Set X = 3
Ok
Get X
3
Commit
Dirty Read

No dirty reads
Alice
Bob
DB
Time
Get X
2
Set X = 3
Ok
2
Commit
3
Get X Get X

Dirty writes
Alice
Bob
Lis ting
Time
Update listing set
buyer = Alice where
id = 1
Ok
Invoic e
Update listing set
buyer = Bob
where id = 1
O
k
Update invoice set
recipient = Bob where
listing_id = 1
O
k
Update invoice set
recipient = Alice where
listing_id = 1
Ok
Commit
Commit

Read skew can happen under read committed
X = 500
Y= 500
Backup
Set X = X - 100
Ok
Set Y = Y + 100
Ok
Get Y
500
Commit
400
600
Get X
400
400 + 600 = 1000
500 + 400 = 900
Account 1
Account 2
Alice
Time

Read skew can be prevented by
Snapshot Isolation/MVCC
Postgres “Repeatable Read”
MySQL “Repeatable Read” – for read only transactions
SQL Server “Snapshot”
Oracle “Serializable”

Snapshot Isolation
Snapshot Isolation is a guarantee that all reads
made in a transaction will see a consistent
snapshot of the database (in practice it reads the
last committed values that existed at the time it
started), and the transaction itself will
successfully commit only if no updates it has
made conflict with any concurrent updates made
since that snapshot.
Snapshot isolation is a popular feature: it is supported by PostgreSQL, MySQL with the InnoDB storage engine, Oracle, SQL
Server, and more.

In Snapshot Isolation
readers never block writers
writers never block readers.

Alice
Bob
DB
Time
Get X 2
Lost update problem
Get X 2
Set X = 4
Ok
Set X = 6
X = 2
Ok
Here Alice’s change
is lost.

Solutions to lost update problem
 Atomic Operations
 Explicit Locking
 Automatically detecting lost updates
 Compare-and-set
 Conflict resolution and replication

Write skew
A transaction reads something, makes a
decision based on the value it saw, and writes
the decision to the database. However, by the
time the write is made, the premise of the
decision is no longer true.

Write skew example
Account Amount
Alice £70
Bob £80
Constraint
Either account can be overdrawn if
Alice’s Amount + Bob’s Amount > 0

Write skew example
Account Amount
Alice - £30
Bob £80
Constraint
Transaction 1
Alice withdraws £100
Permitted, 70 + 80 = £150

Write skew example
Account Amount
Alice £70
Bob - £20
Constraint
Transaction 2
Bob withdraws £100
Permitted, 70 + 80 = £150

Write skew example
Account Amount
Alice £70
Bob £80
Account Amount
Alice - £30
Bob £80
Account Amount
Alice £70
Bob - £20
Transaction 1
Alice withdraws £100
Permitted, 70 + 80 = £150
Transaction 2
Bob withdraws £100
Permitted, 70 + 80 = £150
Account Amount
Alice - £30
Bob - £20

Solutions to write skew
Serializable
Lock everything you read
( 2 PL, pessimistic)
Literally serial order
(H-store, VoltDB)
Detect conflict & abort
(SSI, optimistic)

Non repeatable read
A transaction retrieves a row twice and
the values within the row differ between
the reads.

Non repeatable read example
Transaction 1
/* Query 1 */
SELECT * FROM users WHERE id = 1;
/* Query 1 */
SELECT * FROM users WHERE id = 1; COMMIT;
/* lock-based REPEATABLE READ */
Transaction 2
/* Query 2 */
UPDATE users SET age = 21 WHERE id =
1; COMMIT;
/* in multiversion concurrency control, or
lock-based READ COMMITTED */

Phantom reads
A transaction reads rows that match
some search condition. Another client
makes a write that affects the results of
that search.

Phantom read example
Transaction 1
/* Query 1 */
SELECT * FROM users
WHERE age BETWEEN 10 AND 30;
/* Query 1 */
SELECT * FROM users
WHERE age BETWEEN 10 AND 30;
Transaction 2
/* Query 2 */
INSERT INTO users(id,name,age)
VALUES ( 3, 'Bob', 27 ); COMMIT;
This search condition will give
different result now.

Summary
• Without transactions, data can become inconsistent in
various error scenarios.
• Transactions can hugely reduce the number of potential
error cases when there is a complex database access pattern.
A large class of errors is reduced down to a simple
transaction abort, and the application just needs to try again.
• There are several widely used isolation levels, in particular
read committed, snapshot isolation (sometimes called
repeatable read) and serializable.
• Different race conditions can happen: dirty reads, dirty
writes, read skew, lost updates, write skews, phantom reads.

Transaction

More Related Content

Similar to Transaction

More from Sanjoy Kumar Roy

Recently uploaded

Transaction

Editor's Notes