SlideShare a Scribd company logo
CS8492 – DATABASE
MANAGEMENT SYSTEMS
UNIT III - TRANSACTIONS
1
PreparedbyR.Arthy,AP/IT,KCET
OUTLINE
 Transaction Concepts – ACID Properties
 Schedules – Serializability
 Concurrency Control –Need for Concurrency
 Locking Protocols
 Two Phase Locking
 Deadlock
 Transaction Recovery
 Save Points –Isolation Levels
 SQL Facilities for Concurrency and Recovery
2
PreparedbyR.Arthy,AP/IT,KCET
TRANSACTION CONCEPTS
PreparedbyR.Arthy,AP/IT,KCET
3
TRANSACTION CONCEPTS
 Collections of operations that form a single logical unit
of work are called transactions.
 A transaction is a unit of program execution that
accesses and possibly updates various data items.
 A transaction is delimited by statements (or function
calls) of the form begin transaction and end
transaction.
 Two main issues to deal with:
 Failures of various kinds, such as hardware failures and
system crashes
 Concurrent execution of multiple transactions
4
PreparedbyR.Arthy,AP/IT,KCET
ACID PROPERTIES
 Atomicity. Either all operations of the transaction are properly
reflected in the database or none are.
 Consistency. Execution of a transaction in isolation preserves the
consistency of the database.
 Isolation. Although multiple transactions may execute concurrently,
each transaction must be unaware of other concurrently executing
transactions. Intermediate transaction results must be hidden from
other concurrently executed transactions.
 That is, for every pair of transactions Ti and Tj, it appears to Ti
that either Tj, finished execution before Ti started, or Tj started
execution after Ti finished.
 Durability. After a transaction completes successfully, the changes
it has made to the database persist, even if there are system failures.
5
PreparedbyR.Arthy,AP/IT,KCET
A SIMPLE TRANSACTION MODEL
Transactions access data using two
operations:
 read(X), which transfers the data item X from the
database to a variable, also called X, in a buffer in
main memory belonging to the transaction that
executed the read operation.
 write(X), which transfers the value in the variable
X in the main-memory buffer of the transaction
that executed the write to the data item X in the
database. 6
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Let Ti be a transaction that transfers $50 from account A to account
B. This transaction can be defined as:
Ti : read(A);
A := A − 50;
write(A);
read(B);
B := B + 50;
write(B).
 Atomicity requirement
 if the transaction fails after step 3 and before step 6, money will
be “lost” leading to an inconsistent database state
 Failure could be due to software or hardware
 the system should ensure that updates of a partially executed
transaction are not reflected in the database
7
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Durability requirement — once the user has been notified that the
transaction has completed (i.e., the transfer of the $50 has taken
place), the updates to the database by the transaction must persist
even if there are software or hardware failures.
 Consistency requirement in above example:
 the sum of A and B is unchanged by the execution of the
transaction
 In general, consistency requirements include
 Explicitly specified integrity constraints such as primary keys
and foreign keys
 Implicit integrity constraints
 e.g. sum of balances of all accounts, minus sum of loan
amounts must equal value of cash-in-hand
 A transaction must see a consistent database.
 During transaction execution the database may be temporarily
inconsistent.
 When the transaction completes successfully the database must
be consistent
 Erroneous transaction logic can lead to inconsistency
8
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Isolation requirement — if between steps 3 and 6, another
transaction T2 is allowed to access the partially updated database, it
will see an inconsistent database (the sum A + B will be less than it
should be).
T1 T2
1. read(A)
2. A := A – 50
3. write(A)
read(A), read(B), print(A+B)
4. read(B)
5. B := B + 50
6. write(B
 Isolation can be ensured trivially by running transactions serially
 that is, one after the other.
 However, executing multiple transactions concurrently has
significant benefits, as we will see later.
9
PreparedbyR.Arthy,AP/IT,KCET
TRANSACTION STATES
10
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Active – the initial state; the transaction stays in this state while it is
executing
 Partially committed – after the final statement has been executed.
 Failed -- after the discovery that normal execution can no longer
proceed.
 Aborted – after the transaction has been rolled back and the database
restored to its state prior to the start of the transaction. Two options
after it has been aborted:
 restart the transaction
 can be done only if no internal logical error
 kill the transaction
 Committed – after successful completion.
11
PreparedbyR.Arthy,AP/IT,KCET
QUIZ TIME
 Collections of operations that form a single logical unit
of work are called __________
a) Views
b) Networks
c) Units
d) Transactions
12
PreparedbyR.Arthy,AP/IT,KCET
 The “all-or-none” property is commonly referred to as
_________
a) Isolation
b) Durability
c) Atomicity
d) None of the mentioned
13
PreparedbyR.Arthy,AP/IT,KCET
 Which of the following is a property of transactions?
a) Atomicity
b) Durability
c) Isolation
d) All of the mentioned
14
PreparedbyR.Arthy,AP/IT,KCET
 Execution of translation in isolation preserves the
_________ of a database
a) Atomicity
b) Consistency
c) Durability
d) All of the mentioned
15
PreparedbyR.Arthy,AP/IT,KCET
 Which of the following is not a property of a transaction?
a) Atomicity
b) Simplicity
c) Isolation
d) Durability
16
PreparedbyR.Arthy,AP/IT,KCET
 A transaction that has not been completed successfully is
called as _______
a) Compensating transaction
b) Aborted transaction
c) Active transaction
d) Partially committed transaction
17
PreparedbyR.Arthy,AP/IT,KCET
 Which of the following is not a transaction state?
a) Active
b) Partially committed
c) Failed
d) Compensated
18
PreparedbyR.Arthy,AP/IT,KCET
PART B QUESTIONS
 Write a short note on Transaction concepts. [Nov 2014]
 During execution, a transaction passes through several
states, until it finally commits or aborts. List all possible
sequences of states through which a transaction may
pass. Explain why each state transition may occur? [May
2018]
 Explain with an example the properties that must be
satisfied by a transaction. [May 2018]
19
PreparedbyR.Arthy,AP/IT,KCET
SERIALIZABILITY
PreparedbyR.Arthy,AP/IT,KCET
20
CONCURRENT EXECUTIONS
 Multiple transactions are allowed to run concurrently in
the system. Advantages are:
 increased processor and disk utilization, leading to better
transaction throughput
 E.g. one transaction can be using the CPU while another is reading
from or writing to the disk
 reduced average response time for transactions: short
transactions need not wait behind long ones.
 Concurrency control schemes – mechanisms to
achieve isolation
 that is, to control the interaction among the concurrent
transactions in order to prevent them from destroying the
consistency of the database 21
PreparedbyR.Arthy,AP/IT,KCET
SCHEDULES
 Schedule – a sequences of instructions that specify the
chronological order in which instructions of concurrent
transactions are executed
 a schedule for a set of transactions must consist of all
instructions of those transactions
 must preserve the order in which the instructions appear in
each individual transaction.
 A transaction that successfully completes its execution
will have a commit instructions as the last statement
 by default transaction assumed to execute commit instruction
as its last step
 A transaction that fails to successfully complete its
execution will have an abort instruction as the last
statement
22
PreparedbyR.Arthy,AP/IT,KCET
SCHEDULE 1
 Let T1 transfer $50 from A to B, and T2 transfer 10% of
the balance from A to B.
 A serial schedule in which T1 is followed by T2 :
23
PreparedbyR.Arthy,AP/IT,KCET
SCHEDULE 2
 A serial schedule where T2 is followed by T1
24
PreparedbyR.Arthy,AP/IT,KCET
SCHEDULE 3
 Let T1 and T2 be the transactions defined previously. The
following schedule is not a serial schedule, but it is
equivalent to Schedule 1.
25
PreparedbyR.Arthy,AP/IT,KCET
SCHEDULE 4
 The following concurrent schedule does not preserve the
value of (A + B ).
26
PreparedbyR.Arthy,AP/IT,KCET
SERIALIZABILITY
 Basic Assumption – Each transaction preserves database
consistency.
 Thus serial execution of a set of transactions preserves
database consistency.
 A (possibly concurrent) schedule is serializable if it is
equivalent to a serial schedule. Different forms of schedule
equivalence give rise to the notions of:
1. Conflict serializability
2. View serializability
 Simplified view of transactions
 We ignore operations other than read and write instructions
 We assume that transactions may perform arbitrary computations on
data in local buffers in between reads and writes.
 Our simplified schedules consist of only read and write
instructions.
27
PreparedbyR.Arthy,AP/IT,KCET
CONFLICTING INSTRUCTIONS
 Instructions li and lj of transactions Ti and Tj respectively,
conflict if and only if there exists some item Q accessed
by both li and lj, and at least one of these instructions
wrote Q.
1. li = read(Q), lj = read(Q). li and lj don’t conflict.
2. li = read(Q), lj = write(Q). They conflict.
3. li = write(Q), lj = read(Q). They conflict
4. li = write(Q), lj = write(Q). They conflict
 Intuitively, a conflict between li and lj forces a (logical)
temporal order between them.
 If li and lj are consecutive in a schedule and they do not
conflict, their results would remain the same even if they had
been interchanged in the schedule. 28
PreparedbyR.Arthy,AP/IT,KCET
CONFLICT SERIALIZABILITY
 If a schedule S can be transformed into a schedule S´ by
a series of swaps of non-conflicting instructions, we say
that S and S´ are conflict equivalent.
 We say that a schedule S is conflict serializable if it is
conflict equivalent to a serial schedule
29
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Schedule 3 can be transformed into Schedule 6, a serial schedule
where T2 follows T1, by series of swaps of non-conflicting
instructions.
 Therefore Schedule 3 is conflict serializable.
30
PreparedbyR.Arthy,AP/IT,KCET
Schedule 3 Schedule 6
[CONTD…]
 Example of a schedule that is not conflict serializable:
 We are unable to swap instructions in the above schedule
to obtain either the serial schedule < T3, T4 >, or the
serial schedule < T4, T3 >.
31
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 To determine conflict serializability of a schedule.
 Consider a schedule S. We construct a directed graph, called a
precedence graph, from S.
 This graph consists of a pair G = (V, E), where
 V is a set of vertices and E is a set of edges. The set of vertices
consists of all the transactions participating in the schedule.
 The set of edges consists of all edges
 Ti →Tj for which one of three conditions holds:
 Ti executes write(Q) before Tj executes read(Q).
 Ti executes read(Q) before Tj executes write(Q).
 Ti executes write(Q) before Tj executes write(Q).
32
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 If an edge Ti → Tj exists in the precedence graph, then,
in any serial schedule S equivalent to S, Ti must appear
before Tj .
33
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 If the precedence graph for S has a cycle, then schedule S
is not conflict serializable.
 If the graph contains no cycles, then the schedule S is
conflict serializable.
34
PreparedbyR.Arthy,AP/IT,KCET
TEST FOR CONFLICT SERIALIZABILITY
 A schedule is conflict serializable if
and only if its precedence graph is
acyclic.
 Cycle-detection algorithms exist which
take order n2 time, where n is the
number of vertices in the graph.
 (Better algorithms take order n + e
where e is the number of edges.)
 If precedence graph is acyclic, the
serializability order can be obtained by
a topological sorting of the graph.
 This is a linear order consistent
with the partial order of the graph.
35
PreparedbyR.Arthy,AP/IT,KCET
VIEW SERIALIZABILITY
 Let S and S´ be two schedules with the same set of transactions. S and S´
are view equivalent if the following three conditions are met, for each data
item Q,
1. If in schedule S, transaction Ti reads the initial value of Q, then in
schedule S’ also transaction Ti must read the initial value of Q.
2. If in schedule S transaction Ti executes read(Q), and that value was
produced by transaction Tj (if any), then in schedule S’ also
transaction Ti must read the value of Q that was produced by the same
write(Q) operation of transaction Tj .
3. The transaction (if any) that performs the final write(Q) operation in
schedule S must also perform the final write(Q) operation in schedule
S’.
As can be seen, view equivalence is also based purely on reads and writes
alone.
36
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 A schedule S is view serializable if it is view equivalent
to a serial schedule.
 Every conflict serializable schedule is also view
serializable.
 Every view serializable schedule that is not conflict
serializable has blind writes.
37
PreparedbyR.Arthy,AP/IT,KCET
TEST FOR VIEW SERIALIZABILITY
 The precedence graph test for conflict serializability cannot be used directly
to test for view serializability.
 Extension to test for view serializability has cost exponential in the size
of the precedence graph.
 The problem of checking if a schedule is view serializable falls in the class
of NP-complete problems.
 Thus existence of an efficient algorithm is extremely unlikely.
 However practical algorithms that just check some sufficient conditions for
view serializability can still be used.
38
PreparedbyR.Arthy,AP/IT,KCET
RECOVERABLE SCHEDULES
 Recoverable schedule — if a transaction Tj reads a data item
previously written by a transaction Ti , then the commit
operation of Ti appears before the commit operation of Tj.
 The following schedule (Schedule 11) is not recoverable if T9
commits immediately after the read
 If T8 should abort, T9 would have read (and possibly shown to
the user) an inconsistent database state. Hence, database must
ensure that schedules are recoverable. 39
PreparedbyR.Arthy,AP/IT,KCET
CASCADING ROLLBACKS
 Cascading rollback – a single transaction failure leads to a
series of transaction rollbacks. Consider the following
schedule where none of the transactions has yet committed (so
the schedule is recoverable)
If T10 fails, T11 and T12 must also be rolled back.
 Can lead to the undoing of a significant amount of work 40
PreparedbyR.Arthy,AP/IT,KCET
QUIZ TIME
 I and J are _________ if they are operations by different
transactions on the same data item, and at least one of
them is a write operation.
a) Conflicting
b) Overwriting
c) Isolated
d) Durable
41
PreparedbyR.Arthy,AP/IT,KCET
 If a schedule S can be transformed into a schedule S’ by
a series of swaps of non-conflicting instructions, then S
and S’ are
a) Non conflict equivalent
b) Equal
c) Conflict equivalent
d) Isolation equivalent
42
PreparedbyR.Arthy,AP/IT,KCET
 A schedule is __________ if it is conflict equivalent to a
serial schedule.
a) Conflict serializable
b) Conflicting
c) Non serializable
d) None of the mentioned
43
PreparedbyR.Arthy,AP/IT,KCET
 The set of ________ in a precedence graph consists of all
the transactions participating in the schedule
a) Vertices
b) Edges
c) Directions
d) None of the mentioned
44
PreparedbyR.Arthy,AP/IT,KCET
 A ___________of the transactions can be obtained by
finding a linear order consistent with the partial order of
the precedence graph.
a) Serializability order
b) Direction graph
c) Precedence graph
d) Scheduling scheme
45
PreparedbyR.Arthy,AP/IT,KCET
 State true or false: If I = read(Q) and J = read(Q) then the
order of I and J does not matter.
a) True
b) False
46
PreparedbyR.Arthy,AP/IT,KCET
 State true or false: If I = read(Q) and J = write(Q) then
the order of I and J does not matter.
a) True
b) False
47
PreparedbyR.Arthy,AP/IT,KCET
 The phenomenon in which one failure leads to a series of
transaction rollbacks is called as ________
a) Cascading rollback
b) Cascadeless rollback
c) Cascade cause
d) None of the mentioned
48
PreparedbyR.Arthy,AP/IT,KCET
 A ___________ is one where, for each pair of
transactions Ti and Tj such that Tj reads a data item
previously written by Ti , the commit operation of Ti
appears before the commit operation of Tj
a) Partial schedule
b) Dependent schedule
c) Recoverable schedule
d) None of the mentioned
49
PreparedbyR.Arthy,AP/IT,KCET
 Which of the following are the advantages of transaction
concurrency?
a) Increased throughput
b) Increased utilization
c) Reduces average response time
d) All of the mentioned
50
PreparedbyR.Arthy,AP/IT,KCET
GATE QUESTIONS
The following functional dependencies hold true for the
relational schema R{V, W, X, Y, Z}:
V -> W
VW -> X
Y -> VX
Y -> Z
Which of the following is irreducible equivalent for this set of
functional dependencies?
GATE | GATE-CS-2017 (Set 1) | Question 15
(A) A
(B) B
(C) C
(D) D
51
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
Consider a database that has the relation schemas EMP (EmpId, EmpName, DepId),
and DEPT(DeptName, DeptId). Note that the DepId can be permitted to be NULL
in the relation EMP. Consider the following queries on the database expressed in
tuple relational calculus.
I. {t | ∃ u ∈ EMP (t[EMPName] = u[EmpName] ∧ ∀ v ∈ DEPT (t[DeptId] ≠
DeptId]))}
II. {t | ∃ u ∈ EMP (t[EMPName] = u[EmpName] ∧ ∃ v ∈ DEPT (t[DeptId] ≠
DeptId]))}
III. {t | ∃ u ∈ EMP (t[EMPName] = u[EmpName] ∧ ∃ v ∈ DEPT (t[DeptId] =
DeptId]))}
GATE | GATE-CS-2017 (Set 1) | Question 36
(A) I and II only
(B) I and III only
(C) II and III only
(D) I, II, and III
52
PreparedbyR.Arthy,AP/IT,KCET
PART B QUESTIONS
 Discuss view serializability and conflict serializability. [Nov
2015, May 2018]
 Consider the following extension to the tree locking protocol,
which allows both shared and exclusive locks:
i) A transaction can be either a read only transaction, in which
case it can request only shared locks or an update transaction
in which case it can request only exclusive locks.
ii) Each transaction must follow the rules of the tree protocol.
Read only transactions may lock any data item first, where as
update transactions must lock the root first.
Show that the protocol ensures Serializability and deadlock
freedom. [Nov 2016]
 Discuss in detail about the testing of serializability. [May
2019] 53
PreparedbyR.Arthy,AP/IT,KCET
CONCURRENCY CONTROL
PreparedbyR.Arthy,AP/IT,KCET
54
NEED FOR CONCURRENCY CONTROL
 In the concurrency control, the multiple transactions can
be executed simultaneously.
 It may affect the transaction result. It is highly important
to maintain the order of execution of those transactions.
 Several problems can occur when concurrent
transactions are executed in an uncontrolled manner.
 Following are the three problems in concurrency control.
 Lost updates
 Dirty read
 Unrepeatable read
55
PreparedbyR.Arthy,AP/IT,KCET
LOST UPDATE PROBLEM
 When two transactions that access the same database
items contain their operations in a way that makes the
value of some database item incorrect, then the lost
update problem occurs.
 If two transactions T1 and T2 read a record and then
update it, then the effect of updating of the first record
will be overwritten by the second update.
56
PreparedbyR.Arthy,AP/IT,KCET
DIRTY READ
 A dirty read is the situation when a transaction reads a
data that has not yet been committed.
 Example:
 Transaction 1 updates a row and leaves it uncommitted
 Transaction 2 reads the updated row
 If transaction 1 rolls back the change, transaction 2 will have
read data that is considered never to have existed.
57
PreparedbyR.Arthy,AP/IT,KCET
UNREPEATABLE READ
 Non repeatable read occurs when a transaction reads
same row twice but received a different value each time.
 Example:
 Transaction T1 reads a data
 Due to concurrency, another transaction T2 updates the same
data and commit
 Transaction T1 rereads the same data, it will retrieve a
different value
58
PreparedbyR.Arthy,AP/IT,KCET
CONCURRENCY CONTROL PROTOCOL
 Concurrency control protocols ensure atomicity,
isolation, and serializability of concurrent transactions.
 The concurrency control protocol can be divided into
three categories:
 Lock based protocol
 Time-stamp protocol
 Validation based protocol
59
PreparedbyR.Arthy,AP/IT,KCET
LOCK-BASED PROTOCOLS
 A lock is a mechanism to control concurrent access to a
data item
 Data items can be locked in two modes :
1. exclusive (X) mode. Data item can be both read as
well as written. X-lock is requested using lock-X
instruction.
2. shared (S) mode. Data item can only be read. S-lock
is requested using lock-S instruction.
 Lock requests are made to concurrency-control manager.
Transaction can proceed only after request is granted.
60
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Lock-compatibility matrix
 A transaction may be granted a lock on an item if the
requested lock is compatible with locks already held on the
item by other transactions
 Any number of transactions can hold shared locks on an item,
 but if any transaction holds an exclusive on the item no other
transaction may hold any lock on the item.
 If a lock cannot be granted, the requesting transaction is made
to wait till all incompatible locks held by other transactions
have been released. The lock is then granted.
61
PreparedbyR.Arthy,AP/IT,KCET
RULES FOR LOCKING
 If a transaction has a read lock on a data item, then it can
only read the item but cannot update.
 If a transaction has a write lock on a data item, then it
can both read and update the data item.
 If a transaction has a write lock on a data item, then other
transactions cannot acquire read or write locks on the
data item.
62
PreparedbyR.Arthy,AP/IT,KCET
PRE-CLAIMING LOCK PROTOCOL
 It evaluate their operations and create a list of data items
on which they need locks.
 Before initiating an execution, the transaction requests
the system for all the locks it needs beforehand.
 If all the locks are granted, the transaction executes and
releases all the locks when all its operations are over.
 If all the locks are not granted, the transaction rolls back
and waits until all t he locks are granted.
63
PreparedbyR.Arthy,AP/IT,KCET
T begin T end Time
Lock acquisition
phase
[CONTD…]
 Example of a transaction performing locking:
T2: lock-S(A);
read (A);
unlock(A);
lock-S(B);
read (B);
unlock(B);
display(A+B)
 Locking as above is not sufficient to guarantee serializability
— if A and B get updated in-between the read of A and B, the
displayed sum would be wrong.
 A locking protocol is a set of rules followed by all
transactions while requesting and releasing locks. Locking
protocols restrict the set of possible schedules. 64
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
65
PreparedbyR.Arthy,AP/IT,KCET
PROBLEM
 Consider the partial schedule
 Neither T3 nor T4 can make progress — executing lock-S(B)
causes T4 to wait for T3 to release its lock on B, while
executing lock-X(A) causes T3 to wait for T4 to release its
lock on A.
 Such a situation is called a deadlock.
 To handle a deadlock one of T3 or T4 must be rolled back
and its locks released.
66
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 The potential for deadlock exists in most locking
protocols. Deadlocks are a necessary evil.
 Starvation is also possible if concurrency control
manager is badly designed. For example:
 A transaction may be waiting for an X-lock on an item, while
a sequence of other transactions request and are granted an S-
lock on the same item.
 The same transaction is repeatedly rolled back due to
deadlocks.
 Concurrency control manager can be designed to prevent
starvation.
67
PreparedbyR.Arthy,AP/IT,KCET
TWO-PHASE LOCKING PROTOCOL
 This is a protocol which ensures conflict-serializable
schedules.
 Phase 1: Growing Phase
 transaction may obtain locks
 transaction may not release locks
 Phase 2: Shrinking Phase
 transaction may release locks
 transaction may not obtain locks
68
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 The protocol assures serializability.
 It can be proved that the transactions can be serialized in
the order of their lock points (i.e. the point where a
transaction acquired its final lock). 69
PreparedbyR.Arthy,AP/IT,KCET
T begin T end
Time
Lock acquisition phase Lock releasing phase
[CONTD…]
 Two-phase locking does not ensure freedom from
deadlocks
 Cascading roll-back is possible under two-phase locking.
 To avoid this, follow a modified protocol called strict
two-phase locking. Here a transaction must hold all its
exclusive locks till it commits/aborts.
 Rigorous two-phase locking is even stricter: here all
locks are held till commit/abort.
 In this protocol transactions can be serialized in the order
in which they commit.
70
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Two-phase locking with lock conversions:
– First Phase:
 can acquire a lock-S on item
 can acquire a lock-X on item
 can convert a lock-S to a lock-X (upgrade)
– Second Phase:
 can release a lock-S
 can release a lock-X
 can convert a lock-X to a lock-S (downgrade)
 This protocol assures serializability. But still relies on the
programmer to insert the various locking instructions. 71
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 A transaction Ti issues the standard read/write
instruction, without explicit locking calls.
 The operation read(D) is processed as:
if Ti has a lock on D
then
read(D)
else begin
if necessary wait until no other
transaction has a lock-X on D
grant Ti a lock-S on D;
read(D)
end 72
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 write(D) is processed as:
if Ti has a lock-X on D
then
write(D)
else begin
if necessary wait until no other trans. has any lock
on D,
if Ti has a lock-S on D
then
upgrade lock on D to lock-X
else
grant Ti a lock-X on D
write(D)
end;
 All locks are released after commit or abort
73
PreparedbyR.Arthy,AP/IT,KCET
IMPLEMENTATION OF LOCKING
 A lock manager can be implemented as a separate
process to which transactions send lock and unlock
requests
 The lock manager replies to a lock request by sending a
lock grant messages (or a message asking the transaction
to roll back, in case of a deadlock)
 The requesting transaction waits until its request is
answered
 The lock manager maintains a data-structure called a
lock table to record granted locks and pending requests
 The lock table is usually implemented as an in-memory
hash table indexed on the name of the data item being
locked 74
PreparedbyR.Arthy,AP/IT,KCET
LOCK TABLE
 Black rectangles indicate granted locks,
white ones indicate waiting requests
 Lock table also records the type of lock
granted or requested
 New request is added to the end of the
queue of requests for the data item, and
granted if it is compatible with all earlier
locks
 Unlock requests result in the request
being deleted, and later requests are
checked to see if they can now be
granted
 If transaction aborts, all waiting or
granted requests of the transaction are
deleted
 lock manager may keep a list of
locks held by each transaction, to
implement this efficiently`
75
PreparedbyR.Arthy,AP/IT,KCET
TIMESTAMP-BASED PROTOCOLS
 Each transaction is issued a timestamp when it enters the
system. If an old transaction Ti has time-stamp TS(Ti), a
new transaction Tj is assigned time-stamp TS(Tj) such
that TS(Ti) <TS(Tj).
 The protocol manages concurrent execution such that the
time-stamps determine the serializability order.
 In order to assure such behavior, the protocol maintains
for each data Q two timestamp values:
 W-timestamp(Q) is the largest time-stamp of any transaction
that executed write(Q) successfully.
 R-timestamp(Q) is the largest time-stamp of any transaction
that executed read(Q) successfully. 76
PreparedbyR.Arthy,AP/IT,KCET
VALIDATION-BASED PROTOCOL
 Execution of transaction Ti is done in three phases.
1. Read and execution phase: Transaction Ti writes only to
temporary local variables
2. Validation phase: Transaction Ti performs a ``validation test''
to determine if local variables can be written without violating
serializability.
3. Write phase: If Ti is validated, the updates are applied to the
database; otherwise, Ti is rolled back.
 The three phases of concurrently executing transactions can be
interleaved, but each transaction must go through the three phases in
that order.
 Assume for simplicity that the validation and write phase occur
together, atomically and serially
 i.e., only one transaction executes validation/write at a time.
 Also called as optimistic concurrency control since transaction
executes fully in the hope that all will go well during validation
77
PreparedbyR.Arthy,AP/IT,KCET
PART B QUESTIONS
 What is concurrency? Explain it in terms of locking
mechanism and two phase locking protocol. [Nov 2014,
May 2016, Nov 2016, Nov 2017, May 2018]
 What is concurrency control? How is it implemented in
DBMS? Illustrate with a suitable example. [Nov 2015]
 Explain about locking protocols. [May 2016]
 Explain lock based concurrency control with suitable
example. [Nov 2017]
 Differentiate strict two phase locking protocol and
rigorous two phase locking protocol. [Nov 2018]
 How the time stamps are implemented? Explain. [Nov
2018]
78
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Consider the following two transactions:
T1 and T2 works in parallel. Add lock and unlock
instructions to transactions T1 and T2 so that they
observe the two phase locking protocol. Can the
execution of these transactions result in a deadlock?
[Nov 2016] 79
PreparedbyR.Arthy,AP/IT,KCET
T1 T2
read(A);
read(B),
if A=0 then B:=B+1;
write(B).
read(B);
read(A);
if B=0 then A:A+1;
write(A).
DEADLOCK
PreparedbyR.Arthy,AP/IT,KCET
80
INTRODUCTION
 In a multi-process system, deadlock is an unwanted
situation that arises in a shared resource environment.
 Where a process indefinitely waits for a resource that is
held by another process.
 Definition
 System is deadlocked if there is a set of transactions such that
every transaction in the set is waiting for another transaction
in the set.
 For example:
 Assume a set of transactions {T0, T1, T2}
81
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 T0 needs a resource X to complete its task
 Resource X is held by T1, and T1 is waiting for a
resource Y, which is held by T2
 T2 is waiting for the resource Z, which is held by T0
82
PreparedbyR.Arthy,AP/IT,KCET
T0
T1
T2
DEADLOCK PREVENTION
 Deadlock prevention protocols ensure that the system
will never enter into a deadlock state.
 Some prevention strategies :
 Require that each transaction locks all its data items before it
begins execution (predeclaration).
 Impose partial ordering of all data items and require that a
transaction can lock data items only in the order specified by
the partial order (graph-based protocol).
 Schemes:
 Wait – Die Scheme
 Wound – Wait Scheme
83
PreparedbyR.Arthy,AP/IT,KCET
WAIT-DIE SCHEME — NON-PREEMPTIVE
 In this scheme, is a transaction requests to lock a data
item, which is already locked by another transaction,
then one of the two possibilities may occur:
 If TS(Ti) < TS(Tj) – that is Ti is older than Tj, which is
requesting a conflicting lock, is older than Tj – then Ti is
allowed to wait until the data item is available.
 If TS(Ti) > TS(Tj) – that is Ti is younger than Tj – then Ti is
aborted. Ti is restarted later with a random delay but with the
same timestamp.
 Older transaction may wait for younger one to release data item.
Younger transactions never wait for older ones; they are rolled
back instead.
 A transaction may die several times before acquiring needed data
item 84
PreparedbyR.Arthy,AP/IT,KCET
WOUND-WAIT SCHEME
 In this scheme, is a transaction requests to lock a data
item, which is already locked by another transaction,
then one of the two possibilities may occur:
 If TS(Ti) < TS(Tj) – that is Ti is older than Tj, then Ti forces
Tj to be rolled back – that is Ti wounds Tj. Tj is restarted later
with a random delay but with the same timestamp.
 If TS(Ti) > TS(Tj) – that is Ti is younger than Tj – Ti is
forced to wait until the resource is available.
 Older transaction wounds (forces rollback) of younger
transaction instead of waiting for it. Younger transactions
may wait for older ones.
 May be fewer rollbacks than wait-die scheme. 85
PreparedbyR.Arthy,AP/IT,KCET
DEADLOCK AVOIDANCE
 Deadlocks can be described as a wait-for graph, which
consists of a pair G = (V,E),
 V is a set of vertices (all the transactions in the system)
 E is a set of edges; each element is an ordered pair Ti Tj.
 If Ti  Tj is in E, then there is a directed edge from Ti to Tj,
implying that Ti is waiting for Tj to release a data item.
 When Ti requests a data item currently being held by Tj, then
the edge Ti Tj is inserted in the wait-for graph. This edge is
removed only when Tj is no longer holding a data item needed
by Ti.
 The system is in a deadlock state if and only if the wait-for
graph has a cycle. Must invoke a deadlock-detection
algorithm periodically to look for cycles. 86
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
87
PreparedbyR.Arthy,AP/IT,KCET
Wait-for graph without a cycle Wait-for graph with a cycle
DEADLOCK RECOVERY
 When deadlock is detected :
 Some transaction will have to rolled back (made a victim) to
break deadlock. Select that transaction as victim that will
incur minimum cost.
 Rollback -- determine how far to roll back transaction
 Total rollback: Abort the transaction and then restart it.
 More effective to roll back transaction only as far as necessary to
break deadlock.
 Starvation happens if same transaction is always chosen as
victim. Include the number of rollbacks in the cost factor to
avoid starvation
88
PreparedbyR.Arthy,AP/IT,KCET
PART B QUESTIONS
 Write a short note on deadlock. [Nov 2014]
 What is deadlock? How does it occur? How transactions
be written to avoid deadlock and guarantee correct
execution. Illustrate with suitable example. [Nov 2015]
 Outline the deadlock handling mechanisms. [Nov 2016,
Nov 2018]
 Explain the concepts of Deadlock avoidance and
prevention in detail. [Nov 2018]
89
PreparedbyR.Arthy,AP/IT,KCET
PROBLEMS
 Consider the following schedules. The actions are listed in the
order they are scheduled and prefixed with the transaction
name:
 S1: T1:R(X),T2:R(X),T1:W(Y),T2:W(Y),T1:R(Y),T2:R(Y)
 S2: T3:W(X),T1:R(X),T1:W(Y),T2:W(Z),T3:R(Z)
 For each of the schedules answer the following questions:
 What is the precedence graph for the schedule?
 Is the schedule conflict serializable? If so what are all the
conflict equivalent serial schedules?
 Is the schedule view serializable? If so what are all the view
equivalent serial schedule?
[May 2015] 90
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Consider the following schedules S1 and S2 with three
transactions T1,T2 and T3 as follows;
 S1:r1(X);r3(X);w1(X);r2(X);w3(X)
 S2:r3(X);r2(X);w3(X);r1(X);w1(X)
 Is the schedule S1 and S2 conflict serializable?
91
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Consider a schedule S with three transactions T1,T2 and
T3 as follows;
 S1: R1(x);W2(x);R1(x);W1(y);commit1;commit2;
 S2: R1(A) , R2(A) , R1(B) , R2(B) , R3(B) , W1(A) ,
W2(B)
 Is the schedule S1 and S2 conflict serializable?
92
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Consider schedules s3,s4,and s5 below. Determine whether
each schedule is strict, cascadeless, recoverable or
nonrecoverable. (Determine the strictest recoverability
condition that each schedule satisfies.)
 S3:r1(X);r2(Z);r3(X);r3(Y);w1(X);c1;w3(Y);c3;r2(Y);w2(Z
);w2(Y);C2;
 s4 :
r1(X);r2(Z);r1(Z);r3(X),r3(Y);w1(X);w3(Y);r2(Y);w2(Z);w
2(Y);c1;c2;c3;
 s5 :
r1(X);r2(Z);r3(X);r1(Z);r2(Y);r3(Y);w1(X);C1;w2(Z);w3(Y
);w2(Y);c3;c2; 93
PreparedbyR.Arthy,AP/IT,KCET
TRANSACTION RECOVERY
PreparedbyR.Arthy,AP/IT,KCET
94
PART B QUESTIONS
 Explain deferred and immediate modification versions of
the log based recovery scheme. [May 2019]
 Explain why timestamp based concurrency control
allows schedules that are not recoverable. Describe how
it can be modified through buffering to disallow such
schedules.
95
PreparedbyR.Arthy,AP/IT,KCET
FAILURE CLASSIFICATION
 Transaction failure :
 Logical errors: transaction cannot complete due to some internal
error condition
 System errors: the database system must terminate an active
transaction due to an error condition (e.g., deadlock)
 System crash: a power failure or other hardware or software
failure causes the system to crash.
 Fail-stop assumption: non-volatile storage contents are assumed to
not be corrupted by system crash
 Database systems have numerous integrity checks to prevent corruption
of disk data
 Disk failure: a head crash or similar disk failure destroys all
or part of disk storage
 Destruction is assumed to be detectable: disk drives use checksums
to detect failures
96
PreparedbyR.Arthy,AP/IT,KCET
RECOVERY ALGORITHMS
 Recovery algorithms are techniques to ensure database
consistency and transaction atomicity and durability
despite failures
 Focus of this chapter
 Recovery algorithms have two parts
1. Actions taken during normal transaction processing to
ensure enough information exists to recover from failures
2. Actions taken after a failure to recover the database contents
to a state that ensures atomicity, consistency and durability
97
PreparedbyR.Arthy,AP/IT,KCET
STORAGE STRUCTURE
 Volatile storage:
 does not survive system crashes
 examples: main memory, cache memory
 Nonvolatile storage:
 survives system crashes
 examples: disk, tape, flash memory,
non-volatile (battery backed up) RAM
 Stable storage:
 a mythical form of storage that survives all failures
 approximated by maintaining multiple copies on distinct
nonvolatile media
98
PreparedbyR.Arthy,AP/IT,KCET
STABLE-STORAGE IMPLEMENTATION
 Maintain multiple copies of each block on separate disks
 copies can be at remote sites to protect against disasters such as fire
or flooding.
 Failure during data transfer can still result in inconsistent
copies: Block transfer can result in
 Successful completion
 Partial failure: destination block has incorrect information
 Total failure: destination block was never updated
 Protecting storage media from failure during data transfer
(one solution):
 Execute output operation as follows (assuming two copies of each
block):
1. Write the information onto the first physical block.
2. When the first write successfully completes, write the same
information onto the second physical block.
3. The output is completed only after the second write successfully
completes.
99
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Protecting storage media from failure during data transfer (cont.):
 Copies of a block may differ due to failure during output
operation. To recover from failure:
1. First find inconsistent blocks:
1. Expensive solution: Compare the two copies of every disk block.
2. Better solution:
 Record in-progress disk writes on non-volatile storage (Non-volatile
RAM or special area of disk).
 Use this information during recovery to find blocks that may be
inconsistent, and only compare copies of these.
 Used in hardware RAID systems
2. If either copy of an inconsistent block is detected to have an error (bad
checksum), overwrite it by the other copy. If both have no error, but are
different, overwrite the second block by the first block.
100
PreparedbyR.Arthy,AP/IT,KCET
DATA ACCESS
 Physical blocks are those blocks residing on the disk.
 Buffer blocks are the blocks residing temporarily in main
memory.
 Block movements between disk and main memory are initiated
through the following two operations:
 input(B) transfers the physical block B to main memory.
 output(B) transfers the buffer block B to the disk, and replaces the
appropriate physical block there.
 Each transaction Ti has its private work-area in which local
copies of all data items accessed and updated by it are kept.
 Ti's local copy of a data item X is called xi.
 We assume, for simplicity, that each data item fits in, and is
stored inside, a single block.
101
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Transaction transfers data items between system buffer blocks
and its private work-area using the following operations :
 read(X) assigns the value of data item X to the local variable xi.
 write(X) assigns the value of local variable xi to data item {X} in the
buffer block.
 both these commands may necessitate the issue of an input(BX)
instruction before the assignment, if the block BX in which X resides
is not already in memory.
 Transactions
 Perform read(X) while accessing X for the first time;
 All subsequent accesses are to the local copy.
 After last access, transaction executes write(X).
 output(BX) need not immediately follow write(X). System
can perform the output operation when it deems fit.
102
PreparedbyR.Arthy,AP/IT,KCET
EXAMPLE OF DATA ACCESS
X
Y
A
B
x1
y1
buffer
Buffer Block A
Buffer Block B
input(A)
output(B)
read(X)
write(Y)
disk
work area
of T1
work area
of T2
memory
x2
103
PreparedbyR.Arthy,AP/IT,KCET
RECOVERY AND ATOMICITY
 Modifying the database without ensuring that the
transaction will commit may leave the database in an
inconsistent state.
 Consider transaction Ti that transfers $50 from account A to
account B; goal is either to perform all database
modifications made by Ti or none at all.
 Several output operations may be required for Ti (to output
A and B). A failure may occur after one of these
modifications have been made but before all of them are
made.
104
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 To ensure atomicity despite failures, we first output
information describing the modifications to stable
storage without modifying the database itself.
 We study two approaches:
 log-based recovery, and
 shadow-paging
 We assume (initially) that transactions run serially, that
is, one after the other.
105
PreparedbyR.Arthy,AP/IT,KCET
LOG-BASED RECOVERY
 A log is kept on stable storage.
 The log is a sequence of log records, and maintains a record of update
activities on the database.
 When transaction Ti starts, it registers itself by writing a
<Ti start>log record
 Before Ti executes write(X), a log record <Ti, X, V1, V2> is written,
where V1 is the value of X before the write, and V2 is the value to be
written to X.
 Log record notes that Ti has performed a write on data item Xj Xj had value
V1 before the write, and will have value V2 after the write.
 When Ti finishes it last statement, the log record <Ti commit> is
written.
 We assume for now that log records are written directly to stable
storage (that is, they are not buffered)
 Two approaches using logs
 Deferred database modification
 Immediate database modification
106
PreparedbyR.Arthy,AP/IT,KCET
DEFERRED DATABASE MODIFICATION
 The deferred database modification scheme records all
modifications to the log, but defers all the writes to after partial
commit.
 Assume that transactions execute serially
 Transaction starts by writing <Ti start> record to log.
 A write(X) operation results in a log record <Ti, X, V> being written,
where V is the new value for X
 Note: old value is not needed for this scheme
 The write is not performed on X at this time, but is deferred.
 When Ti partially commits, <Ti commit> is written to the log
 Finally, the log records are read and used to actually execute the
previously deferred writes.
107
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 During recovery after a crash, a transaction needs to be redone if
and only if both <Ti start> and<Ti commit> are there in the log.
 Redoing a transaction Ti ( redoTi) sets the value of all data items
updated by the transaction to the new values.
 Crashes can occur while
 the transaction is executing the original updates, or
 while recovery action is being taken
 example transactions T0 and T1 (T0 executes before T1):
T0: read (A) T1 : read (C)
A: - A - 50 C:- C- 100
Write (A) write (C)
read (B)
B:- B + 50
write (B) 108
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Below we show the log as it appears at three instances of time.
 If log on stable storage at time of crash is as in case:
(a) No redo actions need to be taken
(b) redo(T0) must be performed since <T0 commit> is present
(c) redo(T0) must be performed followed by redo(T1) since
<T0 commit> and <Ti commit> are present
109
PreparedbyR.Arthy,AP/IT,KCET
IMMEDIATE DATABASE MODIFICATION
 The immediate database modification scheme allows
database updates of an uncommitted transaction to be made as
the writes are issued
 since undoing may be needed, update logs must have both old value
and new value
 Update log record must be written before database item is
written
 We assume that the log record is output directly to stable storage
 Can be extended to postpone log record output, so long as prior to
execution of an output(B) operation for a data block B, all log
records corresponding to items B must be flushed to stable storage
 Output of updated blocks can take place at any time before or
after transaction commit
 Order in which blocks are output can be different from the
order in which they are written.
110
PreparedbyR.Arthy,AP/IT,KCET
IMMEDIATE DATABASE MODIFICATION EXAMPLE
Log Write Output
<T0 start>
<T0, A, 1000, 950>
To, B, 2000, 2050
A = 950
B = 2050
<T0 commit>
<T1 start>
<T1, C, 700, 600>
C = 600
BB, BC
<T1 commit>
BA
 Note: BX denotes block containing X.
x1
111
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Recovery procedure has two operations instead of one:
 undo(Ti) restores the value of all data items updated by Ti to their
old values, going backwards from the last log record for Ti
 redo(Ti) sets the value of all data items updated by Ti to the new
values, going forward from the first log record for Ti
 Both operations must be idempotent
 That is, even if the operation is executed multiple times the effect is
the same as if it is executed once
 Needed since operations may get re-executed during recovery
 When recovering after failure:
 Transaction Ti needs to be undone if the log contains the record
<Ti start>, but does not contain the record <Ti commit>.
 Transaction Ti needs to be redone if the log contains both the record
<Ti start> and the record <Ti commit>.
 Undo operations are performed first, then redo operations.
112
PreparedbyR.Arthy,AP/IT,KCET
IMMEDIATE DB MODIFICATION RECOVERY EXAMPLE
Below we show the log as it appears at three instances of time.
Recovery actions in each case above are:
(a) undo (T0): B is restored to 2000 and A to 1000.
(b) undo (T1) and redo (T0): C is restored to 700, and then A and B are
set to 950 and 2050 respectively.
(c) redo (T0) and redo (T1): A and B are set to 950 and 2050
respectively. Then C is set to 600
113
PreparedbyR.Arthy,AP/IT,KCET
CHECKPOINTS
 Problems in recovery procedure as discussed earlier :
1. searching the entire log is time-consuming
2. we might unnecessarily redo transactions which have
already
3. output their updates to the database.
 Streamline recovery procedure by periodically
performing checkpointing
1. Output all log records currently residing in main memory
onto stable storage.
2. Output all modified buffer blocks to the disk.
3. Write a log record < checkpoint> onto stable storage.
114
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 During recovery we need to consider only the most recent
transaction Ti that started before the checkpoint, and transactions
that started after Ti.
1. Scan backwards from end of log to find the most recent
<checkpoint> record
2. Continue scanning backwards till a record <Ti start> is found.
3. Need only consider the part of log following above start
record. Earlier part of log can be ignored during recovery, and
can be erased whenever desired.
4. For all transactions (starting from Ti or later) with no <Ti
commit>, execute undo(Ti). (Done only in case of immediate
modification.)
5. Scanning forward in the log, for all transactions starting
from Ti or later with a <Ti commit>, execute redo(Ti). 115
PreparedbyR.Arthy,AP/IT,KCET
EXAMPLE OF CHECKPOINTS
 T1 can be ignored (updates already output to disk due to
checkpoint)
 T2 and T3 redone.
 T4 undone
Tc Tf
T1
T2
T3
T4
checkpoint system failure
116
PreparedbyR.Arthy,AP/IT,KCET
RECOVERY WITH CONCURRENT TRANSACTIONS
 We modify the log-based recovery schemes to allow multiple
transactions to execute concurrently.
 All transactions share a single disk buffer and a single log
 A buffer block can have data items updated by one or more transactions
 We assume concurrency control using strict two-phase locking;
 i.e. the updates of uncommitted transactions should not be visible to
other transactions
 Otherwise how to perform undo if T1 updates A, then T2 updates A and
commits, and finally T1 has to abort?
 Logging is done as described earlier.
 Log records of different transactions may be interspersed in the log.
 The checkpointing technique and actions taken on recovery have
to be changed
 since several transactions may be active when a checkpoint is performed.
117
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 Checkpoints are performed as before, except that the checkpoint log record
is now of the form
< checkpoint L>
where L is the list of transactions active at the time of the checkpoint
 We assume no updates are in progress while the checkpoint is carried
out (will relax this later)
 When the system recovers from a crash, it first does the following:
1. Initialize undo-list and redo-list to empty
2. Scan the log backwards from the end, stopping when the first
<checkpoint L> record is found.
For each record found during the backward scan:
 if the record is <Ti commit>, add Ti to redo-list
 if the record is <Ti start>, then if Ti is not in redo-list, add Ti to undo-list
3. For every Ti in L, if Ti is not in redo-list, add Ti to undo-list
118
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 At this point undo-list consists of incomplete transactions which
must be undone, and redo-list consists of finished transactions that
must be redone.
 Recovery now continues as follows:
1. Scan log backwards from most recent record, stopping when
<Ti start> records have been encountered for every Ti in undo-
list.
 During the scan, perform undo for each log record that belongs to a
transaction in undo-list.
2. Locate the most recent <checkpoint L> record.
3. Scan log forwards from the <checkpoint L> record till the end
of the log.
 During the scan, perform redo for each log record that belongs to a
transaction on redo-list
119
PreparedbyR.Arthy,AP/IT,KCET
EXAMPLE OF RECOVERY
 Go over the steps of the recovery algorithm on the
following log:
<T0 start>
<T0, A, 0, 10>
<T0 commit>
<T1 start> /* Scan at step 1 comes up to here */
<T1, B, 0, 10>
<T2 start>
<T2, C, 0, 10>
<T2, C, 10, 20>
<checkpoint {T1, T2}>
<T3 start>
<T3, A, 10, 20>
<T3, D, 0, 10>
<T3 commit>
120
PreparedbyR.Arthy,AP/IT,KCET
LOG RECORD BUFFERING
 Log record buffering: log records are buffered in main
memory, instead of of being output directly to stable storage.
 Log records are output to stable storage when a block of log
records in the buffer is full, or a log force operation is executed.
 Log force is performed to commit a transaction by forcing all
its log records (including the commit record) to stable
storage.
 Several log records can thus be output using a single output
operation, reducing the I/O cost.
121
PreparedbyR.Arthy,AP/IT,KCET
[CONTD…]
 The rules below must be followed if log records are
buffered:
 Log records are output to stable storage in the order in which
they are created.
 Transaction Ti enters the commit state only when the log
record
<Ti commit> has been output to stable storage.
 Before a block of data in main memory is output to the
database, all log records pertaining to data in that block must
have been output to stable storage.
 This rule is called the write-ahead logging or WAL rule
 Strictly speaking WAL only requires undo information to be
output 122
PreparedbyR.Arthy,AP/IT,KCET
DATABASE BUFFERING
 Database maintains an in-memory buffer of data blocks
 When a new block is needed, if buffer is full an existing block needs to be
removed from buffer
 If the block chosen for removal has been updated, it must be output to disk
 If a block with uncommitted updates is output to disk, log records with undo
information for the updates are output to the log on stable storage first
 (Write ahead logging)
 No updates should be in progress on a block when it is output to disk. Can
be ensured as follows.
 Before writing a data item, transaction acquires exclusive lock on block
containing the data item
 Lock can be released once the write is completed.
 Such locks held for short duration are called latches.
 Before a block is output to disk, the system acquires an exclusive latch on the
block
 Ensures no update can be in progress on the block
123
PreparedbyR.Arthy,AP/IT,KCET
BUFFER MANAGEMENT (CONT.)
 Database buffer can be implemented either
 in an area of real main-memory reserved for the database, or
 in virtual memory
 Implementing buffer in reserved main-memory has
drawbacks:
 Memory is partitioned before-hand between database buffer
and applications, limiting flexibility.
 Needs may change, and although operating system knows
best how memory should be divided up at any time, it cannot
change the partitioning of memory.
124
PreparedbyR.Arthy,AP/IT,KCET
BUFFER MANAGEMENT (CONT.)
 Database buffers are generally implemented in virtual memory in spite
of some drawbacks:
 When operating system needs to evict a page that has been
modified, the page is written to swap space on disk.
 When database decides to write buffer page to disk, buffer page
may be in swap space, and may have to be read from swap space
on disk and output to the database on disk, resulting in extra I/O!
 Known as dual paging problem.
 Ideally when OS needs to evict a page from the buffer, it should
pass control to database, which in turn should
1. Output the page to database instead of to swap space (making sure to output
log records first), if it is modified
2. Release the page from the buffer, for the OS to use
Dual paging can thus be avoided, but common operating systems do not support
such functionality.
125
PreparedbyR.Arthy,AP/IT,KCET
FAILURE WITH LOSS OF NONVOLATILE
STORAGE
 So far we assumed no loss of non-volatile storage
 Technique similar to checkpointing used to deal with loss of non-volatile
storage
 Periodically dump the entire content of the database to stable storage
 No transaction may be active during the dump procedure; a procedure
similar to checkpointing must take place
 Output all log records currently residing in main memory onto stable
storage.
 Output all buffer blocks onto the disk.
 Copy the contents of the database to stable storage.
 Output a record <dump> to log on stable storage.
126
PreparedbyR.Arthy,AP/IT,KCET
RECOVERING FROM FAILURE OF NON-VOLATILE STORAGE
 To recover from disk failure
 restore database from most recent dump.
 Consult the log and redo all transactions that committed after the
dump
 Can be extended to allow transactions to be active during dump;
known as fuzzy dump or online dump
 Will study fuzzy checkpointing later
127
PreparedbyR.Arthy,AP/IT,KCET

More Related Content

What's hot

Quick tutorial on IEEE 754 FLOATING POINT representation
Quick tutorial on IEEE 754 FLOATING POINT representationQuick tutorial on IEEE 754 FLOATING POINT representation
Quick tutorial on IEEE 754 FLOATING POINT representation
Ritu Ranjan Shrivastwa
 
Des
DesDes
Csc1401 lecture03 - computer arithmetic - arithmetic and logic unit (alu)
Csc1401   lecture03 - computer arithmetic - arithmetic and logic unit (alu)Csc1401   lecture03 - computer arithmetic - arithmetic and logic unit (alu)
Csc1401 lecture03 - computer arithmetic - arithmetic and logic unit (alu)
IIUM
 
Register organization, stack
Register organization, stackRegister organization, stack
Register organization, stack
Asif Iqbal
 
Graph in data structure
Graph in data structureGraph in data structure
Graph in data structure
Abrish06
 
Aes
AesAes
One time pad Encryption:
One time pad Encryption:One time pad Encryption:
One time pad Encryption:
Asad Ali
 
AES-Advanced Encryption Standard
AES-Advanced Encryption StandardAES-Advanced Encryption Standard
AES-Advanced Encryption Standard
Prince Rachit
 
Encryption And Decryption Using AES Algorithm
Encryption And Decryption Using AES AlgorithmEncryption And Decryption Using AES Algorithm
Encryption And Decryption Using AES Algorithm
Ahmed Raza Shaikh
 
Merge sort algorithm
Merge sort algorithmMerge sort algorithm
Merge sort algorithm
Shubham Dwivedi
 
SEARCHING AND SORTING ALGORITHMS
SEARCHING AND SORTING ALGORITHMSSEARCHING AND SORTING ALGORITHMS
SEARCHING AND SORTING ALGORITHMS
Gokul Hari
 
Dynamic Programming Code-Optimization Algorithm (Compiler Design)
Dynamic Programming Code-Optimization Algorithm (Compiler Design)Dynamic Programming Code-Optimization Algorithm (Compiler Design)
Dynamic Programming Code-Optimization Algorithm (Compiler Design)
Dhrumil Panchal
 
Computer Organization and Design
Computer Organization and DesignComputer Organization and Design
Computer Organization and Design
Ra'Fat Al-Msie'deen
 
instruction cycle ppt
instruction cycle pptinstruction cycle ppt
instruction cycle ppt
sheetal singh
 
Operations on linked list
Operations on linked listOperations on linked list
Operations on linked list
Sumathi Kv
 
Chapter 02 instructions language of the computer
Chapter 02   instructions language of the computerChapter 02   instructions language of the computer
Chapter 02 instructions language of the computer
Bảo Hoang
 
S-DES.ppt
S-DES.pptS-DES.ppt
S-DES.ppt
ArikumarKS1
 
Lecture 10 (serial communication)
Lecture 10 (serial communication)Lecture 10 (serial communication)
Lecture 10 (serial communication)
cairo university
 
pipelining
pipeliningpipelining
pipelining
Siddique Ibrahim
 
Cryptography and Network Security William Stallings Lawrie Brown
Cryptography and Network Security William Stallings Lawrie BrownCryptography and Network Security William Stallings Lawrie Brown
Cryptography and Network Security William Stallings Lawrie Brown
Information Security Awareness Group
 

What's hot (20)

Quick tutorial on IEEE 754 FLOATING POINT representation
Quick tutorial on IEEE 754 FLOATING POINT representationQuick tutorial on IEEE 754 FLOATING POINT representation
Quick tutorial on IEEE 754 FLOATING POINT representation
 
Des
DesDes
Des
 
Csc1401 lecture03 - computer arithmetic - arithmetic and logic unit (alu)
Csc1401   lecture03 - computer arithmetic - arithmetic and logic unit (alu)Csc1401   lecture03 - computer arithmetic - arithmetic and logic unit (alu)
Csc1401 lecture03 - computer arithmetic - arithmetic and logic unit (alu)
 
Register organization, stack
Register organization, stackRegister organization, stack
Register organization, stack
 
Graph in data structure
Graph in data structureGraph in data structure
Graph in data structure
 
Aes
AesAes
Aes
 
One time pad Encryption:
One time pad Encryption:One time pad Encryption:
One time pad Encryption:
 
AES-Advanced Encryption Standard
AES-Advanced Encryption StandardAES-Advanced Encryption Standard
AES-Advanced Encryption Standard
 
Encryption And Decryption Using AES Algorithm
Encryption And Decryption Using AES AlgorithmEncryption And Decryption Using AES Algorithm
Encryption And Decryption Using AES Algorithm
 
Merge sort algorithm
Merge sort algorithmMerge sort algorithm
Merge sort algorithm
 
SEARCHING AND SORTING ALGORITHMS
SEARCHING AND SORTING ALGORITHMSSEARCHING AND SORTING ALGORITHMS
SEARCHING AND SORTING ALGORITHMS
 
Dynamic Programming Code-Optimization Algorithm (Compiler Design)
Dynamic Programming Code-Optimization Algorithm (Compiler Design)Dynamic Programming Code-Optimization Algorithm (Compiler Design)
Dynamic Programming Code-Optimization Algorithm (Compiler Design)
 
Computer Organization and Design
Computer Organization and DesignComputer Organization and Design
Computer Organization and Design
 
instruction cycle ppt
instruction cycle pptinstruction cycle ppt
instruction cycle ppt
 
Operations on linked list
Operations on linked listOperations on linked list
Operations on linked list
 
Chapter 02 instructions language of the computer
Chapter 02   instructions language of the computerChapter 02   instructions language of the computer
Chapter 02 instructions language of the computer
 
S-DES.ppt
S-DES.pptS-DES.ppt
S-DES.ppt
 
Lecture 10 (serial communication)
Lecture 10 (serial communication)Lecture 10 (serial communication)
Lecture 10 (serial communication)
 
pipelining
pipeliningpipelining
pipelining
 
Cryptography and Network Security William Stallings Lawrie Brown
Cryptography and Network Security William Stallings Lawrie BrownCryptography and Network Security William Stallings Lawrie Brown
Cryptography and Network Security William Stallings Lawrie Brown
 

Similar to DBMS Unit III Material

Unit06 dbms
Unit06 dbmsUnit06 dbms
Unit06 dbms
arnold 7490
 
dokumen.tips_silberschatz-korth-and-sudarshan1-transactions-transaction-conce...
dokumen.tips_silberschatz-korth-and-sudarshan1-transactions-transaction-conce...dokumen.tips_silberschatz-korth-and-sudarshan1-transactions-transaction-conce...
dokumen.tips_silberschatz-korth-and-sudarshan1-transactions-transaction-conce...
DrCViji
 
Ch15
Ch15Ch15
Transaction slide
Transaction slideTransaction slide
Transaction slide
shawon roy
 
Dartabase Transaction.pptx
Dartabase Transaction.pptxDartabase Transaction.pptx
Dartabase Transaction.pptx
Bibus Poudel
 
Transaction
TransactionTransaction
Transaction
Amin Omi
 
ch17.pptx
ch17.pptxch17.pptx
ch17_Transaction management in Database Management System
ch17_Transaction management in Database Management Systemch17_Transaction management in Database Management System
ch17_Transaction management in Database Management System
coolscools1231
 
24904 lecture11
24904 lecture1124904 lecture11
Introduction to database-Transaction Concurrency and Recovery
Introduction to database-Transaction Concurrency and RecoveryIntroduction to database-Transaction Concurrency and Recovery
Introduction to database-Transaction Concurrency and Recovery
Ajit Nayak
 
Introduction to transaction processing
Introduction to transaction processingIntroduction to transaction processing
Introduction to transaction processing
Jafar Nesargi
 
Chapter 9 introduction to transaction processing
Chapter 9 introduction to transaction processingChapter 9 introduction to transaction processing
Chapter 9 introduction to transaction processing
Jafar Nesargi
 
Chapter 9 introduction to transaction processing
Chapter 9 introduction to transaction processingChapter 9 introduction to transaction processing
Chapter 9 introduction to transaction processing
Jafar Nesargi
 
Cs501 transaction
Cs501 transactionCs501 transaction
Cs501 transaction
Kamal Singh Lodhi
 
ch14.ppt
ch14.pptch14.ppt
ch14.ppt
SyedMalek2
 
DBMS Transcations
DBMS TranscationsDBMS Transcations
DBMS Transcations
AjayReddy994773
 
Ch15
Ch15Ch15
Dbms sixth chapter_part-1_2011
Dbms sixth chapter_part-1_2011Dbms sixth chapter_part-1_2011
Dbms sixth chapter_part-1_2011
sumit_study
 
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTSDBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
SnehalVinod
 
Introduction to transaction processing concepts and theory
Introduction to transaction processing concepts and theoryIntroduction to transaction processing concepts and theory
Introduction to transaction processing concepts and theory
Zainab Almugbel
 

Similar to DBMS Unit III Material (20)

Unit06 dbms
Unit06 dbmsUnit06 dbms
Unit06 dbms
 
dokumen.tips_silberschatz-korth-and-sudarshan1-transactions-transaction-conce...
dokumen.tips_silberschatz-korth-and-sudarshan1-transactions-transaction-conce...dokumen.tips_silberschatz-korth-and-sudarshan1-transactions-transaction-conce...
dokumen.tips_silberschatz-korth-and-sudarshan1-transactions-transaction-conce...
 
Ch15
Ch15Ch15
Ch15
 
Transaction slide
Transaction slideTransaction slide
Transaction slide
 
Dartabase Transaction.pptx
Dartabase Transaction.pptxDartabase Transaction.pptx
Dartabase Transaction.pptx
 
Transaction
TransactionTransaction
Transaction
 
ch17.pptx
ch17.pptxch17.pptx
ch17.pptx
 
ch17_Transaction management in Database Management System
ch17_Transaction management in Database Management Systemch17_Transaction management in Database Management System
ch17_Transaction management in Database Management System
 
24904 lecture11
24904 lecture1124904 lecture11
24904 lecture11
 
Introduction to database-Transaction Concurrency and Recovery
Introduction to database-Transaction Concurrency and RecoveryIntroduction to database-Transaction Concurrency and Recovery
Introduction to database-Transaction Concurrency and Recovery
 
Introduction to transaction processing
Introduction to transaction processingIntroduction to transaction processing
Introduction to transaction processing
 
Chapter 9 introduction to transaction processing
Chapter 9 introduction to transaction processingChapter 9 introduction to transaction processing
Chapter 9 introduction to transaction processing
 
Chapter 9 introduction to transaction processing
Chapter 9 introduction to transaction processingChapter 9 introduction to transaction processing
Chapter 9 introduction to transaction processing
 
Cs501 transaction
Cs501 transactionCs501 transaction
Cs501 transaction
 
ch14.ppt
ch14.pptch14.ppt
ch14.ppt
 
DBMS Transcations
DBMS TranscationsDBMS Transcations
DBMS Transcations
 
Ch15
Ch15Ch15
Ch15
 
Dbms sixth chapter_part-1_2011
Dbms sixth chapter_part-1_2011Dbms sixth chapter_part-1_2011
Dbms sixth chapter_part-1_2011
 
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTSDBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
 
Introduction to transaction processing concepts and theory
Introduction to transaction processing concepts and theoryIntroduction to transaction processing concepts and theory
Introduction to transaction processing concepts and theory
 

More from ArthyR3

Unit IV Knowledge and Hybrid Recommendation System.pdf
Unit IV Knowledge and Hybrid Recommendation System.pdfUnit IV Knowledge and Hybrid Recommendation System.pdf
Unit IV Knowledge and Hybrid Recommendation System.pdf
ArthyR3
 
VIT336 – Recommender System - Unit 3.pdf
VIT336 – Recommender System - Unit 3.pdfVIT336 – Recommender System - Unit 3.pdf
VIT336 – Recommender System - Unit 3.pdf
ArthyR3
 
OOPs - JAVA Quick Reference.pdf
OOPs - JAVA Quick Reference.pdfOOPs - JAVA Quick Reference.pdf
OOPs - JAVA Quick Reference.pdf
ArthyR3
 
NodeJS and ExpressJS.pdf
NodeJS and ExpressJS.pdfNodeJS and ExpressJS.pdf
NodeJS and ExpressJS.pdf
ArthyR3
 
MongoDB.pdf
MongoDB.pdfMongoDB.pdf
MongoDB.pdf
ArthyR3
 
REACTJS.pdf
REACTJS.pdfREACTJS.pdf
REACTJS.pdf
ArthyR3
 
ANGULARJS.pdf
ANGULARJS.pdfANGULARJS.pdf
ANGULARJS.pdf
ArthyR3
 
JQUERY.pdf
JQUERY.pdfJQUERY.pdf
JQUERY.pdf
ArthyR3
 
Qb it1301
Qb   it1301Qb   it1301
Qb it1301
ArthyR3
 
CNS - Unit v
CNS - Unit vCNS - Unit v
CNS - Unit v
ArthyR3
 
Cs8792 cns - unit v
Cs8792   cns - unit vCs8792   cns - unit v
Cs8792 cns - unit v
ArthyR3
 
Cs8792 cns - unit iv
Cs8792   cns - unit ivCs8792   cns - unit iv
Cs8792 cns - unit iv
ArthyR3
 
Cs8792 cns - unit iv
Cs8792   cns - unit ivCs8792   cns - unit iv
Cs8792 cns - unit iv
ArthyR3
 
Cs8792 cns - unit i
Cs8792   cns - unit iCs8792   cns - unit i
Cs8792 cns - unit i
ArthyR3
 
Java quick reference
Java quick referenceJava quick reference
Java quick reference
ArthyR3
 
Cs8792 cns - Public key cryptosystem (Unit III)
Cs8792   cns - Public key cryptosystem (Unit III)Cs8792   cns - Public key cryptosystem (Unit III)
Cs8792 cns - Public key cryptosystem (Unit III)
ArthyR3
 
Cryptography Workbook
Cryptography WorkbookCryptography Workbook
Cryptography Workbook
ArthyR3
 
Cns
CnsCns
Cns
ArthyR3
 
Cs6701 cryptography and network security
Cs6701 cryptography and network securityCs6701 cryptography and network security
Cs6701 cryptography and network security
ArthyR3
 
Compiler question bank
Compiler question bankCompiler question bank
Compiler question bank
ArthyR3
 

More from ArthyR3 (20)

Unit IV Knowledge and Hybrid Recommendation System.pdf
Unit IV Knowledge and Hybrid Recommendation System.pdfUnit IV Knowledge and Hybrid Recommendation System.pdf
Unit IV Knowledge and Hybrid Recommendation System.pdf
 
VIT336 – Recommender System - Unit 3.pdf
VIT336 – Recommender System - Unit 3.pdfVIT336 – Recommender System - Unit 3.pdf
VIT336 – Recommender System - Unit 3.pdf
 
OOPs - JAVA Quick Reference.pdf
OOPs - JAVA Quick Reference.pdfOOPs - JAVA Quick Reference.pdf
OOPs - JAVA Quick Reference.pdf
 
NodeJS and ExpressJS.pdf
NodeJS and ExpressJS.pdfNodeJS and ExpressJS.pdf
NodeJS and ExpressJS.pdf
 
MongoDB.pdf
MongoDB.pdfMongoDB.pdf
MongoDB.pdf
 
REACTJS.pdf
REACTJS.pdfREACTJS.pdf
REACTJS.pdf
 
ANGULARJS.pdf
ANGULARJS.pdfANGULARJS.pdf
ANGULARJS.pdf
 
JQUERY.pdf
JQUERY.pdfJQUERY.pdf
JQUERY.pdf
 
Qb it1301
Qb   it1301Qb   it1301
Qb it1301
 
CNS - Unit v
CNS - Unit vCNS - Unit v
CNS - Unit v
 
Cs8792 cns - unit v
Cs8792   cns - unit vCs8792   cns - unit v
Cs8792 cns - unit v
 
Cs8792 cns - unit iv
Cs8792   cns - unit ivCs8792   cns - unit iv
Cs8792 cns - unit iv
 
Cs8792 cns - unit iv
Cs8792   cns - unit ivCs8792   cns - unit iv
Cs8792 cns - unit iv
 
Cs8792 cns - unit i
Cs8792   cns - unit iCs8792   cns - unit i
Cs8792 cns - unit i
 
Java quick reference
Java quick referenceJava quick reference
Java quick reference
 
Cs8792 cns - Public key cryptosystem (Unit III)
Cs8792   cns - Public key cryptosystem (Unit III)Cs8792   cns - Public key cryptosystem (Unit III)
Cs8792 cns - Public key cryptosystem (Unit III)
 
Cryptography Workbook
Cryptography WorkbookCryptography Workbook
Cryptography Workbook
 
Cns
CnsCns
Cns
 
Cs6701 cryptography and network security
Cs6701 cryptography and network securityCs6701 cryptography and network security
Cs6701 cryptography and network security
 
Compiler question bank
Compiler question bankCompiler question bank
Compiler question bank
 

Recently uploaded

4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
Gino153088
 
Software Quality Assurance-se412-v11.ppt
Software Quality Assurance-se412-v11.pptSoftware Quality Assurance-se412-v11.ppt
Software Quality Assurance-se412-v11.ppt
TaghreedAltamimi
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
Mahmoud Morsy
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
Yasser Mahgoub
 
CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1
PKavitha10
 
Design and optimization of ion propulsion drone
Design and optimization of ion propulsion droneDesign and optimization of ion propulsion drone
Design and optimization of ion propulsion drone
bjmsejournal
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
Madan Karki
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
kandramariana6
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
bijceesjournal
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
AI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptxAI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptx
architagupta876
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
VANDANAMOHANGOUDA
 
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
Anant Corporation
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
abbyasa1014
 
An Introduction to the Compiler Designss
An Introduction to the Compiler DesignssAn Introduction to the Compiler Designss
An Introduction to the Compiler Designss
ElakkiaU
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
171ticu
 
Mechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdfMechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdf
21UME003TUSHARDEB
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 

Recently uploaded (20)

4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
 
Software Quality Assurance-se412-v11.ppt
Software Quality Assurance-se412-v11.pptSoftware Quality Assurance-se412-v11.ppt
Software Quality Assurance-se412-v11.ppt
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
 
CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1
 
Design and optimization of ion propulsion drone
Design and optimization of ion propulsion droneDesign and optimization of ion propulsion drone
Design and optimization of ion propulsion drone
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
AI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptxAI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptx
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
 
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
 
An Introduction to the Compiler Designss
An Introduction to the Compiler DesignssAn Introduction to the Compiler Designss
An Introduction to the Compiler Designss
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
 
Mechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdfMechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdf
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 

DBMS Unit III Material

  • 1. CS8492 – DATABASE MANAGEMENT SYSTEMS UNIT III - TRANSACTIONS 1 PreparedbyR.Arthy,AP/IT,KCET
  • 2. OUTLINE  Transaction Concepts – ACID Properties  Schedules – Serializability  Concurrency Control –Need for Concurrency  Locking Protocols  Two Phase Locking  Deadlock  Transaction Recovery  Save Points –Isolation Levels  SQL Facilities for Concurrency and Recovery 2 PreparedbyR.Arthy,AP/IT,KCET
  • 4. TRANSACTION CONCEPTS  Collections of operations that form a single logical unit of work are called transactions.  A transaction is a unit of program execution that accesses and possibly updates various data items.  A transaction is delimited by statements (or function calls) of the form begin transaction and end transaction.  Two main issues to deal with:  Failures of various kinds, such as hardware failures and system crashes  Concurrent execution of multiple transactions 4 PreparedbyR.Arthy,AP/IT,KCET
  • 5. ACID PROPERTIES  Atomicity. Either all operations of the transaction are properly reflected in the database or none are.  Consistency. Execution of a transaction in isolation preserves the consistency of the database.  Isolation. Although multiple transactions may execute concurrently, each transaction must be unaware of other concurrently executing transactions. Intermediate transaction results must be hidden from other concurrently executed transactions.  That is, for every pair of transactions Ti and Tj, it appears to Ti that either Tj, finished execution before Ti started, or Tj started execution after Ti finished.  Durability. After a transaction completes successfully, the changes it has made to the database persist, even if there are system failures. 5 PreparedbyR.Arthy,AP/IT,KCET
  • 6. A SIMPLE TRANSACTION MODEL Transactions access data using two operations:  read(X), which transfers the data item X from the database to a variable, also called X, in a buffer in main memory belonging to the transaction that executed the read operation.  write(X), which transfers the value in the variable X in the main-memory buffer of the transaction that executed the write to the data item X in the database. 6 PreparedbyR.Arthy,AP/IT,KCET
  • 7. [CONTD…]  Let Ti be a transaction that transfers $50 from account A to account B. This transaction can be defined as: Ti : read(A); A := A − 50; write(A); read(B); B := B + 50; write(B).  Atomicity requirement  if the transaction fails after step 3 and before step 6, money will be “lost” leading to an inconsistent database state  Failure could be due to software or hardware  the system should ensure that updates of a partially executed transaction are not reflected in the database 7 PreparedbyR.Arthy,AP/IT,KCET
  • 8. [CONTD…]  Durability requirement — once the user has been notified that the transaction has completed (i.e., the transfer of the $50 has taken place), the updates to the database by the transaction must persist even if there are software or hardware failures.  Consistency requirement in above example:  the sum of A and B is unchanged by the execution of the transaction  In general, consistency requirements include  Explicitly specified integrity constraints such as primary keys and foreign keys  Implicit integrity constraints  e.g. sum of balances of all accounts, minus sum of loan amounts must equal value of cash-in-hand  A transaction must see a consistent database.  During transaction execution the database may be temporarily inconsistent.  When the transaction completes successfully the database must be consistent  Erroneous transaction logic can lead to inconsistency 8 PreparedbyR.Arthy,AP/IT,KCET
  • 9. [CONTD…]  Isolation requirement — if between steps 3 and 6, another transaction T2 is allowed to access the partially updated database, it will see an inconsistent database (the sum A + B will be less than it should be). T1 T2 1. read(A) 2. A := A – 50 3. write(A) read(A), read(B), print(A+B) 4. read(B) 5. B := B + 50 6. write(B  Isolation can be ensured trivially by running transactions serially  that is, one after the other.  However, executing multiple transactions concurrently has significant benefits, as we will see later. 9 PreparedbyR.Arthy,AP/IT,KCET
  • 11. [CONTD…]  Active – the initial state; the transaction stays in this state while it is executing  Partially committed – after the final statement has been executed.  Failed -- after the discovery that normal execution can no longer proceed.  Aborted – after the transaction has been rolled back and the database restored to its state prior to the start of the transaction. Two options after it has been aborted:  restart the transaction  can be done only if no internal logical error  kill the transaction  Committed – after successful completion. 11 PreparedbyR.Arthy,AP/IT,KCET
  • 12. QUIZ TIME  Collections of operations that form a single logical unit of work are called __________ a) Views b) Networks c) Units d) Transactions 12 PreparedbyR.Arthy,AP/IT,KCET
  • 13.  The “all-or-none” property is commonly referred to as _________ a) Isolation b) Durability c) Atomicity d) None of the mentioned 13 PreparedbyR.Arthy,AP/IT,KCET
  • 14.  Which of the following is a property of transactions? a) Atomicity b) Durability c) Isolation d) All of the mentioned 14 PreparedbyR.Arthy,AP/IT,KCET
  • 15.  Execution of translation in isolation preserves the _________ of a database a) Atomicity b) Consistency c) Durability d) All of the mentioned 15 PreparedbyR.Arthy,AP/IT,KCET
  • 16.  Which of the following is not a property of a transaction? a) Atomicity b) Simplicity c) Isolation d) Durability 16 PreparedbyR.Arthy,AP/IT,KCET
  • 17.  A transaction that has not been completed successfully is called as _______ a) Compensating transaction b) Aborted transaction c) Active transaction d) Partially committed transaction 17 PreparedbyR.Arthy,AP/IT,KCET
  • 18.  Which of the following is not a transaction state? a) Active b) Partially committed c) Failed d) Compensated 18 PreparedbyR.Arthy,AP/IT,KCET
  • 19. PART B QUESTIONS  Write a short note on Transaction concepts. [Nov 2014]  During execution, a transaction passes through several states, until it finally commits or aborts. List all possible sequences of states through which a transaction may pass. Explain why each state transition may occur? [May 2018]  Explain with an example the properties that must be satisfied by a transaction. [May 2018] 19 PreparedbyR.Arthy,AP/IT,KCET
  • 21. CONCURRENT EXECUTIONS  Multiple transactions are allowed to run concurrently in the system. Advantages are:  increased processor and disk utilization, leading to better transaction throughput  E.g. one transaction can be using the CPU while another is reading from or writing to the disk  reduced average response time for transactions: short transactions need not wait behind long ones.  Concurrency control schemes – mechanisms to achieve isolation  that is, to control the interaction among the concurrent transactions in order to prevent them from destroying the consistency of the database 21 PreparedbyR.Arthy,AP/IT,KCET
  • 22. SCHEDULES  Schedule – a sequences of instructions that specify the chronological order in which instructions of concurrent transactions are executed  a schedule for a set of transactions must consist of all instructions of those transactions  must preserve the order in which the instructions appear in each individual transaction.  A transaction that successfully completes its execution will have a commit instructions as the last statement  by default transaction assumed to execute commit instruction as its last step  A transaction that fails to successfully complete its execution will have an abort instruction as the last statement 22 PreparedbyR.Arthy,AP/IT,KCET
  • 23. SCHEDULE 1  Let T1 transfer $50 from A to B, and T2 transfer 10% of the balance from A to B.  A serial schedule in which T1 is followed by T2 : 23 PreparedbyR.Arthy,AP/IT,KCET
  • 24. SCHEDULE 2  A serial schedule where T2 is followed by T1 24 PreparedbyR.Arthy,AP/IT,KCET
  • 25. SCHEDULE 3  Let T1 and T2 be the transactions defined previously. The following schedule is not a serial schedule, but it is equivalent to Schedule 1. 25 PreparedbyR.Arthy,AP/IT,KCET
  • 26. SCHEDULE 4  The following concurrent schedule does not preserve the value of (A + B ). 26 PreparedbyR.Arthy,AP/IT,KCET
  • 27. SERIALIZABILITY  Basic Assumption – Each transaction preserves database consistency.  Thus serial execution of a set of transactions preserves database consistency.  A (possibly concurrent) schedule is serializable if it is equivalent to a serial schedule. Different forms of schedule equivalence give rise to the notions of: 1. Conflict serializability 2. View serializability  Simplified view of transactions  We ignore operations other than read and write instructions  We assume that transactions may perform arbitrary computations on data in local buffers in between reads and writes.  Our simplified schedules consist of only read and write instructions. 27 PreparedbyR.Arthy,AP/IT,KCET
  • 28. CONFLICTING INSTRUCTIONS  Instructions li and lj of transactions Ti and Tj respectively, conflict if and only if there exists some item Q accessed by both li and lj, and at least one of these instructions wrote Q. 1. li = read(Q), lj = read(Q). li and lj don’t conflict. 2. li = read(Q), lj = write(Q). They conflict. 3. li = write(Q), lj = read(Q). They conflict 4. li = write(Q), lj = write(Q). They conflict  Intuitively, a conflict between li and lj forces a (logical) temporal order between them.  If li and lj are consecutive in a schedule and they do not conflict, their results would remain the same even if they had been interchanged in the schedule. 28 PreparedbyR.Arthy,AP/IT,KCET
  • 29. CONFLICT SERIALIZABILITY  If a schedule S can be transformed into a schedule S´ by a series of swaps of non-conflicting instructions, we say that S and S´ are conflict equivalent.  We say that a schedule S is conflict serializable if it is conflict equivalent to a serial schedule 29 PreparedbyR.Arthy,AP/IT,KCET
  • 30. [CONTD…]  Schedule 3 can be transformed into Schedule 6, a serial schedule where T2 follows T1, by series of swaps of non-conflicting instructions.  Therefore Schedule 3 is conflict serializable. 30 PreparedbyR.Arthy,AP/IT,KCET Schedule 3 Schedule 6
  • 31. [CONTD…]  Example of a schedule that is not conflict serializable:  We are unable to swap instructions in the above schedule to obtain either the serial schedule < T3, T4 >, or the serial schedule < T4, T3 >. 31 PreparedbyR.Arthy,AP/IT,KCET
  • 32. [CONTD…]  To determine conflict serializability of a schedule.  Consider a schedule S. We construct a directed graph, called a precedence graph, from S.  This graph consists of a pair G = (V, E), where  V is a set of vertices and E is a set of edges. The set of vertices consists of all the transactions participating in the schedule.  The set of edges consists of all edges  Ti →Tj for which one of three conditions holds:  Ti executes write(Q) before Tj executes read(Q).  Ti executes read(Q) before Tj executes write(Q).  Ti executes write(Q) before Tj executes write(Q). 32 PreparedbyR.Arthy,AP/IT,KCET
  • 33. [CONTD…]  If an edge Ti → Tj exists in the precedence graph, then, in any serial schedule S equivalent to S, Ti must appear before Tj . 33 PreparedbyR.Arthy,AP/IT,KCET
  • 34. [CONTD…]  If the precedence graph for S has a cycle, then schedule S is not conflict serializable.  If the graph contains no cycles, then the schedule S is conflict serializable. 34 PreparedbyR.Arthy,AP/IT,KCET
  • 35. TEST FOR CONFLICT SERIALIZABILITY  A schedule is conflict serializable if and only if its precedence graph is acyclic.  Cycle-detection algorithms exist which take order n2 time, where n is the number of vertices in the graph.  (Better algorithms take order n + e where e is the number of edges.)  If precedence graph is acyclic, the serializability order can be obtained by a topological sorting of the graph.  This is a linear order consistent with the partial order of the graph. 35 PreparedbyR.Arthy,AP/IT,KCET
  • 36. VIEW SERIALIZABILITY  Let S and S´ be two schedules with the same set of transactions. S and S´ are view equivalent if the following three conditions are met, for each data item Q, 1. If in schedule S, transaction Ti reads the initial value of Q, then in schedule S’ also transaction Ti must read the initial value of Q. 2. If in schedule S transaction Ti executes read(Q), and that value was produced by transaction Tj (if any), then in schedule S’ also transaction Ti must read the value of Q that was produced by the same write(Q) operation of transaction Tj . 3. The transaction (if any) that performs the final write(Q) operation in schedule S must also perform the final write(Q) operation in schedule S’. As can be seen, view equivalence is also based purely on reads and writes alone. 36 PreparedbyR.Arthy,AP/IT,KCET
  • 37. [CONTD…]  A schedule S is view serializable if it is view equivalent to a serial schedule.  Every conflict serializable schedule is also view serializable.  Every view serializable schedule that is not conflict serializable has blind writes. 37 PreparedbyR.Arthy,AP/IT,KCET
  • 38. TEST FOR VIEW SERIALIZABILITY  The precedence graph test for conflict serializability cannot be used directly to test for view serializability.  Extension to test for view serializability has cost exponential in the size of the precedence graph.  The problem of checking if a schedule is view serializable falls in the class of NP-complete problems.  Thus existence of an efficient algorithm is extremely unlikely.  However practical algorithms that just check some sufficient conditions for view serializability can still be used. 38 PreparedbyR.Arthy,AP/IT,KCET
  • 39. RECOVERABLE SCHEDULES  Recoverable schedule — if a transaction Tj reads a data item previously written by a transaction Ti , then the commit operation of Ti appears before the commit operation of Tj.  The following schedule (Schedule 11) is not recoverable if T9 commits immediately after the read  If T8 should abort, T9 would have read (and possibly shown to the user) an inconsistent database state. Hence, database must ensure that schedules are recoverable. 39 PreparedbyR.Arthy,AP/IT,KCET
  • 40. CASCADING ROLLBACKS  Cascading rollback – a single transaction failure leads to a series of transaction rollbacks. Consider the following schedule where none of the transactions has yet committed (so the schedule is recoverable) If T10 fails, T11 and T12 must also be rolled back.  Can lead to the undoing of a significant amount of work 40 PreparedbyR.Arthy,AP/IT,KCET
  • 41. QUIZ TIME  I and J are _________ if they are operations by different transactions on the same data item, and at least one of them is a write operation. a) Conflicting b) Overwriting c) Isolated d) Durable 41 PreparedbyR.Arthy,AP/IT,KCET
  • 42.  If a schedule S can be transformed into a schedule S’ by a series of swaps of non-conflicting instructions, then S and S’ are a) Non conflict equivalent b) Equal c) Conflict equivalent d) Isolation equivalent 42 PreparedbyR.Arthy,AP/IT,KCET
  • 43.  A schedule is __________ if it is conflict equivalent to a serial schedule. a) Conflict serializable b) Conflicting c) Non serializable d) None of the mentioned 43 PreparedbyR.Arthy,AP/IT,KCET
  • 44.  The set of ________ in a precedence graph consists of all the transactions participating in the schedule a) Vertices b) Edges c) Directions d) None of the mentioned 44 PreparedbyR.Arthy,AP/IT,KCET
  • 45.  A ___________of the transactions can be obtained by finding a linear order consistent with the partial order of the precedence graph. a) Serializability order b) Direction graph c) Precedence graph d) Scheduling scheme 45 PreparedbyR.Arthy,AP/IT,KCET
  • 46.  State true or false: If I = read(Q) and J = read(Q) then the order of I and J does not matter. a) True b) False 46 PreparedbyR.Arthy,AP/IT,KCET
  • 47.  State true or false: If I = read(Q) and J = write(Q) then the order of I and J does not matter. a) True b) False 47 PreparedbyR.Arthy,AP/IT,KCET
  • 48.  The phenomenon in which one failure leads to a series of transaction rollbacks is called as ________ a) Cascading rollback b) Cascadeless rollback c) Cascade cause d) None of the mentioned 48 PreparedbyR.Arthy,AP/IT,KCET
  • 49.  A ___________ is one where, for each pair of transactions Ti and Tj such that Tj reads a data item previously written by Ti , the commit operation of Ti appears before the commit operation of Tj a) Partial schedule b) Dependent schedule c) Recoverable schedule d) None of the mentioned 49 PreparedbyR.Arthy,AP/IT,KCET
  • 50.  Which of the following are the advantages of transaction concurrency? a) Increased throughput b) Increased utilization c) Reduces average response time d) All of the mentioned 50 PreparedbyR.Arthy,AP/IT,KCET
  • 51. GATE QUESTIONS The following functional dependencies hold true for the relational schema R{V, W, X, Y, Z}: V -> W VW -> X Y -> VX Y -> Z Which of the following is irreducible equivalent for this set of functional dependencies? GATE | GATE-CS-2017 (Set 1) | Question 15 (A) A (B) B (C) C (D) D 51 PreparedbyR.Arthy,AP/IT,KCET
  • 52. [CONTD…] Consider a database that has the relation schemas EMP (EmpId, EmpName, DepId), and DEPT(DeptName, DeptId). Note that the DepId can be permitted to be NULL in the relation EMP. Consider the following queries on the database expressed in tuple relational calculus. I. {t | ∃ u ∈ EMP (t[EMPName] = u[EmpName] ∧ ∀ v ∈ DEPT (t[DeptId] ≠ DeptId]))} II. {t | ∃ u ∈ EMP (t[EMPName] = u[EmpName] ∧ ∃ v ∈ DEPT (t[DeptId] ≠ DeptId]))} III. {t | ∃ u ∈ EMP (t[EMPName] = u[EmpName] ∧ ∃ v ∈ DEPT (t[DeptId] = DeptId]))} GATE | GATE-CS-2017 (Set 1) | Question 36 (A) I and II only (B) I and III only (C) II and III only (D) I, II, and III 52 PreparedbyR.Arthy,AP/IT,KCET
  • 53. PART B QUESTIONS  Discuss view serializability and conflict serializability. [Nov 2015, May 2018]  Consider the following extension to the tree locking protocol, which allows both shared and exclusive locks: i) A transaction can be either a read only transaction, in which case it can request only shared locks or an update transaction in which case it can request only exclusive locks. ii) Each transaction must follow the rules of the tree protocol. Read only transactions may lock any data item first, where as update transactions must lock the root first. Show that the protocol ensures Serializability and deadlock freedom. [Nov 2016]  Discuss in detail about the testing of serializability. [May 2019] 53 PreparedbyR.Arthy,AP/IT,KCET
  • 55. NEED FOR CONCURRENCY CONTROL  In the concurrency control, the multiple transactions can be executed simultaneously.  It may affect the transaction result. It is highly important to maintain the order of execution of those transactions.  Several problems can occur when concurrent transactions are executed in an uncontrolled manner.  Following are the three problems in concurrency control.  Lost updates  Dirty read  Unrepeatable read 55 PreparedbyR.Arthy,AP/IT,KCET
  • 56. LOST UPDATE PROBLEM  When two transactions that access the same database items contain their operations in a way that makes the value of some database item incorrect, then the lost update problem occurs.  If two transactions T1 and T2 read a record and then update it, then the effect of updating of the first record will be overwritten by the second update. 56 PreparedbyR.Arthy,AP/IT,KCET
  • 57. DIRTY READ  A dirty read is the situation when a transaction reads a data that has not yet been committed.  Example:  Transaction 1 updates a row and leaves it uncommitted  Transaction 2 reads the updated row  If transaction 1 rolls back the change, transaction 2 will have read data that is considered never to have existed. 57 PreparedbyR.Arthy,AP/IT,KCET
  • 58. UNREPEATABLE READ  Non repeatable read occurs when a transaction reads same row twice but received a different value each time.  Example:  Transaction T1 reads a data  Due to concurrency, another transaction T2 updates the same data and commit  Transaction T1 rereads the same data, it will retrieve a different value 58 PreparedbyR.Arthy,AP/IT,KCET
  • 59. CONCURRENCY CONTROL PROTOCOL  Concurrency control protocols ensure atomicity, isolation, and serializability of concurrent transactions.  The concurrency control protocol can be divided into three categories:  Lock based protocol  Time-stamp protocol  Validation based protocol 59 PreparedbyR.Arthy,AP/IT,KCET
  • 60. LOCK-BASED PROTOCOLS  A lock is a mechanism to control concurrent access to a data item  Data items can be locked in two modes : 1. exclusive (X) mode. Data item can be both read as well as written. X-lock is requested using lock-X instruction. 2. shared (S) mode. Data item can only be read. S-lock is requested using lock-S instruction.  Lock requests are made to concurrency-control manager. Transaction can proceed only after request is granted. 60 PreparedbyR.Arthy,AP/IT,KCET
  • 61. [CONTD…]  Lock-compatibility matrix  A transaction may be granted a lock on an item if the requested lock is compatible with locks already held on the item by other transactions  Any number of transactions can hold shared locks on an item,  but if any transaction holds an exclusive on the item no other transaction may hold any lock on the item.  If a lock cannot be granted, the requesting transaction is made to wait till all incompatible locks held by other transactions have been released. The lock is then granted. 61 PreparedbyR.Arthy,AP/IT,KCET
  • 62. RULES FOR LOCKING  If a transaction has a read lock on a data item, then it can only read the item but cannot update.  If a transaction has a write lock on a data item, then it can both read and update the data item.  If a transaction has a write lock on a data item, then other transactions cannot acquire read or write locks on the data item. 62 PreparedbyR.Arthy,AP/IT,KCET
  • 63. PRE-CLAIMING LOCK PROTOCOL  It evaluate their operations and create a list of data items on which they need locks.  Before initiating an execution, the transaction requests the system for all the locks it needs beforehand.  If all the locks are granted, the transaction executes and releases all the locks when all its operations are over.  If all the locks are not granted, the transaction rolls back and waits until all t he locks are granted. 63 PreparedbyR.Arthy,AP/IT,KCET T begin T end Time Lock acquisition phase
  • 64. [CONTD…]  Example of a transaction performing locking: T2: lock-S(A); read (A); unlock(A); lock-S(B); read (B); unlock(B); display(A+B)  Locking as above is not sufficient to guarantee serializability — if A and B get updated in-between the read of A and B, the displayed sum would be wrong.  A locking protocol is a set of rules followed by all transactions while requesting and releasing locks. Locking protocols restrict the set of possible schedules. 64 PreparedbyR.Arthy,AP/IT,KCET
  • 66. PROBLEM  Consider the partial schedule  Neither T3 nor T4 can make progress — executing lock-S(B) causes T4 to wait for T3 to release its lock on B, while executing lock-X(A) causes T3 to wait for T4 to release its lock on A.  Such a situation is called a deadlock.  To handle a deadlock one of T3 or T4 must be rolled back and its locks released. 66 PreparedbyR.Arthy,AP/IT,KCET
  • 67. [CONTD…]  The potential for deadlock exists in most locking protocols. Deadlocks are a necessary evil.  Starvation is also possible if concurrency control manager is badly designed. For example:  A transaction may be waiting for an X-lock on an item, while a sequence of other transactions request and are granted an S- lock on the same item.  The same transaction is repeatedly rolled back due to deadlocks.  Concurrency control manager can be designed to prevent starvation. 67 PreparedbyR.Arthy,AP/IT,KCET
  • 68. TWO-PHASE LOCKING PROTOCOL  This is a protocol which ensures conflict-serializable schedules.  Phase 1: Growing Phase  transaction may obtain locks  transaction may not release locks  Phase 2: Shrinking Phase  transaction may release locks  transaction may not obtain locks 68 PreparedbyR.Arthy,AP/IT,KCET
  • 69. [CONTD…]  The protocol assures serializability.  It can be proved that the transactions can be serialized in the order of their lock points (i.e. the point where a transaction acquired its final lock). 69 PreparedbyR.Arthy,AP/IT,KCET T begin T end Time Lock acquisition phase Lock releasing phase
  • 70. [CONTD…]  Two-phase locking does not ensure freedom from deadlocks  Cascading roll-back is possible under two-phase locking.  To avoid this, follow a modified protocol called strict two-phase locking. Here a transaction must hold all its exclusive locks till it commits/aborts.  Rigorous two-phase locking is even stricter: here all locks are held till commit/abort.  In this protocol transactions can be serialized in the order in which they commit. 70 PreparedbyR.Arthy,AP/IT,KCET
  • 71. [CONTD…]  Two-phase locking with lock conversions: – First Phase:  can acquire a lock-S on item  can acquire a lock-X on item  can convert a lock-S to a lock-X (upgrade) – Second Phase:  can release a lock-S  can release a lock-X  can convert a lock-X to a lock-S (downgrade)  This protocol assures serializability. But still relies on the programmer to insert the various locking instructions. 71 PreparedbyR.Arthy,AP/IT,KCET
  • 72. [CONTD…]  A transaction Ti issues the standard read/write instruction, without explicit locking calls.  The operation read(D) is processed as: if Ti has a lock on D then read(D) else begin if necessary wait until no other transaction has a lock-X on D grant Ti a lock-S on D; read(D) end 72 PreparedbyR.Arthy,AP/IT,KCET
  • 73. [CONTD…]  write(D) is processed as: if Ti has a lock-X on D then write(D) else begin if necessary wait until no other trans. has any lock on D, if Ti has a lock-S on D then upgrade lock on D to lock-X else grant Ti a lock-X on D write(D) end;  All locks are released after commit or abort 73 PreparedbyR.Arthy,AP/IT,KCET
  • 74. IMPLEMENTATION OF LOCKING  A lock manager can be implemented as a separate process to which transactions send lock and unlock requests  The lock manager replies to a lock request by sending a lock grant messages (or a message asking the transaction to roll back, in case of a deadlock)  The requesting transaction waits until its request is answered  The lock manager maintains a data-structure called a lock table to record granted locks and pending requests  The lock table is usually implemented as an in-memory hash table indexed on the name of the data item being locked 74 PreparedbyR.Arthy,AP/IT,KCET
  • 75. LOCK TABLE  Black rectangles indicate granted locks, white ones indicate waiting requests  Lock table also records the type of lock granted or requested  New request is added to the end of the queue of requests for the data item, and granted if it is compatible with all earlier locks  Unlock requests result in the request being deleted, and later requests are checked to see if they can now be granted  If transaction aborts, all waiting or granted requests of the transaction are deleted  lock manager may keep a list of locks held by each transaction, to implement this efficiently` 75 PreparedbyR.Arthy,AP/IT,KCET
  • 76. TIMESTAMP-BASED PROTOCOLS  Each transaction is issued a timestamp when it enters the system. If an old transaction Ti has time-stamp TS(Ti), a new transaction Tj is assigned time-stamp TS(Tj) such that TS(Ti) <TS(Tj).  The protocol manages concurrent execution such that the time-stamps determine the serializability order.  In order to assure such behavior, the protocol maintains for each data Q two timestamp values:  W-timestamp(Q) is the largest time-stamp of any transaction that executed write(Q) successfully.  R-timestamp(Q) is the largest time-stamp of any transaction that executed read(Q) successfully. 76 PreparedbyR.Arthy,AP/IT,KCET
  • 77. VALIDATION-BASED PROTOCOL  Execution of transaction Ti is done in three phases. 1. Read and execution phase: Transaction Ti writes only to temporary local variables 2. Validation phase: Transaction Ti performs a ``validation test'' to determine if local variables can be written without violating serializability. 3. Write phase: If Ti is validated, the updates are applied to the database; otherwise, Ti is rolled back.  The three phases of concurrently executing transactions can be interleaved, but each transaction must go through the three phases in that order.  Assume for simplicity that the validation and write phase occur together, atomically and serially  i.e., only one transaction executes validation/write at a time.  Also called as optimistic concurrency control since transaction executes fully in the hope that all will go well during validation 77 PreparedbyR.Arthy,AP/IT,KCET
  • 78. PART B QUESTIONS  What is concurrency? Explain it in terms of locking mechanism and two phase locking protocol. [Nov 2014, May 2016, Nov 2016, Nov 2017, May 2018]  What is concurrency control? How is it implemented in DBMS? Illustrate with a suitable example. [Nov 2015]  Explain about locking protocols. [May 2016]  Explain lock based concurrency control with suitable example. [Nov 2017]  Differentiate strict two phase locking protocol and rigorous two phase locking protocol. [Nov 2018]  How the time stamps are implemented? Explain. [Nov 2018] 78 PreparedbyR.Arthy,AP/IT,KCET
  • 79. [CONTD…]  Consider the following two transactions: T1 and T2 works in parallel. Add lock and unlock instructions to transactions T1 and T2 so that they observe the two phase locking protocol. Can the execution of these transactions result in a deadlock? [Nov 2016] 79 PreparedbyR.Arthy,AP/IT,KCET T1 T2 read(A); read(B), if A=0 then B:=B+1; write(B). read(B); read(A); if B=0 then A:A+1; write(A).
  • 81. INTRODUCTION  In a multi-process system, deadlock is an unwanted situation that arises in a shared resource environment.  Where a process indefinitely waits for a resource that is held by another process.  Definition  System is deadlocked if there is a set of transactions such that every transaction in the set is waiting for another transaction in the set.  For example:  Assume a set of transactions {T0, T1, T2} 81 PreparedbyR.Arthy,AP/IT,KCET
  • 82. [CONTD…]  T0 needs a resource X to complete its task  Resource X is held by T1, and T1 is waiting for a resource Y, which is held by T2  T2 is waiting for the resource Z, which is held by T0 82 PreparedbyR.Arthy,AP/IT,KCET T0 T1 T2
  • 83. DEADLOCK PREVENTION  Deadlock prevention protocols ensure that the system will never enter into a deadlock state.  Some prevention strategies :  Require that each transaction locks all its data items before it begins execution (predeclaration).  Impose partial ordering of all data items and require that a transaction can lock data items only in the order specified by the partial order (graph-based protocol).  Schemes:  Wait – Die Scheme  Wound – Wait Scheme 83 PreparedbyR.Arthy,AP/IT,KCET
  • 84. WAIT-DIE SCHEME — NON-PREEMPTIVE  In this scheme, is a transaction requests to lock a data item, which is already locked by another transaction, then one of the two possibilities may occur:  If TS(Ti) < TS(Tj) – that is Ti is older than Tj, which is requesting a conflicting lock, is older than Tj – then Ti is allowed to wait until the data item is available.  If TS(Ti) > TS(Tj) – that is Ti is younger than Tj – then Ti is aborted. Ti is restarted later with a random delay but with the same timestamp.  Older transaction may wait for younger one to release data item. Younger transactions never wait for older ones; they are rolled back instead.  A transaction may die several times before acquiring needed data item 84 PreparedbyR.Arthy,AP/IT,KCET
  • 85. WOUND-WAIT SCHEME  In this scheme, is a transaction requests to lock a data item, which is already locked by another transaction, then one of the two possibilities may occur:  If TS(Ti) < TS(Tj) – that is Ti is older than Tj, then Ti forces Tj to be rolled back – that is Ti wounds Tj. Tj is restarted later with a random delay but with the same timestamp.  If TS(Ti) > TS(Tj) – that is Ti is younger than Tj – Ti is forced to wait until the resource is available.  Older transaction wounds (forces rollback) of younger transaction instead of waiting for it. Younger transactions may wait for older ones.  May be fewer rollbacks than wait-die scheme. 85 PreparedbyR.Arthy,AP/IT,KCET
  • 86. DEADLOCK AVOIDANCE  Deadlocks can be described as a wait-for graph, which consists of a pair G = (V,E),  V is a set of vertices (all the transactions in the system)  E is a set of edges; each element is an ordered pair Ti Tj.  If Ti  Tj is in E, then there is a directed edge from Ti to Tj, implying that Ti is waiting for Tj to release a data item.  When Ti requests a data item currently being held by Tj, then the edge Ti Tj is inserted in the wait-for graph. This edge is removed only when Tj is no longer holding a data item needed by Ti.  The system is in a deadlock state if and only if the wait-for graph has a cycle. Must invoke a deadlock-detection algorithm periodically to look for cycles. 86 PreparedbyR.Arthy,AP/IT,KCET
  • 88. DEADLOCK RECOVERY  When deadlock is detected :  Some transaction will have to rolled back (made a victim) to break deadlock. Select that transaction as victim that will incur minimum cost.  Rollback -- determine how far to roll back transaction  Total rollback: Abort the transaction and then restart it.  More effective to roll back transaction only as far as necessary to break deadlock.  Starvation happens if same transaction is always chosen as victim. Include the number of rollbacks in the cost factor to avoid starvation 88 PreparedbyR.Arthy,AP/IT,KCET
  • 89. PART B QUESTIONS  Write a short note on deadlock. [Nov 2014]  What is deadlock? How does it occur? How transactions be written to avoid deadlock and guarantee correct execution. Illustrate with suitable example. [Nov 2015]  Outline the deadlock handling mechanisms. [Nov 2016, Nov 2018]  Explain the concepts of Deadlock avoidance and prevention in detail. [Nov 2018] 89 PreparedbyR.Arthy,AP/IT,KCET
  • 90. PROBLEMS  Consider the following schedules. The actions are listed in the order they are scheduled and prefixed with the transaction name:  S1: T1:R(X),T2:R(X),T1:W(Y),T2:W(Y),T1:R(Y),T2:R(Y)  S2: T3:W(X),T1:R(X),T1:W(Y),T2:W(Z),T3:R(Z)  For each of the schedules answer the following questions:  What is the precedence graph for the schedule?  Is the schedule conflict serializable? If so what are all the conflict equivalent serial schedules?  Is the schedule view serializable? If so what are all the view equivalent serial schedule? [May 2015] 90 PreparedbyR.Arthy,AP/IT,KCET
  • 91. [CONTD…]  Consider the following schedules S1 and S2 with three transactions T1,T2 and T3 as follows;  S1:r1(X);r3(X);w1(X);r2(X);w3(X)  S2:r3(X);r2(X);w3(X);r1(X);w1(X)  Is the schedule S1 and S2 conflict serializable? 91 PreparedbyR.Arthy,AP/IT,KCET
  • 92. [CONTD…]  Consider a schedule S with three transactions T1,T2 and T3 as follows;  S1: R1(x);W2(x);R1(x);W1(y);commit1;commit2;  S2: R1(A) , R2(A) , R1(B) , R2(B) , R3(B) , W1(A) , W2(B)  Is the schedule S1 and S2 conflict serializable? 92 PreparedbyR.Arthy,AP/IT,KCET
  • 93. [CONTD…]  Consider schedules s3,s4,and s5 below. Determine whether each schedule is strict, cascadeless, recoverable or nonrecoverable. (Determine the strictest recoverability condition that each schedule satisfies.)  S3:r1(X);r2(Z);r3(X);r3(Y);w1(X);c1;w3(Y);c3;r2(Y);w2(Z );w2(Y);C2;  s4 : r1(X);r2(Z);r1(Z);r3(X),r3(Y);w1(X);w3(Y);r2(Y);w2(Z);w 2(Y);c1;c2;c3;  s5 : r1(X);r2(Z);r3(X);r1(Z);r2(Y);r3(Y);w1(X);C1;w2(Z);w3(Y );w2(Y);c3;c2; 93 PreparedbyR.Arthy,AP/IT,KCET
  • 95. PART B QUESTIONS  Explain deferred and immediate modification versions of the log based recovery scheme. [May 2019]  Explain why timestamp based concurrency control allows schedules that are not recoverable. Describe how it can be modified through buffering to disallow such schedules. 95 PreparedbyR.Arthy,AP/IT,KCET
  • 96. FAILURE CLASSIFICATION  Transaction failure :  Logical errors: transaction cannot complete due to some internal error condition  System errors: the database system must terminate an active transaction due to an error condition (e.g., deadlock)  System crash: a power failure or other hardware or software failure causes the system to crash.  Fail-stop assumption: non-volatile storage contents are assumed to not be corrupted by system crash  Database systems have numerous integrity checks to prevent corruption of disk data  Disk failure: a head crash or similar disk failure destroys all or part of disk storage  Destruction is assumed to be detectable: disk drives use checksums to detect failures 96 PreparedbyR.Arthy,AP/IT,KCET
  • 97. RECOVERY ALGORITHMS  Recovery algorithms are techniques to ensure database consistency and transaction atomicity and durability despite failures  Focus of this chapter  Recovery algorithms have two parts 1. Actions taken during normal transaction processing to ensure enough information exists to recover from failures 2. Actions taken after a failure to recover the database contents to a state that ensures atomicity, consistency and durability 97 PreparedbyR.Arthy,AP/IT,KCET
  • 98. STORAGE STRUCTURE  Volatile storage:  does not survive system crashes  examples: main memory, cache memory  Nonvolatile storage:  survives system crashes  examples: disk, tape, flash memory, non-volatile (battery backed up) RAM  Stable storage:  a mythical form of storage that survives all failures  approximated by maintaining multiple copies on distinct nonvolatile media 98 PreparedbyR.Arthy,AP/IT,KCET
  • 99. STABLE-STORAGE IMPLEMENTATION  Maintain multiple copies of each block on separate disks  copies can be at remote sites to protect against disasters such as fire or flooding.  Failure during data transfer can still result in inconsistent copies: Block transfer can result in  Successful completion  Partial failure: destination block has incorrect information  Total failure: destination block was never updated  Protecting storage media from failure during data transfer (one solution):  Execute output operation as follows (assuming two copies of each block): 1. Write the information onto the first physical block. 2. When the first write successfully completes, write the same information onto the second physical block. 3. The output is completed only after the second write successfully completes. 99 PreparedbyR.Arthy,AP/IT,KCET
  • 100. [CONTD…]  Protecting storage media from failure during data transfer (cont.):  Copies of a block may differ due to failure during output operation. To recover from failure: 1. First find inconsistent blocks: 1. Expensive solution: Compare the two copies of every disk block. 2. Better solution:  Record in-progress disk writes on non-volatile storage (Non-volatile RAM or special area of disk).  Use this information during recovery to find blocks that may be inconsistent, and only compare copies of these.  Used in hardware RAID systems 2. If either copy of an inconsistent block is detected to have an error (bad checksum), overwrite it by the other copy. If both have no error, but are different, overwrite the second block by the first block. 100 PreparedbyR.Arthy,AP/IT,KCET
  • 101. DATA ACCESS  Physical blocks are those blocks residing on the disk.  Buffer blocks are the blocks residing temporarily in main memory.  Block movements between disk and main memory are initiated through the following two operations:  input(B) transfers the physical block B to main memory.  output(B) transfers the buffer block B to the disk, and replaces the appropriate physical block there.  Each transaction Ti has its private work-area in which local copies of all data items accessed and updated by it are kept.  Ti's local copy of a data item X is called xi.  We assume, for simplicity, that each data item fits in, and is stored inside, a single block. 101 PreparedbyR.Arthy,AP/IT,KCET
  • 102. [CONTD…]  Transaction transfers data items between system buffer blocks and its private work-area using the following operations :  read(X) assigns the value of data item X to the local variable xi.  write(X) assigns the value of local variable xi to data item {X} in the buffer block.  both these commands may necessitate the issue of an input(BX) instruction before the assignment, if the block BX in which X resides is not already in memory.  Transactions  Perform read(X) while accessing X for the first time;  All subsequent accesses are to the local copy.  After last access, transaction executes write(X).  output(BX) need not immediately follow write(X). System can perform the output operation when it deems fit. 102 PreparedbyR.Arthy,AP/IT,KCET
  • 103. EXAMPLE OF DATA ACCESS X Y A B x1 y1 buffer Buffer Block A Buffer Block B input(A) output(B) read(X) write(Y) disk work area of T1 work area of T2 memory x2 103 PreparedbyR.Arthy,AP/IT,KCET
  • 104. RECOVERY AND ATOMICITY  Modifying the database without ensuring that the transaction will commit may leave the database in an inconsistent state.  Consider transaction Ti that transfers $50 from account A to account B; goal is either to perform all database modifications made by Ti or none at all.  Several output operations may be required for Ti (to output A and B). A failure may occur after one of these modifications have been made but before all of them are made. 104 PreparedbyR.Arthy,AP/IT,KCET
  • 105. [CONTD…]  To ensure atomicity despite failures, we first output information describing the modifications to stable storage without modifying the database itself.  We study two approaches:  log-based recovery, and  shadow-paging  We assume (initially) that transactions run serially, that is, one after the other. 105 PreparedbyR.Arthy,AP/IT,KCET
  • 106. LOG-BASED RECOVERY  A log is kept on stable storage.  The log is a sequence of log records, and maintains a record of update activities on the database.  When transaction Ti starts, it registers itself by writing a <Ti start>log record  Before Ti executes write(X), a log record <Ti, X, V1, V2> is written, where V1 is the value of X before the write, and V2 is the value to be written to X.  Log record notes that Ti has performed a write on data item Xj Xj had value V1 before the write, and will have value V2 after the write.  When Ti finishes it last statement, the log record <Ti commit> is written.  We assume for now that log records are written directly to stable storage (that is, they are not buffered)  Two approaches using logs  Deferred database modification  Immediate database modification 106 PreparedbyR.Arthy,AP/IT,KCET
  • 107. DEFERRED DATABASE MODIFICATION  The deferred database modification scheme records all modifications to the log, but defers all the writes to after partial commit.  Assume that transactions execute serially  Transaction starts by writing <Ti start> record to log.  A write(X) operation results in a log record <Ti, X, V> being written, where V is the new value for X  Note: old value is not needed for this scheme  The write is not performed on X at this time, but is deferred.  When Ti partially commits, <Ti commit> is written to the log  Finally, the log records are read and used to actually execute the previously deferred writes. 107 PreparedbyR.Arthy,AP/IT,KCET
  • 108. [CONTD…]  During recovery after a crash, a transaction needs to be redone if and only if both <Ti start> and<Ti commit> are there in the log.  Redoing a transaction Ti ( redoTi) sets the value of all data items updated by the transaction to the new values.  Crashes can occur while  the transaction is executing the original updates, or  while recovery action is being taken  example transactions T0 and T1 (T0 executes before T1): T0: read (A) T1 : read (C) A: - A - 50 C:- C- 100 Write (A) write (C) read (B) B:- B + 50 write (B) 108 PreparedbyR.Arthy,AP/IT,KCET
  • 109. [CONTD…]  Below we show the log as it appears at three instances of time.  If log on stable storage at time of crash is as in case: (a) No redo actions need to be taken (b) redo(T0) must be performed since <T0 commit> is present (c) redo(T0) must be performed followed by redo(T1) since <T0 commit> and <Ti commit> are present 109 PreparedbyR.Arthy,AP/IT,KCET
  • 110. IMMEDIATE DATABASE MODIFICATION  The immediate database modification scheme allows database updates of an uncommitted transaction to be made as the writes are issued  since undoing may be needed, update logs must have both old value and new value  Update log record must be written before database item is written  We assume that the log record is output directly to stable storage  Can be extended to postpone log record output, so long as prior to execution of an output(B) operation for a data block B, all log records corresponding to items B must be flushed to stable storage  Output of updated blocks can take place at any time before or after transaction commit  Order in which blocks are output can be different from the order in which they are written. 110 PreparedbyR.Arthy,AP/IT,KCET
  • 111. IMMEDIATE DATABASE MODIFICATION EXAMPLE Log Write Output <T0 start> <T0, A, 1000, 950> To, B, 2000, 2050 A = 950 B = 2050 <T0 commit> <T1 start> <T1, C, 700, 600> C = 600 BB, BC <T1 commit> BA  Note: BX denotes block containing X. x1 111 PreparedbyR.Arthy,AP/IT,KCET
  • 112. [CONTD…]  Recovery procedure has two operations instead of one:  undo(Ti) restores the value of all data items updated by Ti to their old values, going backwards from the last log record for Ti  redo(Ti) sets the value of all data items updated by Ti to the new values, going forward from the first log record for Ti  Both operations must be idempotent  That is, even if the operation is executed multiple times the effect is the same as if it is executed once  Needed since operations may get re-executed during recovery  When recovering after failure:  Transaction Ti needs to be undone if the log contains the record <Ti start>, but does not contain the record <Ti commit>.  Transaction Ti needs to be redone if the log contains both the record <Ti start> and the record <Ti commit>.  Undo operations are performed first, then redo operations. 112 PreparedbyR.Arthy,AP/IT,KCET
  • 113. IMMEDIATE DB MODIFICATION RECOVERY EXAMPLE Below we show the log as it appears at three instances of time. Recovery actions in each case above are: (a) undo (T0): B is restored to 2000 and A to 1000. (b) undo (T1) and redo (T0): C is restored to 700, and then A and B are set to 950 and 2050 respectively. (c) redo (T0) and redo (T1): A and B are set to 950 and 2050 respectively. Then C is set to 600 113 PreparedbyR.Arthy,AP/IT,KCET
  • 114. CHECKPOINTS  Problems in recovery procedure as discussed earlier : 1. searching the entire log is time-consuming 2. we might unnecessarily redo transactions which have already 3. output their updates to the database.  Streamline recovery procedure by periodically performing checkpointing 1. Output all log records currently residing in main memory onto stable storage. 2. Output all modified buffer blocks to the disk. 3. Write a log record < checkpoint> onto stable storage. 114 PreparedbyR.Arthy,AP/IT,KCET
  • 115. [CONTD…]  During recovery we need to consider only the most recent transaction Ti that started before the checkpoint, and transactions that started after Ti. 1. Scan backwards from end of log to find the most recent <checkpoint> record 2. Continue scanning backwards till a record <Ti start> is found. 3. Need only consider the part of log following above start record. Earlier part of log can be ignored during recovery, and can be erased whenever desired. 4. For all transactions (starting from Ti or later) with no <Ti commit>, execute undo(Ti). (Done only in case of immediate modification.) 5. Scanning forward in the log, for all transactions starting from Ti or later with a <Ti commit>, execute redo(Ti). 115 PreparedbyR.Arthy,AP/IT,KCET
  • 116. EXAMPLE OF CHECKPOINTS  T1 can be ignored (updates already output to disk due to checkpoint)  T2 and T3 redone.  T4 undone Tc Tf T1 T2 T3 T4 checkpoint system failure 116 PreparedbyR.Arthy,AP/IT,KCET
  • 117. RECOVERY WITH CONCURRENT TRANSACTIONS  We modify the log-based recovery schemes to allow multiple transactions to execute concurrently.  All transactions share a single disk buffer and a single log  A buffer block can have data items updated by one or more transactions  We assume concurrency control using strict two-phase locking;  i.e. the updates of uncommitted transactions should not be visible to other transactions  Otherwise how to perform undo if T1 updates A, then T2 updates A and commits, and finally T1 has to abort?  Logging is done as described earlier.  Log records of different transactions may be interspersed in the log.  The checkpointing technique and actions taken on recovery have to be changed  since several transactions may be active when a checkpoint is performed. 117 PreparedbyR.Arthy,AP/IT,KCET
  • 118. [CONTD…]  Checkpoints are performed as before, except that the checkpoint log record is now of the form < checkpoint L> where L is the list of transactions active at the time of the checkpoint  We assume no updates are in progress while the checkpoint is carried out (will relax this later)  When the system recovers from a crash, it first does the following: 1. Initialize undo-list and redo-list to empty 2. Scan the log backwards from the end, stopping when the first <checkpoint L> record is found. For each record found during the backward scan:  if the record is <Ti commit>, add Ti to redo-list  if the record is <Ti start>, then if Ti is not in redo-list, add Ti to undo-list 3. For every Ti in L, if Ti is not in redo-list, add Ti to undo-list 118 PreparedbyR.Arthy,AP/IT,KCET
  • 119. [CONTD…]  At this point undo-list consists of incomplete transactions which must be undone, and redo-list consists of finished transactions that must be redone.  Recovery now continues as follows: 1. Scan log backwards from most recent record, stopping when <Ti start> records have been encountered for every Ti in undo- list.  During the scan, perform undo for each log record that belongs to a transaction in undo-list. 2. Locate the most recent <checkpoint L> record. 3. Scan log forwards from the <checkpoint L> record till the end of the log.  During the scan, perform redo for each log record that belongs to a transaction on redo-list 119 PreparedbyR.Arthy,AP/IT,KCET
  • 120. EXAMPLE OF RECOVERY  Go over the steps of the recovery algorithm on the following log: <T0 start> <T0, A, 0, 10> <T0 commit> <T1 start> /* Scan at step 1 comes up to here */ <T1, B, 0, 10> <T2 start> <T2, C, 0, 10> <T2, C, 10, 20> <checkpoint {T1, T2}> <T3 start> <T3, A, 10, 20> <T3, D, 0, 10> <T3 commit> 120 PreparedbyR.Arthy,AP/IT,KCET
  • 121. LOG RECORD BUFFERING  Log record buffering: log records are buffered in main memory, instead of of being output directly to stable storage.  Log records are output to stable storage when a block of log records in the buffer is full, or a log force operation is executed.  Log force is performed to commit a transaction by forcing all its log records (including the commit record) to stable storage.  Several log records can thus be output using a single output operation, reducing the I/O cost. 121 PreparedbyR.Arthy,AP/IT,KCET
  • 122. [CONTD…]  The rules below must be followed if log records are buffered:  Log records are output to stable storage in the order in which they are created.  Transaction Ti enters the commit state only when the log record <Ti commit> has been output to stable storage.  Before a block of data in main memory is output to the database, all log records pertaining to data in that block must have been output to stable storage.  This rule is called the write-ahead logging or WAL rule  Strictly speaking WAL only requires undo information to be output 122 PreparedbyR.Arthy,AP/IT,KCET
  • 123. DATABASE BUFFERING  Database maintains an in-memory buffer of data blocks  When a new block is needed, if buffer is full an existing block needs to be removed from buffer  If the block chosen for removal has been updated, it must be output to disk  If a block with uncommitted updates is output to disk, log records with undo information for the updates are output to the log on stable storage first  (Write ahead logging)  No updates should be in progress on a block when it is output to disk. Can be ensured as follows.  Before writing a data item, transaction acquires exclusive lock on block containing the data item  Lock can be released once the write is completed.  Such locks held for short duration are called latches.  Before a block is output to disk, the system acquires an exclusive latch on the block  Ensures no update can be in progress on the block 123 PreparedbyR.Arthy,AP/IT,KCET
  • 124. BUFFER MANAGEMENT (CONT.)  Database buffer can be implemented either  in an area of real main-memory reserved for the database, or  in virtual memory  Implementing buffer in reserved main-memory has drawbacks:  Memory is partitioned before-hand between database buffer and applications, limiting flexibility.  Needs may change, and although operating system knows best how memory should be divided up at any time, it cannot change the partitioning of memory. 124 PreparedbyR.Arthy,AP/IT,KCET
  • 125. BUFFER MANAGEMENT (CONT.)  Database buffers are generally implemented in virtual memory in spite of some drawbacks:  When operating system needs to evict a page that has been modified, the page is written to swap space on disk.  When database decides to write buffer page to disk, buffer page may be in swap space, and may have to be read from swap space on disk and output to the database on disk, resulting in extra I/O!  Known as dual paging problem.  Ideally when OS needs to evict a page from the buffer, it should pass control to database, which in turn should 1. Output the page to database instead of to swap space (making sure to output log records first), if it is modified 2. Release the page from the buffer, for the OS to use Dual paging can thus be avoided, but common operating systems do not support such functionality. 125 PreparedbyR.Arthy,AP/IT,KCET
  • 126. FAILURE WITH LOSS OF NONVOLATILE STORAGE  So far we assumed no loss of non-volatile storage  Technique similar to checkpointing used to deal with loss of non-volatile storage  Periodically dump the entire content of the database to stable storage  No transaction may be active during the dump procedure; a procedure similar to checkpointing must take place  Output all log records currently residing in main memory onto stable storage.  Output all buffer blocks onto the disk.  Copy the contents of the database to stable storage.  Output a record <dump> to log on stable storage. 126 PreparedbyR.Arthy,AP/IT,KCET
  • 127. RECOVERING FROM FAILURE OF NON-VOLATILE STORAGE  To recover from disk failure  restore database from most recent dump.  Consult the log and redo all transactions that committed after the dump  Can be extended to allow transactions to be active during dump; known as fuzzy dump or online dump  Will study fuzzy checkpointing later 127 PreparedbyR.Arthy,AP/IT,KCET