The document discusses various MySQL indexing concepts like primary key indexes, secondary indexes, clustered indexes and hash indexes. It explains how indexes are used based on the left prefix rule and selectivity. It also covers storage engines like InnoDB and MyISAM. The document discusses locking errors like lock wait timeouts and deadlocks. It explains isolation levels like repeatable read, read committed and serializable. It provides details about the Aurora undo log and how it differs from vanilla MySQL. It emphasizes monitoring MySQL using the error log, slow query log and metrics. It also briefly discusses Aurora parallel queries.
CTF3, Stripe's third Capture-the-Flag, focused on distributed systems engineering with a goal of learning to build fault-tolerant, performant software while playing around with a bunch of cool cutting-edge technologies.
More here: https://stripe.com/blog/ctf3-launch.
During the continuous mORMot refactoring, some core part of the framework was rewritten. In this session, we propose a journey to a refactoring of a single loop. It will take us from a naïve but working approach, to a 10 times faster Pascal rewrite, and then introduce how SSE2 and AVX2 assembly could boost the process even further – to reach more than 30 times improvement! No previous knowledge of assembly is needed: we will try to introduce how modern CPUs work, and will have some fun with algorithms and SIMD parallelism.
Search at Twitter: Presented by Michael Busch (Twitter) - Lucidworks
Twitter processes over 500 million tweets per day and more than 2 billion search queries per day. The company uses a search architecture based on Lucene with custom extensions. This includes an in-memory real-time index optimized for concurrency without locks, and a schema-based document factory. Future work includes support for parallel index segments and additional Lucene features.
This document provides an overview of concurrency concepts including:
- The speaker discusses some basic JVM knowledge, concurrency concepts, and personal design suggestions.
- Key concurrency topics like reorder, happens-before relationships, volatile fields, and CopyOnWriteArrayList are summarized.
- Common concurrency patterns like producer-consumer, readers-writers, and dining philosophers problems are explained.
- The goal is to make developers aware of concurrency issues and provide resources to study the topic further.
This document summarizes the basics of memory management in Python. It discusses key concepts like variables, objects, references, and reference counting. It explains how Python uses reference counting with generational garbage collection to manage memory and clean up unused objects. The document also covers potential issues with reference counting like cyclic references and threads, and how the global interpreter lock impacts multi-threading in Python.
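The reference-counting and cyclic-garbage behavior described above can be observed directly from the standard library. A minimal sketch using `sys.getrefcount` and the `gc` module (the exact counts printed are interpreter-dependent):

```python
import sys
import gc

# Every Python object carries a reference count; binding a name adds a reference.
data = [1, 2, 3]
alias = data                      # second reference to the same list object
# getrefcount reports one extra reference for its own argument.
rc = sys.getrefcount(data)
print(rc)                         # typically 3: data, alias, and the argument

del alias                         # dropping a reference decrements the count

# Reference counting alone cannot reclaim cycles; the generational GC handles them.
class Node:
    def __init__(self):
        self.partner = None

a, b = Node(), Node()
a.partner, b.partner = b, a       # reference cycle: counts never reach zero
del a, b
collected = gc.collect()          # cycle detector reclaims the pair
print(collected >= 2)             # True: the unreachable Node objects were found
```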
Approximate "Now" is Better Than Accurate "Later" - NUS-ISS
How does Twitter track the top trending topics?
How does Amazon keep track of the top-selling items for the day?
How many cabs have been booked this month using your App?
Is the password that a new user is choosing a common/compromised password?
Modern web-scale systems process billions of transactions and generate terabytes of data every single day. To answer questions against this data, one would typically initiate a multi-minute query against a NoSQL datastore or kick off a batch job in a distributed processing framework such as Spark or Flink. These jobs are throughput-heavy and not suited to realtime low-latency queries. Yet you and your customers would like to have all this information "right now".
At the end of this talk, you'll realize that you can power these low-latency queries with an incredibly low memory footprint, "IF" you are willing to accept answers that are, say, 96-99% accurate. This talk introduces some of the go-to probabilistic data structures used by organisations with large amounts of data - specifically the Bloom filter, Count-Min Sketch and HyperLogLog.
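The password-check question above is exactly what a Bloom filter answers: "definitely not seen" or "probably seen" in a few bits per item. A minimal illustrative sketch (the sizes `m_bits` and `k_hashes` are arbitrary here, and real deployments would size them for a target false-positive rate):

```python
import hashlib

class BloomFilter:
    """Toy Bloom filter: k hash positions over an m-bit array.
    No false negatives; small chance of false positives."""

    def __init__(self, m_bits=1024, k_hashes=3):
        self.m = m_bits
        self.k = k_hashes
        self.bits = 0                     # a Python int doubles as a bit array

    def _positions(self, item):
        for i in range(self.k):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.m

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def might_contain(self, item):
        return all(self.bits >> pos & 1 for pos in self._positions(item))

# The "compromised password" use case: one membership test, no password list scan.
compromised = BloomFilter()
for pw in ["123456", "password", "qwerty"]:
    compromised.add(pw)

print(compromised.might_contain("password"))        # True (was added)
print(compromised.might_contain("correct horse"))   # almost certainly False
```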
The document discusses data-oriented design principles for game engine development in C++. It emphasizes understanding how data is represented and used to solve problems, rather than focusing on writing code. It provides examples of how restructuring code to better utilize data locality and cache lines can significantly improve performance by reducing cache misses. Booleans packed into structures are identified as having extremely low information density, wasting cache space.
The Ring programming language version 1.9 book - Part 100 of 210 - Mahmoud Samir Fayed
This document provides reference information about the Ring programming language, including lists of its keywords, built-in functions, and types of errors. It describes the language's architecture as having Ring applications that use Ring libraries and extensions written in C/C++. It also covers Ring's memory management using garbage collection, and how data like strings, numbers, and binary data are represented internally.
Back to Basics 3: Scaling 30,000 Requests a Second with MongoDB - MongoDB
Mike Chesnut of Crittercism discussed how they scaled their MongoDB deployment to handle over 30,000 requests per second. He explained how they initially started with a single mongos process which caused problems as load increased. They overcame this by introducing a separate mongos tier for client connections. Chesnut also discussed how not running the balancer properly led to an unbalanced cluster, and the steps they took to manually balance chunks until the balancer could be safely re-enabled. He emphasized designing for scalability from the start and adapting architecture over time.
Title: Rethink Package Components on De-Duplication: From Logical Sharing to Physical Sharing
URL: http://events.linuxfoundation.org/2010/linuxcon-japan/suzaki
Abstract: Linux distributions include many logical sharing techniques (shared library, symbolic link, etc) on memory and storage. Unfortunately they cause security and management problems, such as GOT (Global Offset Table) overwrite attack, TOCTTOU (Time Of Check To Time Of Use) attack, Dependency hell, etc. In order to mitigate the problems, we propose the replacement of logical sharing by physical sharing (memory and disk deduplication; e.g., KSM: Kernel Samepage Merging, Content Addressable Storage, etc). Original ELF binaries are transformed into self-contained binaries which include dynamically linked shared libraries as "pseudo-static". The binaries become fat, but physical resource usage is mitigated by deduplication. We have investigated the effect on Debian and Ubuntu and confirmed that the physical impact is low. Data centers of Cloud Computing utilize the deduplication techniques. Users and administrators should consider Linux images in terms of security and maintenance, and the usage of deduplicated fat binaries.
Managing Data and Operation Distribution In MongoDB - Jason Terpko
In a sharded MongoDB cluster, scale and data distribution are defined by your shard keys. Even when choosing the correct shard key, ongoing maintenance and review can still be required to maintain optimal performance.
This presentation will review shard key selection and how the distribution of chunks can create scenarios where you may need to manually move, split, or merge chunks in your sharded cluster. Scenarios requiring these actions can exist with both optimal and sub-optimal shard keys. Example use cases will provide tips on selection of shard key, detecting an issue, reasons why you may encounter these scenarios, and specific steps you can take to rectify the issue.
Signed and unsigned variables can represent positive and negative numbers, or just positive numbers respectively. Declaring a variable as unsigned increases its maximum positive range. Storage classes like auto, static, extern and register provide information about a variable's location and visibility. Bitwise operators operate on binary representations of numbers and are commonly used for tasks like compression/encryption that require bit-level manipulation.
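The bit-level manipulation that summary refers to is easiest to show concretely. A short sketch, in Python for consistency with the other examples here (the operators `<<`, `>>`, `|`, `&`, `^` behave the same way in C):

```python
# Packing two 4-bit values into one byte, then unpacking them - the kind of
# bit-level manipulation bitwise operators are used for.
hi, lo = 0b1010, 0b0011

packed = (hi << 4) | lo              # shift hi into the upper nibble, OR in lo
print(bin(packed))                   # 0b10100011

unpacked_hi = (packed >> 4) & 0x0F   # shift down, mask off everything above
unpacked_lo = packed & 0x0F          # mask keeps only the lower nibble
print(unpacked_hi == hi, unpacked_lo == lo)   # True True

# XOR underlies many simple ciphers: applying the same key twice
# restores the original value.
key = 0b01011010
cipher = packed ^ key
print(cipher ^ key == packed)        # True
```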
This document provides an overview of managing data and operations distribution in MongoDB sharded clusters. It discusses shard key selection, applying and reverting shard keys, automatic and manual chunk splitting and merging, balancing data distribution across shards, and cleaning up orphaned documents. The goal is to optimally distribute data and load across shards in the cluster.
This document provides an introduction and overview of the Python programming language. It discusses what Python is, its features, applications, and how to install Python on Windows and Linux systems. It also covers Python basics like variables, data types, operators, comments, conditional statements like if/else, and loops like for, while, and nested loops. Examples are provided for key concepts. The document is intended as a beginner tutorial for learning Python.
Counting and sorting are basic tasks that distributed systems rely on. The document discusses different approaches for distributed counting and sorting, including software combining trees, counting networks, and sorting networks. Counting networks like bitonic and periodic networks have depth O(log² w), where w is the network width. Sorting networks can sort in the same time complexity by exploiting an isomorphism between counting and sorting networks. Sample sorting is also discussed as a way to sort large datasets across multiple threads.
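What makes a sorting network different from an ordinary sort is that its compare-exchange steps are fixed in advance, independent of the data, which is why they parallelize well. A tiny sketch using the standard optimal 5-comparator network for 4 inputs:

```python
from itertools import permutations

# A sorting network is a fixed sequence of compare-exchange operations:
# the same comparators run regardless of the input values.
COMPARATORS_4 = [(0, 1), (2, 3), (0, 2), (1, 3), (1, 2)]

def network_sort(values, comparators=COMPARATORS_4):
    v = list(values)
    for i, j in comparators:
        if v[i] > v[j]:              # compare-exchange: swap if out of order
            v[i], v[j] = v[j], v[i]
    return v

# The five fixed comparators sort every permutation of four elements.
print(all(network_sort(p) == sorted(p) for p in permutations([3, 1, 4, 2])))  # True
```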
Sangam 18 - Database Development: Return of the SQL Jedi - Connor McDonald
A look at the techniques that middle tier developers can employ to get greater value out of their applications, simply by having an understanding of how the database works and how to make it sing.
What every web and app developer should know about multithreading - Ilya Haykinson
Multithreading allows executing multiple tasks simultaneously by splitting a program into multiple threads. It is useful for improving performance on web applications and when disk/network I/O is involved. However, threads introduce complexity as code and data may be accessed concurrently. Synchronization techniques like mutexes are needed to coordinate thread access and prevent race conditions from causing unpredictable behavior. Developers must carefully manage shared resources and avoid deadlocks when using multithreading.
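The mutex coordination described above looks much the same in any language; here is a minimal Python sketch, where `threading.Lock` plays the role of the mutex guarding a shared counter:

```python
import threading

counter = 0
lock = threading.Lock()

def increment(n):
    global counter
    for _ in range(n):
        with lock:               # the mutex makes read-modify-write atomic
            counter += 1

# Four threads hammer the same shared variable.
threads = [threading.Thread(target=increment, args=(100_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(counter)  # 400000 - without the lock, interleaved updates could be lost
```

Acquiring locks via `with` also sidesteps one common deadlock source: the lock is always released, even if the guarded code raises.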
APEX Connect 2019 - array/bulk processing in PLSQL - Connor McDonald
A beginner-level talk on the syntax for bulk processing in PL/SQL, why it is so important for performance and scalability, and how to diagnose errors when things go wrong.
The document describes a cache-aware hybrid sorter that is faster than the STL sort. It first radix sorts input streams into substreams that fit into the CPU cache. This is done in a cache-friendly manner by splitting streams based on cache size. The substreams are then merged using a loser tree merge, which has better memory access patterns than a heap-based priority queue. Testing showed the hybrid sort was 2-6 times faster than STL sort and scaled well on multi-core CPUs.
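The radix-partitioning step is the heart of that design: each pass buckets values by one byte, so a pass touches only small, predictable ranges of memory. A compact LSD radix sort sketch (in Python for illustration; the cache argument of course only bites in a systems language):

```python
def radix_sort(values, byte_width=4):
    """LSD radix sort for non-negative integers: one stable bucketing
    pass per byte, least-significant byte first."""
    for shift in range(0, byte_width * 8, 8):
        buckets = [[] for _ in range(256)]
        for v in values:
            buckets[(v >> shift) & 0xFF].append(v)      # bucket by current byte
        values = [v for bucket in buckets for v in bucket]  # stable concatenation
    return values

data = [90210, 7, 512, 7, 303, 65536]
print(radix_sort(data))  # [7, 7, 303, 512, 65536, 90210]
```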
Cassandra introduction, ApacheCon 2014 Budapest - Duyhai Doan
This document provides an introduction and summary of Cassandra presented by Duy Hai Doan. It discusses Cassandra's history as a NoSQL database created at Facebook and open sourced in 2008. The key architecture of Cassandra including its data distribution across nodes, replication for failure tolerance, and consistency models for reads and writes is summarized.
This presentation from ROBO INDIA covers all the essentials needed to learn the C programming language.
The slides also explain each of these topics in detail.
We welcome all your views and queries. Please write to us; we can be found at:
website: http://roboindia.com
mail: info@roboindia.com
The nightmare of locking, blocking and isolation levels! - Boris Hristov
I am sure you all know that troubleshooting problems related to locking and blocking (hey, sometimes there are deadlocks too) can be a real nightmare! In this session, you will see and understand why and how locking actually works, what problems it causes, and how we can use isolation levels and various other techniques to resolve them!
The JVM memory model describes how threads in the Java ecosystem interact through memory. While the memory model's impact on developing for the JVM may not be obvious, it is the cause of a certain number of "anomalies" that are, well, by design.
In this presentation we will explore the aspects of the memory model, including things like reordering of instructions, volatile members, monitors, atomics and JIT.
Basics in algorithms and data structures - Eman Magdy
The document discusses data structures and algorithms. It notes that good programmers focus on data structures and their relationships, while bad programmers focus on code. It then provides examples of different data structures like trees and binary search trees, and algorithms for searching, inserting, deleting, and traversing tree structures. Key aspects covered include the time complexity of different searching algorithms like sequential search and binary search, as well as how to implement operations like insertion and deletion on binary trees.
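The binary search tree operations mentioned there fit in a few lines. A minimal sketch of insert, search, and in-order traversal:

```python
class BSTNode:
    def __init__(self, key):
        self.key, self.left, self.right = key, None, None

def insert(root, key):
    """Walk down comparing keys; O(log n) on a balanced tree, O(n) worst case."""
    if root is None:
        return BSTNode(key)
    if key < root.key:
        root.left = insert(root.left, key)
    elif key > root.key:
        root.right = insert(root.right, key)
    return root

def search(root, key):
    """Binary search: each comparison discards one subtree."""
    while root is not None and root.key != key:
        root = root.left if key < root.key else root.right
    return root is not None

def inorder(root):
    """In-order traversal visits keys in sorted order."""
    return inorder(root.left) + [root.key] + inorder(root.right) if root else []

root = None
for k in [8, 3, 10, 1, 6]:
    root = insert(root, k)

print(inorder(root))      # [1, 3, 6, 8, 10]
print(search(root, 6))    # True
print(search(root, 7))    # False
```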
This document provides an overview of the Python programming language in under 90 minutes. It covers Python basics like Hello World, variables, data types, objects, functions, conditionals, and more. The goal is to teach readers enough Python to read, write, and understand basic Python programs in a short period of time. It also provides references to additional resources like the author's book for learning Python in more depth.
- Publishers send messages to topics in Apache Kafka which are partitioned across brokers
- Brokers append messages to the ends of partitions and subscribers can request messages from specific offsets in partitions
- This allows subscribers to replay processing from any point in time as they request messages based on offset rather than relying on brokers to deliver messages
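The offset-based pull model in the points above can be sketched as a toy in-memory partition. This is an illustration of the idea, not the Kafka client API:

```python
class Partition:
    """A toy append-only log: one Kafka-style partition on the broker side."""

    def __init__(self):
        self._log = []

    def append(self, message):
        self._log.append(message)          # messages only ever go on the end
        return len(self._log) - 1          # offset of the stored message

    def read_from(self, offset, max_messages=10):
        """Consumers pull by offset, so replay is just re-reading from an
        earlier position; the broker never tracks delivery state."""
        return self._log[offset:offset + max_messages]

p = Partition()
for event in ["signup", "login", "purchase"]:
    p.append(event)

print(p.read_from(0))   # ['signup', 'login', 'purchase']  (full replay)
print(p.read_from(2))   # ['purchase']                     (resume mid-stream)
```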
This document discusses using the relay log as a solution for failover in a multi-source replication scenario where the binary log positions and transaction IDs are different across slaves. The relay log contains the same position and transaction ID for a transaction on all slaves, allowing one to be used as a global transaction ID for failover. Specifically, a modified client was created to dump the relay log, and the MySQL server was updated to support relay log dumping, providing a hero - the unsung relay log - for high availability when GTIDs were not available.
The document discusses data storage and processing. Data can be stored in memory or on disk using file systems like local XFS/ZFS or distributed systems like HDFS, S3, or Ceph. Distributed file systems allow for parallel processing of data by moving computation to the data locations. This map-reduce framework involves mapping functions to distributed data segments followed by reducing the results. Hadoop uses HDFS for storage and the MapReduce framework for distributed computation on large datasets across clusters.
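The map-then-reduce flow described above is usually introduced with word count. A single-process sketch of the two phases (in a real cluster, the map phase runs in parallel next to each data segment and the framework handles the shuffle):

```python
from collections import defaultdict

# Word count, the canonical MapReduce example.
segments = ["the quick brown fox", "the lazy dog", "the fox"]

def map_phase(segment):
    # Map: turn one data segment into (word, 1) pairs.
    return [(word, 1) for word in segment.split()]

def reduce_phase(pairs):
    # Reduce: group pairs by key and sum the counts.
    totals = defaultdict(int)
    for word, count in pairs:
        totals[word] += count
    return dict(totals)

mapped = [pair for seg in segments for pair in map_phase(seg)]
counts = reduce_phase(mapped)
print(counts["the"])   # 3
print(counts["fox"])   # 2
```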
My attempt to demystify datastores.
How do you choose a store that fits your needs, and what questions do you need to ask?
Covers HBase, Hadoop, MySQL, Cassandra, Vertica, etc.
Docker allows for the delivery of applications using containers. Containers are lightweight and allow multiple applications to run on the same host, unlike virtual machines, which each require their own operating system. Docker images contain the contents and configuration needed to run an application; images are built from manifests, with layers of content and configuration added on top. Running containers from images allows applications to be easily delivered and run. Containers can be connected to volumes to preserve data when the container is deleted. Docker networking allows containers to communicate, and ports can be exposed to the host.
StormWars - when the data stream shrinks - Vishnu Rao
Apache Storm is a stream processing framework that can be used to process real-time data from data streams like Apache Kafka or Amazon Kinesis. When data in Amazon Kinesis is repartitioned into new shards, the partition metadata used by Storm becomes invalid. To address this, a solution is to define a white list of shards for each Storm topology, so that individual topologies are not affected when shards are added or removed from the stream.
The document proposes a "Punch Clock" concept to help debug Apache Storm transactional topologies. A Punch Clock would record when batches of tuples enter and exit spouts and bolts. Each spout/bolt would have a Punch Card ID to track the batch. Punching in would add the ID to a data structure, punching out would remove it. This would help identify batches stuck in specific spouts/bolts on hosts. It could be exposed via JMX to aggregate data across worker JVMs running the spouts/bolts. The goal is to determine batch flow through the topology and find any that are stuck.
a wild Supposition: can MySQL be Kafka? - Vishnu Rao
Apache Kafka is a distributed publish-subscribe messaging system that uses a distributed commit log to store messages. It allows applications to publish and subscribe to streams of records. The document discusses some similarities and differences between using MySQL and Kafka for messaging, such as Kafka's ability to horizontally scale by adding more brokers with multiple partitions, versus the single partition in a MySQL instance. While unconventional, the document proposes that MySQL could potentially be used as a messaging system like Kafka.
Build your own Real Time Analytics and Visualization, Enable Complex Event Pr... - Vishnu Rao
At the 5th Elephant BigData conference in Bangalore, India, 27 July 2012.
https://fifthelephant.talkfunnel.com/2012/384-build-your-own-real-time-analytics-and-visualization-enable-complex-event-processing-event-patterns-and-aggregates
Essentials of Automations: Exploring Attributes & Automation Parameters - Safe Software
Building automations in FME Flow can save time, money, and help businesses scale by eliminating data silos and providing data to stakeholders in real-time. One essential component to orchestrating complex automations is the use of attributes & automation parameters (both formerly known as “keys”). In fact, it’s unlikely you’ll ever build an Automation without using these components, but what exactly are they?
Attributes & automation parameters enable the automation author to pass data values from one automation component to the next. During this webinar, our FME Flow Specialists will cover leveraging the three types of these output attributes & parameters in FME Flow: Event, Custom, and Automation. As a bonus, they’ll also be making use of the Split-Merge Block functionality.
You’ll leave this webinar with a better understanding of how to maximize the potential of automations by making use of attributes & automation parameters, with the ultimate goal of setting your enterprise integration workflows up on autopilot.
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency - ScyllaDB
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
Conversational agents, or chatbots, are increasingly used to access all sorts of services using natural language. While open-domain chatbots - like ChatGPT - can converse on any topic, task-oriented chatbots - the focus of this paper - are designed for specific tasks, like booking a flight, obtaining customer support, or setting an appointment. Like any other software, task-oriented chatbots need to be properly tested, usually by defining and executing test scenarios (i.e., sequences of user-chatbot interactions). However, there is currently a lack of methods to quantify the completeness and strength of such test scenarios, which can lead to low-quality tests, and hence to buggy chatbots.
To fill this gap, we propose adapting mutation testing (MuT) for task-oriented chatbots. To this end, we introduce a set of mutation operators that emulate faults in chatbot designs, an architecture that enables MuT on chatbots built using heterogeneous technologies, and a practical realisation as an Eclipse plugin. Moreover, we evaluate the applicability, effectiveness and efficiency of our approach on open-source chatbots, with promising results.
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill - LizaNolte
HERE IS YOUR WEBINAR CONTENT! 'Mastering Customer Journey Management with Dr. Graham Hill'. We hope you find the webinar recording both insightful and enjoyable.
In this webinar, we explored essential aspects of Customer Journey Management and personalization. Here’s a summary of the key insights and topics discussed:
Key Takeaways:
Understanding the Customer Journey: Dr. Hill emphasized the importance of mapping and understanding the complete customer journey to identify touchpoints and opportunities for improvement.
Personalization Strategies: We discussed how to leverage data and insights to create personalized experiences that resonate with customers.
Technology Integration: Insights were shared on how inQuba’s advanced technology can streamline customer interactions and drive operational efficiency.
"What does it really mean for your system to be available, or how to define w... - Fwdays
We will talk about system monitoring from a few different angles. We will start by covering the basics, then discuss SLOs, how to define them, and why understanding the business well is crucial for success in this exercise.
This talk will cover ScyllaDB Architecture from the cluster-level view and zoom in on data distribution and internal node architecture. In the process, we will learn the secret sauce used to get ScyllaDB's high availability and superior performance. We will also touch on the upcoming changes to ScyllaDB architecture, moving to strongly consistent metadata and tablets.
What is an RPA CoE? Session 1 – CoE Vision - DianaGray10
In the first session, we will review the organization's vision and how this has an impact on the COE Structure.
Topics covered:
• The role of a steering committee
• How do the organization’s priorities determine CoE Structure?
Speaker:
Chris Bolin, Senior Intelligent Automation Architect Anika Systems
The Department of Veteran Affairs (VA) invited Taylor Paschal, Knowledge & Information Management Consultant at Enterprise Knowledge, to speak at a Knowledge Management Lunch and Learn hosted on June 12, 2024. All Office of Administration staff were invited to attend and received professional development credit for participating in the voluntary event.
The objectives of the Lunch and Learn presentation were to:
- Review what KM ‘is’ and ‘isn’t’
- Understand the value of KM and the benefits of engaging
- Define and reflect on your “what’s in it for me?”
- Share actionable ways you can participate in Knowledge Capture & Transfer
Northern Engraving | Nameplate Manufacturing Process - 2024 - Northern Engraving
Manufacturing custom quality metal nameplates and badges involves several standard operations. Processes include sheet prep, lithography, screening, coating, punch press and inspection. All decoration is completed in the flat sheet with adhesive and tooling operations following. The possibilities for creating unique durable nameplates are endless. How will you create your brand identity? We can help!
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk - Fwdays
In this talk we will discuss DDoS protection tools and best practices, network architectures, and what AWS has to offer. We will also look into one of the largest DDoS attacks on Ukrainian infrastructure, which happened in February 2022. We'll see what techniques helped keep web resources available for Ukrainians, and how AWS improved DDoS protection for all customers based on the Ukraine experience.
ScyllaDB is making a major architecture shift. We’re moving from vNode replication to tablets – fragments of tables that are distributed independently, enabling dynamic data distribution and extreme elasticity. In this keynote, ScyllaDB co-founder and CTO Avi Kivity explains the reason for this shift, provides a look at the implementation and roadmap, and shares how this shift benefits ScyllaDB users.
High performance Serverless Java on AWS - GoTo Amsterdam 2024 - Vadym Kazulkin
Java has been one of the most popular programming languages for many years, but it used to have a hard time in the Serverless community. Java is known for its high cold start times and high memory footprint compared to other programming languages like Node.js and Python. In this talk I'll look at the general best practices and techniques we can use to decrease memory consumption and cold start times for Java Serverless development on AWS, including GraalVM (Native Image) and AWS's own offering SnapStart, based on Firecracker microVM snapshot and restore and CRaC (Coordinated Restore at Checkpoint) runtime hooks. I'll also provide a lot of benchmarking on Lambda functions, trying out various deployment package sizes, Lambda memory settings, Java compilation options and HTTP (a)synchronous clients, and measuring their impact on cold and warm start times.
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf - Chart Kalyan
A Mix Chart displays historical data of numbers in a graphical or tabular form. The Kalyan Rajdhani Mix Chart specifically shows the results of a sequence of numbers over different periods.
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an... - Jason Yip
The typical problem in product engineering is not bad strategy, so much as “no strategy”. This leads to confusion, lack of motivation, and incoherent action. The next time you look for a strategy and find an empty space, instead of waiting for it to be filled, I will show you how to fill it in yourself. If you’re wrong, it forces a correction. If you’re right, it helps create focus. I’ll share how I’ve approached this in the past, both what works and lessons for what didn’t work so well.
What is an RPA CoE? Session 2 – CoE Roles - DianaGray10
In this session, we will review the players involved in the CoE and how each role impacts opportunities.
Topics covered:
• What roles are essential?
• What place in the automation journey does each role play?
Speaker:
Chris Bolin, Senior Intelligent Automation Architect Anika Systems
18. Index use
● Left prefix rule
○ Composite index on (Name, ID, country)
■ Select … where ID = 3 and country = us → cannot use the index (leftmost column Name is absent)
■ Select … where ID = 3 → cannot use the index
■ Select … where country = us → cannot use the index
■ Select … where Name = ‘blah’ and country = us → uses the Name prefix only
■ Select … where Name = ‘blah’ and ID = 4 → uses the (Name, ID) prefix
● Duplicate indexes
○ Name, country → redundant: it is a left prefix of the next index
○ Name, country, zip
○ Name, zip → not redundant: zip is not in a usable prefix position above
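The left-prefix behaviour above can be verified directly with EXPLAIN. A minimal sketch, assuming a hypothetical `users` table (all names here are illustrative, not from the deck):

```sql
-- Hypothetical table with a composite index on (name, id_no, country)
CREATE TABLE users (
  pk      INT PRIMARY KEY AUTO_INCREMENT,
  name    VARCHAR(50),
  id_no   INT,
  country CHAR(2),
  zip     VARCHAR(10),
  KEY idx_name_id_country (name, id_no, country)
);

-- Cannot use idx_name_id_country: the leftmost column (name) is missing
EXPLAIN SELECT * FROM users WHERE id_no = 3 AND country = 'us';

-- Uses the (name) prefix only; country is filtered after the index lookup
EXPLAIN SELECT * FROM users WHERE name = 'blah' AND country = 'us';

-- Uses the (name, id_no) prefix
EXPLAIN SELECT * FROM users WHERE name = 'blah' AND id_no = 4;
```

In the EXPLAIN output, the `key` column shows whether the index was chosen, and `key_len` hints at how much of the prefix was actually usable.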
20. Index use
● Selectivity rule
○ Select … where gender = male
○ Say there is a secondary index on gender (cardinality is only 2: male/female)
○ Say 70% of rows are male
○ => in the B-tree, 70% of entries will sit under the male node
○ => since it is a secondary index, that node only holds pointers to 70% of the rows
○ What does MySQL do?
■ Does it read the index, traverse it, and then go to disk for the 70% of rows?
■ It does not - it bypasses the index and goes to disk directly
■ It does a table scan! (a sequential scan is cheaper than 70% random row lookups)
● Explain might indicate use of the index, but in practice it is not used!
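The selectivity point can be observed the same way. A sketch with a hypothetical `people` table (the table and index names are assumptions for illustration):

```sql
-- Hypothetical table with a low-cardinality secondary index
CREATE TABLE people (
  pk     INT PRIMARY KEY AUTO_INCREMENT,
  name   VARCHAR(50),
  gender ENUM('male', 'female'),
  KEY idx_gender (gender)
);

-- With ~70% of rows matching, the optimizer typically prefers a full
-- table scan (type: ALL) over the index plus 70% random row lookups
EXPLAIN SELECT * FROM people WHERE gender = 'male';

-- Forcing the index on a non-selective predicate usually makes
-- the query slower, not faster
SELECT * FROM people FORCE INDEX (idx_gender) WHERE gender = 'male';
```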
31. Lock errors
1. Lock wait timeout exceeded:
a. set global innodb_lock_wait_timeout = x; (the default is 50 sec)
b. Show engine innodb status; (when a txn is blocked, you can see what it is blocked on)
c. Show processlist; (list of connections and what they are doing)
2. Deadlocks
a. You have to do nothing. Auto resolved by MySQL - one txn wins (InnoDB usually rolls back the one that modified fewer rows).
b. Show engine innodb status; shows you the latest deadlock that occurred
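The diagnostics above map to a handful of statements. A sketch (the timeout value is an arbitrary example; `sys.innodb_lock_waits` is available from MySQL 5.7 onward):

```sql
-- Shorten the lock wait timeout (default is 50 seconds);
-- GLOBAL affects new sessions, SESSION affects only the current one
SET GLOBAL innodb_lock_wait_timeout = 10;
SET SESSION innodb_lock_wait_timeout = 10;

-- Inspect current lock waits and the latest detected deadlock
SHOW ENGINE INNODB STATUS\G

-- See all connections and what they are running
SHOW PROCESSLIST;

-- MySQL 5.7+: who is blocking whom, in one readable view
SELECT * FROM sys.innodb_lock_waits;
```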
34. Read locks
1. Select for update
a. Use carefully.
i. You might end up locking part of the index tree. (select .. where cost > 50)
b. Good practice is to select row ids first and then update (i.e. specific row locks)
i. Select id where cost > 50
ii. Update where id = x
2. Select for share
a. It’s a pure read lock: other readers can share it, but writers wait for the read to complete
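The two-step pattern from the slide can be sketched as follows, assuming a hypothetical `orders` table (names and ids are illustrative). The range predicate under FOR UPDATE would lock a chunk of the index tree; selecting ids first keeps the locks to specific rows:

```sql
-- Risky: locks part of the index range covered by the predicate
-- SELECT * FROM orders WHERE cost > 50 FOR UPDATE;

-- Step 1: plain read, no locks taken
SELECT id FROM orders WHERE cost > 50;

-- Step 2: update only the specific ids collected above;
-- row locks are taken on exactly those rows
UPDATE orders SET status = 'flagged' WHERE id IN (3, 7, 42);
```

Note the trade-off: between step 1 and step 2 other transactions may change the matching set, so this pattern fits workloads where that race is acceptable.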
38. Isolation levels
1. Repeatable read (the default in MySQL and Aurora)
a. Repeated reads inside a txn see the same snapshot, even if other txns commit in between
2. Read Committed (newly added)
a. Every read in the txn sees the latest committed state
3. Read un-Committed (not recommended)
a. A read in the txn can see the dirty state of uncommitted txns
4. Serializable
a. All txns effectively run in sequence
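Isolation levels can be inspected and set per session or per transaction. A sketch (the `orders` table is a hypothetical example; the variable is `@@transaction_isolation` in MySQL 8.0, `@@tx_isolation` in 5.7):

```sql
-- Check the current level
SELECT @@transaction_isolation;

-- Set the level for all subsequent transactions in this session
SET SESSION TRANSACTION ISOLATION LEVEL READ COMMITTED;

-- Or for just the next transaction
SET TRANSACTION ISOLATION LEVEL REPEATABLE READ;
START TRANSACTION;
SELECT COUNT(*) FROM orders;  -- reads a consistent snapshot
COMMIT;
```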
42. Why is this important ?
For every transaction:
1. Rows updated:
a. The before-image of each row is stored in the undo log. If the txn is rolled back, the before-image is restored.
b. Even newly inserted rows get undo log entries (so the insert can be undone).
c. Once the txn completes (rolled back / committed), the undo log purges the relevant entries.
2. Rows being read:
a. A snapshot is reconstructed from the undo log.
b. It helps satisfy the isolation level of the txn.
c. Once the txn completes (rolled back / committed), the undo log purges the relevant entries.
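The undo log is exactly what makes the following sketch work (the `accounts` table and values are hypothetical):

```sql
START TRANSACTION;

-- The before-image of the row goes to the undo log
UPDATE accounts SET balance = balance - 100 WHERE id = 1;

-- The insert is also undo-logged so it can be undone
INSERT INTO accounts (id, balance) VALUES (99, 0);

-- Before-images are restored, the inserted row is removed
ROLLBACK;
```

Meanwhile, a concurrent REPEATABLE READ transaction reading `accounts` is served the pre-update row versions reconstructed from the same undo log.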
49. Aurora Undo Log
● Normally in vanilla MySQL,
○ each node has its own storage.
● In Aurora, storage is shared across nodes
○ => a single undo log that is shared between the writer & the readers
● Imagine long running transactions
○ Based on the isolation level,
○ the undo log might keep growing …
○ purging / garbage collection will not occur …
○ at some point, the cluster can stall under the backlog of overdue GC
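Undo growth can be watched from SQL as well as from Aurora's RollbackSegmentHistoryListLength metric. A sketch using standard `information_schema` tables (available in MySQL 5.6+/5.7+):

```sql
-- History list length: how much undo is waiting to be purged
SELECT name, count
FROM information_schema.INNODB_METRICS
WHERE name = 'trx_rseg_history_len';

-- Find the long-running transactions that keep the purge from advancing
SELECT trx_id, trx_started, trx_mysql_thread_id
FROM information_schema.INNODB_TRX
ORDER BY trx_started
LIMIT 5;
```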
56. Writes to Aurora & Cost
● Keep an eye on IOPS
○ IOPS ++ == $ ++
● Batch your writes if possible
● Compress your data before sending.
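Batching writes, as the slide suggests, can be as simple as a multi-row INSERT. A sketch with a hypothetical `events` table:

```sql
-- One row per statement: per-statement and I/O overhead paid four times
INSERT INTO events (payload) VALUES ('a');
INSERT INTO events (payload) VALUES ('b');
INSERT INTO events (payload) VALUES ('c');
INSERT INTO events (payload) VALUES ('d');

-- Batched multi-row insert: the same rows, the overhead amortized once
INSERT INTO events (payload) VALUES ('a'), ('b'), ('c'), ('d');
```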
58. Monitoring
1. Never ignore mysql error logs. It might have something critical mentioned. Its
your best friend !
2. Can Enable slow query logs to keep track of slow running queries
3. Metrics
a. Recommend Percona PMM (available metrics are graphed nicely)
b. Buffer pool usage metrics
c. Undo log history growth / RollbackSegmentHistoryListLength metric
d. Insert latencies
e. IOPS usage
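The slow query log can be enabled at runtime without a restart. A sketch (the 1-second threshold is an arbitrary example):

```sql
-- Enable the slow query log
SET GLOBAL slow_query_log = 'ON';

-- Queries running longer than this many seconds get logged
SET GLOBAL long_query_time = 1;

-- Optional, and can be noisy on busy systems
SET GLOBAL log_queries_not_using_indexes = 'ON';

-- Where is it being written?
SHOW VARIABLES LIKE 'slow_query_log_file';
```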
59. Aurora Parallel query
1. A feature the other MySQL variants are missing.
2. Allows for parallelism within a single select query
3. Pushes work down to the storage layer, bypassing the in-memory buffer pool and scanning the table on disk :)
a. => more IOPS => more $ :)
4. Supposedly good for your reporting queries
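A rough sketch of turning it on and checking whether it applies. Caveat: the session variable name has differed across Aurora MySQL versions (`aurora_pq` in older releases, `aurora_parallel_query` in newer ones), and the `orders` table here is a hypothetical example - check the docs for your engine version:

```sql
-- Discover the variable name your Aurora version exposes
SHOW VARIABLES LIKE '%parallel%';

-- Enable for this session (newer Aurora MySQL naming assumed)
SET SESSION aurora_parallel_query = ON;

-- The Extra column of EXPLAIN mentions parallel query when it is used
EXPLAIN SELECT SUM(total) FROM orders WHERE created_at > '2024-01-01';
```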
63. Other helpful stuff
1. Use START TRANSACTION READ ONLY; (less bookkeeping for read-only txns)
2. Run an explain on your query; be aware if an index is used.
a. Explains are not always accurate
3. Show processlist; (I used it to kill long running / sleeping transactions - no mercy :) )
4. Show engine innodb status;
5. You have an index on the group by columns, but the order by columns are not in the index?
6. Joining 2 tables - think of 2 for loops (keep the outer for loop short)
7. Query Cache - apparently works well in Aurora! (discouraged in RDS/MySQL)
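Items 1 and 3 above can be sketched as follows (the `orders` table and the connection id are hypothetical examples):

```sql
-- Read-only transaction: InnoDB skips transaction-id and undo bookkeeping
START TRANSACTION READ ONLY;
SELECT COUNT(*) FROM orders;
COMMIT;

-- Find a long-running or sleeping connection (note the Id and Time columns),
-- then terminate it
SHOW PROCESSLIST;
KILL 12345;
```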
64. Finally
● Make 1 change at a time
○ Change
○ See effect
○ Make next change
● Keep an eye on $ cost
66. Select QNS from you;
select Thank you from me;
Who am I ?
Ex-MySQL guy at Flipkart / Data guy at Trustana
linkedin.com/in/213vishnu/
mash213.wordpress.com/conferences/
https://twitter.com/sweetweet213