HOT Understanding this important update optimization

© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Grant McAlister – Senior Principal Engineer – Amazon RDS
Oct 2018
HOT
UNDERSTANDING THIS IMPORTANT UPDATE OPTIMIZATION

What is HOT - Heap Only Tuples
The Heap Only Tuple (HOT) feature eliminates redundant index entries and
allows the re-use of space taken by DELETEd or obsoleted UPDATEd tuples
without performing a table-wide vacuum. It does this by allowing
single-page vacuuming, also called "defragmentation".
Full description - src/backend/access/heap/README.HOT

Regular Update
heap
1 2 lp
index
leaf
index A index B index C
tuple v1
block0 block1

Regular Update
heap
1 2 lp
index
leaf
tuple v1
block0 block1
tuple v2

pageinspect – regular update
postgres=# update benchmark_uuid2 set last_updated = now() where id=4;
UPDATE 1
-----------+----------+-------+-------+-------+---------+----------+---------+-----------
0/15E67A8 | 0 | 0 | 32 | 7824 | 8192 | 8192 | 4 | 1862
(1 row)
postgres=# select lp, lp_off, case when lp_flags = 0 THEN 'LP_UNUSED' when lp_flags=1 THEN 'LP_NORMAL' when
lp_flags=2 THEN 'LP_REDIRECT' when lp_flags=3 THEN 'LP_DEAD' END as lp_flags, lp_len, t_xmin, t_xmax,
t_ctid,infomask(t_infomask, 1) as infomask,infomask(t_infomask2, 2) as infomask2 from
----+--------+-----------+--------+--------+--------+--------+----------------------------------+-----------
1 | 8008 | LP_NORMAL | 184 | 1861 | 1862 | (0,2) | XMIN_COMMITTED|HASVARWIDTH |
2 | 7824 | LP_NORMAL | 184 | 1862 | 0 | (0,2) | UPDATED|XMAX_INVALID|HASVARWIDTH |
(2 rows)
------------+-------+---------+-------+------+-------------------------
1 | (0,2) | 16 | f | f | 04 00 00 00 00 00 00 00
2 | (0,1) | 16 | f | f | 04 00 00 00 00 00 00 00
(2 rows)

HOT Update
heap
1 lp
index
leaf
tuple v1
block0 block1

HOT Update
heap
1 lp
index
leaf
tuple v1
block0 block1
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple v2

HOT Update
heap
1 2 lp
index
leaf
tuple v1
block0 block1
tuple v2

HOT Update
heap
31 2 lp
index
leaf
tuple v1
block0 block1
tuple v2
tuple v3

pageinspect – first hot update
postgres=# update benchmark_uuid2 set e=cast(0 as boolean) where id = 5;
UPDATE 1
-----------+----------+-------+-------+-------+---------+----------+---------+-----------
0/15EC1A0 | 0 | 0 | 32 | 7824 | 8192 | 8192 | 4 | 1865
(1 row)
postgres=# select lp, lp_off, case when lp_flags = 0 THEN 'LP_UNUSED' when lp_flags=1 THEN 'LP_NORMAL' when
lp_flags=2 THEN 'LP_REDIRECT' when lp_flags=3 THEN 'LP_DEAD' END as lp_flags, lp_len, t_xmin, t_xmax,
t_ctid,infomask(t_infomask, 1) as infomask,infomask(t_infomask2, 2) as infomask2 from
----+--------+-----------+--------+--------+--------+--------+----------------------------------+-----------------
1 | 8008 | LP_NORMAL | 184 | 1864 | 1865 | (0,2) | XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED
2 | 7824 | LP_NORMAL | 184 | 1865 | 0 | (0,2) | UPDATED|XMAX_INVALID|HASVARWIDTH | HEAP_ONLY_TUPLE
(2 rows)
------------+-------+---------+-------+------+-------------------------
1 | (0,1) | 16 | f | f | 05 00 00 00 00 00 00 00
(1 row)

pageinspect – second hot update
postgres=# update benchmark_uuid2 set e=cast(1 as boolean) where id = 5;
UPDATE 1
-----------+----------+-------+-------+-------+---------+----------+---------+-----------
0/15EC4C8 | 0 | 0 | 36 | 7640 | 8192 | 8192 | 4 | 1865
(1 row)
postgres=# select lp, lp_off, case when lp_flags = 0 THEN 'LP_UNUSED' when lp_flags=1 THEN 'LP_NORMAL' when lp_flags=2 THEN 'LP_REDIRECT'
when lp_flags=3 THEN 'LP_DEAD' END as lp_flags, lp_len, t_xmin, t_xmax, t_ctid,infomask(t_infomask, 1) as infomask,infomask(t_infomask2,
2) as infomask2 from heap_page_items(get_raw_page('benchmark_uuid2', 0));
----+--------+-----------+--------+--------+--------+--------+-------------------------------------------+-----------------------------
1 | 8008 | LP_NORMAL | 184 | 1864 | 1865 | (0,2) | XMAX_COMMITTED|XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED
2 | 7824 | LP_NORMAL | 184 | 1865 | 1866 | (0,3) | UPDATED|XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED|HEAP_ONLY_TUPLE
3 | 7640 | LP_NORMAL | 184 | 1866 | 0 | (0,3) | UPDATED|XMAX_INVALID|HASVARWIDTH | HEAP_ONLY_TUPLE
(3 rows)
------------+-------+---------+-------+------+-------------------------
1 | (0,1) | 16 | f | f | 05 00 00 00 00 00 00 00
(1 row)

HOT Update - Pruning
heap
31 2 lp
index
leaf
tuple v1
block0 block1
tuple v2
tuple v3

heap
31 2 lp
index
leaf
block0 block1
tuple v2
tuple v3
X

heap
31 2 lp
index
leaf
block0 block1
tuple v2
tuple v3

heap
31 2 lp
index
leaf
block0 block1
tuple v3

pageinspect – pruning (almost full page)
postgres=# select lp, lp_off, case when lp_flags = 0 THEN 'LP_UNUSED' when lp_flags=1 THEN 'LP_NORMAL' when lp_flags=2 THEN 'LP_REDIRECT' when lp_flags=3 THEN
'LP_DEAD' END as lp_flags, lp_len, t_xmin, t_xmax, t_ctid,infomask(t_infomask, 1) as infomask,infomask(t_infomask2, 2) as infomask2 from
----+--------+-----------+--------+--------+--------+--------+---------------------------------------------------+-----------------------------
1 | 8008 | LP_NORMAL | 184 | 1864 | 1865 | (0,2) | XMAX_COMMITTED|XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED
2 | 7824 | LP_NORMAL | 184 | 1865 | 1866 | (0,3) | UPDATED|XMAX_COMMITTED|XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED| HEAP_ONLY_TUPLE
6 | 7088 | LP_NORMAL | 184 | 1869 | 1870 | (0,7) | UPDATED|XMAX_COMMITTED|XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED|HEAP_ONLY_TUPLE
10 | 6352 | LP_NORMAL | 184 | 1873 | 1874 | (0,11) | UPDATED|XMAX_COMMITTED|XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED |HEAP_ONLY_TUPLE
...
33 | 2120 | LP_NORMAL | 184 | 1896 | 1897 | (0,34) | UPDATED|XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED |HEAP_ONLY_TUPLE
34 | 1936 | LP_NORMAL | 184 | 1897 | 0 | (0,34) | UPDATED|XMAX_INVALID|HASVARWIDTH | HEAP_ONLY_T UPLE
(34 rows)

pageinspect – pruned
----+--------+-------------+--------+--------+--------+--------+---------------------------------------------------+-----------------------------
1 | 40 | LP_REDIRECT | 0 | | | | |
2 | 7824 | LP_NORMAL | 184 | 1904 | 1905 | (0,3) | UPDATED|XMIN_COMMITTED|HASVARWIDTH | HOT_UPDATED|HEAP_ONLY_TUPLE
3 | 7640 | LP_NORMAL | 184 | 1905 | 0 | (0,3) | UPDATED|XMAX_INVALID|HASVARWIDTH | HEAP_ONLY_ TUPLE
4 | 0 | LP_UNUSED | 0 | | | | |
5 | 0 | LP_UNUSED | 0 | | | | |
6 | 0 | LP_UNUSED | 0 | | | | |
7 | 0 | LP_UNUSED | 0 | | | | |
8 | 0 | LP_UNUSED | 0 | | | | |
9 | 0 | LP_UNUSED | 0 | | | | |
10 | 0 | LP_UNUSED | 0 | | | | |
11 | 0 | LP_UNUSED | 0 | | | | |
12 | 0 | LP_UNUSED | 0 | | | | |
13 | 0 | LP_UNUSED | 0 | | | | |
...
28 | 0 | LP_UNUSED | 0 | | | | |
29 | 0 | LP_UNUSED | 0 | | | | |
30 | 0 | LP_UNUSED | 0 | | | | |
31 | 0 | LP_UNUSED | 0 | | | | |
32 | 0 | LP_UNUSED | 0 | | | | |
33 | 0 | LP_UNUSED | 0 | | | | |
34 | 0 | LP_UNUSED | 0 | | | | |
35 | 0 | LP_UNUSED | 0 | | | | |
36 | 0 | LP_UNUSED | 0 | | | | |
37 | 0 | LP_UNUSED | 0 | | | | |
38 | 0 | LP_UNUSED | 0 | | | | |
39 | 0 | LP_UNUSED | 0 | | | | |
(40 rows)

Fillfactor
heap
tuple v1
block0 block1
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple
block2
fillfactor=100 (default)
Continuous inserts and update the same single tuple 100 times

Fillfactor
heap
tuple v1
block0 block1
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple v2
block2
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple

Fillfactor
heap
tuple v1
block0 block1
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple v2 tuple v3
block2
tuple
tuple
tuple
tuple
tuple
tuple
tuple
tuple

Fillfactor
heap
tuple v1
block0 block1
tuple
tuple
tuple
tuple
tuple
tuple
tuple
block2
fillfactor=90

Fillfactor
heap
tuple v1
block0 block1
tuple
tuple
tuple
tuple
tuple
tuple
tuple
block2
fillfactor=90
tuple v2

Fillfactor
heap
tuple v1
block0 block1
tuple
tuple
tuple
tuple
tuple
tuple
tuple
block2
tuple
tuple
tuple
tuple
tuple
tuple
tuple
fillfactor=90
tuple v2 tuple v3
tuple

Fillfactor
heap
tuple v1
block0 block1
tuple
block2
tuple
fillfactor=10
tuple
tuple
tuple

Fillfactor
heap
tuple v1
block0 block1
tuple
block2
tuple
fillfactor=10
tuple
tuple v2
tuple
tuple

Fillfactor
heap
tuple v1
block0 block1
tuple
block2
tuple
fillfactor=10
tuple
tuple v2
tuple v3
tuple
tuple

Fillfactor on Insert & Single Update Workload
Insert at 2K TPS and update one row 100 times while
having a long running transaction
Fillfactor Single Key Fetch Table Scan
100
90
50
10
101 blocks
18 blocks
5 blocks
3 blocks
5.5K blocks
6K blocks
11K blocks
60K blocks
Bloat comes
in many
disguises

100
90
50
10
5.5K blocks
6K blocks
11K blocks
60K blocks
2 blocks
1 block
1 block Bloat comes
in many
disguises

100
90
50
10
5.5K blocks
6K blocks
11K blocks
60K blocks
2 blocks
1 block
1 block
1 block
Bloat comes
in many
disguises

0
1,000
2,000
3,000
4,000
5,000
6,000
7,000
8,000
1 2 3 4 5 6 7 8 9 101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
SizeinKB
Minutes
Heap + One Index - Update Single Row
Regular Long Transaction Regular 1 Min Transaction
Regular No Transaction HOT Long Transaction
HOT 1 Min Transaction HOT No Transaction

0
100
200
300
400
500
600
700
800
900
1 2 3 4 5 6 7 8 9 101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
SizeinKB
Minutes
One Index - Update Single Row
Regular Long Transaction Regular 1 Min Transaction
Regular No Transaction Hot Long Transaction
Hot 1 Min Transaction Hot No Transaction

Measuring Longest Running Transaction
postgres=# select max(now() - xact_start ) from pg_stat_activity;
max
-----------------
02:02:48.021408

One the fly cleanup - HEAP
heap
1 2 lp
index
leaf
tuple v1
block0 block1
tuple v2

One the fly cleanup - HEAP
heap
1 2 lp
index
leaf
block0 block1
tuple v2

On the fly Heap Tuple Pruning – Full Table
----+--------+-----------+--------+---------+---------+--------+---------------------------------------------------+-----------
1 | 7976 | LP_NORMAL | 216 | 3265456 | 3265457 | (0,2) | XMAX_COMMITTED|XMIN_COMMITTED|HASVARWIDTH |
2 | 7760 | LP_NORMAL | 216 | 3265457 | 3265458 | (0,3) | UPDATED|XMAX_COMMITTED|XMIN_COMMITTED|HASVARWIDTH |
18 | 4304 | LP_NORMAL | 216 | 3265473 | 3265474 | (0,19) | UPDATED|XMIN_COMMITTED|HASVARWIDTH |
(19 rows)

On the fly Heap Tuple Pruning – Pruned Table
----+--------+-----------+--------+---------+---------+--------+------------------------------------+-----------
1 | 0 | LP_DEAD | 0 | | | | |
2 | 0 | LP_DEAD | 0 | | | | |
3 | 0 | LP_DEAD | 0 | | | | |
4 | 0 | LP_DEAD | 0 | | | | |
5 | 0 | LP_DEAD | 0 | | | | |
6 | 0 | LP_DEAD | 0 | | | | |
7 | 0 | LP_DEAD | 0 | | | | |
8 | 0 | LP_DEAD | 0 | | | | |
9 | 0 | LP_DEAD | 0 | | | | |
10 | 0 | LP_DEAD | 0 | | | | |
11 | 0 | LP_DEAD | 0 | | | | |
12 | 0 | LP_DEAD | 0 | | | | |
13 | 0 | LP_DEAD | 0 | | | | |
14 | 0 | LP_DEAD | 0 | | | | |
15 | 0 | LP_DEAD | 0 | | | | |
16 | 0 | LP_DEAD | 0 | | | | |
17 | 0 | LP_DEAD | 0 | | | | |
18 | 0 | LP_DEAD | 0 | | | | |
19 | 7976 | LP_NORMAL | 216 | 3265474 | 3265475 | (0,20) | UPDATED|XMIN_COMMITTED|HASVARWIDTH |
(20 rows)

On the fly Heap Tuple Pruning – Index Not!
------------+--------+---------+-------+------+-------------------------
1 | (0,20) | 16 | f | f | b7 ab 03 00 00 00 00 00
2 | (0,19) | 16 | f | f | b7 ab 03 00 00 00 00 00
3 | (0,18) | 16 | f | f | b7 ab 03 00 00 00 00 00
4 | (0,17) | 16 | f | f | b7 ab 03 00 00 00 00 00
5 | (0,16) | 16 | f | f | b7 ab 03 00 00 00 00 00
6 | (0,15) | 16 | f | f | b7 ab 03 00 00 00 00 00
7 | (0,14) | 16 | f | f | b7 ab 03 00 00 00 00 00
8 | (0,13) | 16 | f | f | b7 ab 03 00 00 00 00 00
9 | (0,12) | 16 | f | f | b7 ab 03 00 00 00 00 00
10 | (0,11) | 16 | f | f | b7 ab 03 00 00 00 00 00
11 | (0,10) | 16 | f | f | b7 ab 03 00 00 00 00 00
12 | (0,9) | 16 | f | f | b7 ab 03 00 00 00 00 00
13 | (0,8) | 16 | f | f | b7 ab 03 00 00 00 00 00
14 | (0,7) | 16 | f | f | b7 ab 03 00 00 00 00 00
15 | (0,6) | 16 | f | f | b7 ab 03 00 00 00 00 00
16 | (0,5) | 16 | f | f | b7 ab 03 00 00 00 00 00
17 | (0,4) | 16 | f | f | b7 ab 03 00 00 00 00 00
18 | (0,3) | 16 | f | f | b7 ab 03 00 00 00 00 00
19 | (0,2) | 16 | f | f | b7 ab 03 00 00 00 00 00
20 | (0,1) | 16 | f | f | b7 ab 03 00 00 00 00 00
(20 rows)

One the fly cleanup - Index
heap
1 2 lp
index
leaf
block0 block1
tuple v2

Index on fly cleanup – almost full page
------------+---------+---------+-------+------+-------------------------
1 | (1,116) | 16 | f | f | b5 ab 03 00 00 00 00 00
2 | (1,115) | 16 | f | f | b5 ab 03 00 00 00 00 00
3 | (1,114) | 16 | f | f | b5 ab 03 00 00 00 00 00
4 | (1,113) | 16 | f | f | b5 ab 03 00 00 00 00 00
5 | (1,112) | 16 | f | f | b5 ab 03 00 00 00 00 00
6 | (1,111) | 16 | f | f | b5 ab 03 00 00 00 00 00
7 | (1,110) | 16 | f | f | b5 ab 03 00 00 00 00 00
8 | (1,109) | 16 | f | f | b5 ab 03 00 00 00 00 00
9 | (1,108) | 16 | f | f | b5 ab 03 00 00 00 00 00
10 | (1,107) | 16 | f | f | b5 ab 03 00 00 00 00 00
11 | (1,106) | 16 | f | f | b5 ab 03 00 00 00 00 00
12 | (1,105) | 16 | f | f | b5 ab 03 00 00 00 00 00
…
397 | (0,11) | 16 | f | f | b5 ab 03 00 00 00 00 00
398 | (0,10) | 16 | f | f | b5 ab 03 00 00 00 00 00
399 | (0,9) | 16 | f | f | b5 ab 03 00 00 00 00 00
400 | (0,8) | 16 | f | f | b5 ab 03 00 00 00 00 00
401 | (0,7) | 16 | f | f | b5 ab 03 00 00 00 00 00
402 | (0,6) | 16 | f | f | b5 ab 03 00 00 00 00 00
403 | (0,5) | 16 | f | f | b5 ab 03 00 00 00 00 00
404 | (0,4) | 16 | f | f | b5 ab 03 00 00 00 00 00
405 | (0,3) | 16 | f | f | b5 ab 03 00 00 00 00 00
406 | (0,2) | 16 | f | f | b5 ab 03 00 00 00 00 00
407 | (0,1) | 16 | f | f | b5 ab 03 00 00 00 00 00
(407 rows)

Index on fly cleanup – cleaned up
------------+---------+---------+-------+------+-------------------------
1 | (1,117) | 16 | f | f | b5 ab 03 00 00 00 00 00
2 | (1,116) | 16 | f | f | b5 ab 03 00 00 00 00 00
(2 rows)

Vacuum
heap
1 2 lp
index
leaf
block0 block1

Vacuum
heap
1 2 lp
index
leaf
block0 block1
Scan to
find id’s

Vacuum Block Cleanup – Lots of DEAD LP’s
-----+--------+----------+--------+--------+--------+--------+----------+-----------
1 | 0 | LP_DEAD | 0 | | | | |
2 | 0 | LP_DEAD | 0 | | | | |
3 | 0 | LP_DEAD | 0 | | | | |
4 | 0 | LP_DEAD | 0 | | | | |
5 | 0 | LP_DEAD | 0 | | | | |
6 | 0 | LP_DEAD | 0 | | | | |
7 | 0 | LP_DEAD | 0 | | | | |
8 | 0 | LP_DEAD | 0 | | | | |
9 | 0 | LP_DEAD | 0 | | | | |
10 | 0 | LP_DEAD | 0 | | | | |
...
281 | 0 | LP_DEAD | 0 | | | | |
282 | 0 | LP_DEAD | 0 | | | | |
283 | 0 | LP_DEAD | 0 | | | | |
284 | 0 | LP_DEAD | 0 | | | | |
285 | 0 | LP_DEAD | 0 | | | | |
286 | 0 | LP_DEAD | 0 | | | | |
287 | 0 | LP_DEAD | 0 | | | | |
288 | 0 | LP_DEAD | 0 | | | | |
289 | 0 | LP_DEAD | 0 | | | | |
290 | 0 | LP_DEAD | 0 | | | | |
291 | 0 | LP_DEAD | 0 | | | | |
(291 rows)

Vacuum Block Cleanup – Unused LP’s
-----+--------+-----------+--------+--------+--------+--------+----------+-----------
1 | 0 | LP_UNUSED | 0 | | | | |
2 | 0 | LP_UNUSED | 0 | | | | |
3 | 0 | LP_UNUSED | 0 | | | | |
4 | 0 | LP_UNUSED | 0 | | | | |
5 | 0 | LP_UNUSED | 0 | | | | |
6 | 0 | LP_UNUSED | 0 | | | | |
7 | 0 | LP_UNUSED | 0 | | | | |
8 | 0 | LP_UNUSED | 0 | | | | |
9 | 0 | LP_UNUSED | 0 | | | | |
10 | 0 | LP_UNUSED | 0 | | | | |
...
281 | 0 | LP_UNUSED | 0 | | | | |
282 | 0 | LP_UNUSED | 0 | | | | |
283 | 0 | LP_UNUSED | 0 | | | | |
284 | 0 | LP_UNUSED | 0 | | | | |
285 | 0 | LP_UNUSED | 0 | | | | |
286 | 0 | LP_UNUSED | 0 | | | | |
287 | 0 | LP_UNUSED | 0 | | | | |
288 | 0 | LP_UNUSED | 0 | | | | |
289 | 0 | LP_UNUSED | 0 | | | | |
290 | 0 | LP_UNUSED | 0 | | | | |
291 | 0 | LP_UNUSED | 0 | | | | |
(291 rows)

Is that an update? ORM cases
ORM and other software sometimes update all columns in table including all indexed columns
UPDATE to NEW VALUE
postgres=# update benchmark_uuid2 set last_updated = now() where id=2;
UPDATE 1
postgres=# select n_tup_upd, n_tup_hot_upd from pg_stat_all_tables where relname = 'benchmark_uuid2';
n_tup_upd | n_tup_hot_upd
-----------+---------------
1 | 0
UPDATE to SAME VALUE (i.e. ORM case)
postgres=# update benchmark_uuid2 set last_updated = ( select last_updated benchmark_uuid2 where id=2)
where id=2;
UPDATE 1
postgres=# select n_tup_upd, n_tup_hot_upd from pg_stat_all_tables where relname = 'benchmark_uuid2';
n_tup_upd | n_tup_hot_upd
-----------+---------------
2 | 1
(1 row)
!=

Index Tests – A table with 2 to 64 indexes
Create the table with a PK, 64 Random int Columns and 1 last updated timestamp
create table benchmark_serial (
pk serial constraint pk_benchmark_serial_pk PRIMARY KEY,
a1 int not null,
…..
a64 int not null,
last_updated timestamp
);
Create between 2 and 64 indexes on the random int columns
create index i_benchmark_serial_a1 on benchmark_serial (a1);
….
create index i_benchmark_serial_a64 on benchmark_serial (a64);
For the Regular Test
create index i_benchmark_serial_lu on benchmark_serial (last_updated);

Full Page Writes
Block in
Memory
PostgreSQL
update t set y = 6;
Full
Block
WAL

Full Page Writes
Block in
Memory
PostgreSQL
update t set y = 6;
Checkpoint
Datafile
Full
Block
WAL
Archive

Full Page Writes
Block in
Memory
PostgreSQL
update t set y = 6;
Checkpoint
Datafile
Full
Block
WAL
Archive
4K
4K
8K

Full Page Writes
Block in
Memory
PostgreSQL
update t set y = 6;
Checkpoint
Datafile
Full
Block
WAL
Archive
4K
4K
8K
During crash
recovery
PostgreSQL
uses the FPW
block in the
WAL to replace
the bad
checkpointed
block

1-
200
1-
100
101-
200
1-50
51-
100
1-25
26-
50
51-
75
76-
100
101-
150
151-
200
101-
125
126-
150
151-
175
176-
200
Insert a Sequence Number into a B-tree

1-
200
1-
100
101-
200
Insert 201
1-50
51-
100
1-25
26-
50
51-
75
76-
100
101-
150
151-
200
101-
125
126-
150
151-
175
176-
200
1-
201
101-
201
151-
201
176-
201
4 blocks
loaded

1-
200
1-
100
101-
200
Insert 201
1-50
51-
100
1-25
26-
50
51-
75
76-
100
101-
150
151-
200
101-
125
126-
150
151-
175
176-
200
Insert 202
1-
201
101-
201
151-
201
176-
201
1-
202
101-
202
151-
202
176-
202
4 blocks
loaded
0 blocks
loaded

1-
200
1-
100
101-
200
Insert 201
1-50
51-
100
1-25
26-
50
51-
75
76-
100
101-
150
151-
200
101-
125
126-
150
151-
175
176-
200
Insert 202
1-
201
101-
201
151-
201
176-
201
1-
202
101-
202
151-
202
176-
202
4 blocks
loaded
0 blocks
loaded
At least 1 FPW

1-
200
1-
100
101-
200
Insert
124
1-50
51-
100
1-25
26-
50
51-
75
76-
100
101-
150
151-
200
101-
125
126-
150
151-
175
176-
200
1-
200
101-
200
101-
150
101-
125
4 blocks
loaded
Insert a Random value into a B-tree

1-
200
1-
100
101-
200
Insert
124
1-50
51-
100
1-25
26-
50
51-
75
76-
100
101-
150
151-
200
101-
125
126-
150
151-
175
176-
200
Insert 99
1-
200
101-
200
101-
150
101-
125
4 blocks
loaded
1-
100
51-
100
76-
100
3 blocks
loaded

1-
200
1-
100
101-
200
Insert
124
1-50
51-
100
1-25
26-
50
51-
75
76-
100
101-
150
151-
200
101-
125
126-
150
151-
175
176-
200
Insert 99
1-
200
101-
200
101-
150
101-
125
4 blocks
loaded
1-
100
51-
100
76-
100
3 blocks
loaded
151-
200
151-
200
2 blocks
loaded
Insert 161
At least 3 FPW

HOT Updates – Looking at FPW in the logs
HOT Updated
Heap 14/ 68, , d: HOT_UPDATE off 19 xmax 2327993188 ; new off 3 xmax 0, blkref #0: rel 1663/41083/41086 blk 28
XLOG 0/ 3368, , d: FPI_FOR_HINT , blkref #0: rel 1663/41083/41092 blk 1492899 FPW
Transaction 8/ 34, , d: COMMIT 2017-09-07 00:07:17.532647 UTC
Non HOT Update
Heap 14/ 75, , d: UPDATE off 67 xmax 2327993195 ; new off 7 xmax 0, blkref #0: rel 1663/41083/41086 blk 285
Btree 2/ 120, , d: INSERT_LEAF off 17, blkref #0: rel 1663/41083/41090 blk 7039952
Btree 2/ 64, , d: INSERT_LEAF off 205, blkref #0: rel 1663/41083/41092 blk 29
Btree 2/ 2639, , d: INSERT_LEAF off 73, blkref #0: rel 1663/41083/41093 blk 4 FPW
Transaction 8/ 34, , d: COMMIT 2017-09-07 00:24:29.427017 UTC
3.4K VS 16.7K

-
5,000
10,000
15,000
20,000
25,000
30,000
35,000
40,000
45,000
2 4 8 16 32 64
TransactionsPerSecond(TPS)
Number of Indexes
Updates with 100 Clients
HOT Regular
35X

Index Tests – Vacuuming
benchdb=> vacuum verbose benchmark_serial;
INFO: vacuuming "public.benchmark_serial"
INFO: scanned index "pk_benchmark_serial_pk" to remove 1939503 row versions
DETAIL: CPU 0.05s/4.93u sec elapsed 5.52 sec
INFO: scanned index "i_benchmark_serial_a1" to remove 1939503 row versions
…..
…..

0
100
200
300
400
500
600
700
2 4 8 16 32 64
Seconds
Number of Indexes
Vacuum Time per 1M Updates
HOT Regular
101X

Example – Keep track of table changes
PK C1 C2 C3 C4 C4 Last Updated
1 X A1 FOO HOT 9 01-Jun-1999
2 Y A2 BAR IS 9 01-Jul-2001
3 Z A2 RLL REALLY 7 19-OCT-2009
4 A A1 MFM USEFUL 2 21-OCT-1972

1 X A1 FOO HOT 9 01-Jun-1999
2 Y A2 BAR IS 9 01-Jul-2001
Data Lake

1 X A1 FOO HOT 9 01-Jun-1999
2 Y A2 BAR IS 9 01-Jul-2001
Data Lake Full
Table
Scan

Example – Build an index on Last Updated
1 X A1 FOO HOT 9 01-Jun-1999
2 Y A2 BAR IS 9 01-Jul-2001
Data Lake Index
Scan

Example – Build an index on Last Updated
1 X A1 FOO HOT 9 01-Jun-1999
2 Y A2 BAR IS 9 01-Jul-2001
Data Lake Index
Scan
Before: Updates to C1,C3,C4, Last Updated are HOT
After: Every update is a regular update

Example – Logical Replication
1 X A1 FOO HOT 9 01-Jun-1999
2 Y A2 BAR IS 9 01-Jul-2001
All changes
WAL logged
WAL
Logical
Decoding
and
Replication
Data Lake

HOT Understanding this important update optimization

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to HOT Understanding this important update optimization

Similar to HOT Understanding this important update optimization (20)

More from Grant McAlister

More from Grant McAlister (10)

Recently uploaded

Recently uploaded (20)

HOT Understanding this important update optimization

Editor's Notes