Dive into EXPLAIN - PostgreSQL

Dmytro Shylovskyi
CTO

What kind of beast is EXPLAIN?
EXPLAIN shows which execution plan the planner chose for a query.
The planner's job is to minimize the estimated execution cost of the query.
EXPLAIN ANALYZE actually executes the query and supplements the EXPLAIN output with real measurements (actual time, row counts, loops).

Scan methods
Sequential Scan (Seq Scan)
Index Scan
Bitmap Heap Scan

Sequential Scan
EXPLAIN ANALYZE select * from users u;
Query plan
Seq Scan on users u (cost=0.00..118835.16 rows=373716 width=3260) (actual time=0.009..460.220 rows=372892 loops=1)
Planning Time: 0.939 ms
Execution Time: 474.262 ms

EXPLAIN metrics
cost - the estimated startup cost and the estimated total cost, in arbitrary planner units. For a sequential scan the total cost is computed as:
(disk pages read * seq_page_cost) + (rows scanned * cpu_tuple_cost)
rows - the estimated number of rows the plan node will output
width - the estimated average size of the rows output by the plan node (in bytes)
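
As a worked example of the sequential scan formula, the sketch below plugs in the default settings seq_page_cost = 1.0 and cpu_tuple_cost = 0.01. The page count of 115098 is not stated in the slides; it is an assumption back-solved from the Seq Scan plan for the users table (total cost 118835.16, rows=373716), so treat it as illustrative. The function name is hypothetical.

```python
# Default planner settings in PostgreSQL; a given server may override them.
SEQ_PAGE_COST = 1.0
CPU_TUPLE_COST = 0.01


def seq_scan_total_cost(pages, rows,
                        seq_page_cost=SEQ_PAGE_COST,
                        cpu_tuple_cost=CPU_TUPLE_COST):
    """Estimated total cost of an unfiltered sequential scan."""
    return pages * seq_page_cost + rows * cpu_tuple_cost


# Assumption: the users table occupies 115098 pages (inferred, not given).
estimated = seq_scan_total_cost(pages=115098, rows=373716)
print(f"{estimated:.2f}")
```

With these inputs the result is 118835.16, the upper cost bound reported by the Seq Scan node.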

Sequential Scan
EXPLAIN ANALYZE select * from users u where name = 'Дима';
Gather (cost=1000.00..118045.61 rows=14 width=2761) (actual time=0.906..48.498 rows=141 loops=1)
  Workers Planned: 2
  Workers Launched: 2
  -> Parallel Seq Scan on users u (cost=0.00..117044.21 rows=6 width=2761) (actual time=0.767..44.362 rows=47 loops=3)
       Filter: ((name)::text = 'Дима'::text)
       Rows Removed by Filter: 124229
The filter does not reduce the cost of the scan: every row still has to be read and checked, so the scan costs about the same as without the condition (slightly more, because the planner charges cpu_operator_cost per row for evaluating the filter).
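
How a filter enters the estimate can be sketched with the planner's default settings. Each scanned row is charged cpu_tuple_cost plus cpu_operator_cost (default 0.0025) for evaluating the condition, and in a parallel scan the CPU part is divided among the participating processes. Two things here are assumptions rather than facts from the slides: the divisor of 2.4 for two workers (two workers plus a partially participating leader), and the page count of 115098, which is back-solved from the plans for the users table. Function names are hypothetical.

```python
SEQ_PAGE_COST = 1.0
CPU_TUPLE_COST = 0.01
CPU_OPERATOR_COST = 0.0025


def parallel_divisor(workers):
    """Assumed CPU divisor: the leader contributes less as workers are added."""
    if workers == 0:
        return 1.0
    leader_share = max(0.0, 1.0 - 0.3 * workers)
    return workers + leader_share


def filtered_seq_scan_cost(pages, rows, workers=0):
    """Estimated total cost of a Seq Scan that evaluates one simple filter."""
    cpu_cost = rows * (CPU_TUPLE_COST + CPU_OPERATOR_COST)
    return pages * SEQ_PAGE_COST + cpu_cost / parallel_divisor(workers)


# Assumption: 115098 pages and 373716 rows, as inferred for the users table.
serial = filtered_seq_scan_cost(pages=115098, rows=373716)
parallel = filtered_seq_scan_cost(pages=115098, rows=373716, workers=2)
print(f"serial:   {serial:.2f}")
print(f"parallel: {parallel:.2f}")
```

Under these assumptions the parallel figure comes out at 117044.44, the total shown for the Parallel Seq Scan node in the `limit 10` plan, while the serial figure (119769.45) is higher than the 118835.16 of the unfiltered scan.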

Sequential Scan
EXPLAIN ANALYZE select * from users u where name = 'Дима' limit 10;
Limit (cost=1000.00..84604.17 rows=10 width=3260) (actual time=0.924..8.483 rows=10 loops=1)
  -> Gather (cost=1000.00..118045.84 rows=14 width=3260) (actual time=0.923..8.480 rows=10 loops=1)
       Workers Planned: 2
       Workers Launched: 2
       -> Parallel Seq Scan on users u (cost=0.00..117044.44 rows=6 width=3260) (actual time=0.662..4.601 rows=5 loops=3)
            Filter: ((name)::text = 'Дима'::text)
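
The total cost on the Limit node in this plan can be reproduced with a little arithmetic. The sketch assumes the planner interpolates linearly between the child's startup cost and total cost, according to the fraction of the child's rows the limit is expected to fetch; the function name is hypothetical.

```python
def limit_total_cost(child_startup, child_total, limit_rows, child_rows):
    """Startup cost plus the share of run cost needed to fetch limit_rows."""
    fraction = min(1.0, limit_rows / child_rows)
    return child_startup + (child_total - child_startup) * fraction


# Numbers taken from the Gather node: cost=1000.00..118045.84 rows=14
cost = limit_total_cost(1000.00, 118045.84, limit_rows=10, child_rows=14)
print(round(cost, 2))
```

This gives 84604.17, the upper bound shown on the Limit node, which is why a small LIMIT can make an otherwise expensive scan look cheaper to the planner.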

Index Scan
EXPLAIN ANALYZE select * from users u where id = 28;
Index Scan using users_pkey on users u (cost=0.42..8.44 rows=1 width=3250) (actual time=0.014..0.015 rows=1 loops=1)
  Index Cond: (id = 28)

Internal structure of a table file (diagram slide)

Index Scan
EXPLAIN ANALYZE select * from users u order by id limit 10;
-> Index Scan using users_pkey on users u (cost=0.42..471916.33 rows=373551 width=3275) (actual time=0.017..0.051 rows=10 loops=1)

Index Scan
EXPLAIN ANALYZE select * from users u where id > 400000 limit 10;
-> Index Scan using users_pkey on users u (cost=0.42..12406.00 rows=3133 width=3275) (actual time=0.011..0.033 rows=10 loops=1)
     Index Cond: (id > 400000)

Index Scan
EXPLAIN ANALYZE select id from users u limit 10;
-> Index Only Scan using users_pkey on users u (cost=0.42..14346.36 rows=373551 width=4) (actual time=0.011..0.013 rows=10 loops=1)
     Heap Fetches: 0

Bitmap Heap Scan
EXPLAIN ANALYZE select * from users u where created_at >= '2021-08-01';
Bitmap Heap Scan on users u (cost=347.25..28336.88 rows=9139 width=3250) (actual time=3.924..22.035 rows=8960 loops=1)
  Recheck Cond: (created_at >= '2021-08-01 00:00:00+00'::timestamp with time zone)
  Heap Blocks: exact=7153
  -> Bitmap Index Scan on users_created_at_idx (cost=0.00..344.97 rows=9139 width=0) (actual time=2.978..2.979 rows=9027 loops=1)
       Index Cond: (created_at >= '2021-08-01 00:00:00+00'::timestamp with time zone)
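
The two nodes in this plan correspond to two phases, which can be sketched roughly as follows. This is a simplified model, not PostgreSQL's implementation: the bitmap here tracks whole pages only, which is why every row on a visited page is rechecked, and the toy heap, index, and function names are all hypothetical.

```python
def bitmap_index_scan(index, predicate):
    """Phase 1: collect the heap pages that hold at least one matching entry."""
    return sorted({page for key, page in index if predicate(key)})


def bitmap_heap_scan(heap, pages, predicate):
    """Phase 2: visit each page once, in physical order, rechecking each row."""
    for page in pages:
        for row in heap[page]:
            if predicate(row["created_at"]):
                yield row


# Toy heap: page number -> rows stored on that page (hypothetical data).
heap = {
    0: [{"id": 1, "created_at": "2021-07-30"}, {"id": 2, "created_at": "2021-08-02"}],
    1: [{"id": 3, "created_at": "2021-06-01"}],
    2: [{"id": 4, "created_at": "2021-08-05"}, {"id": 5, "created_at": "2021-08-01"}],
}
# Toy index: (indexed value, page that holds the row).
index = [(row["created_at"], page) for page, rows in heap.items() for row in rows]


def is_recent(created_at):
    return created_at >= "2021-08-01"


pages = bitmap_index_scan(index, is_recent)
rows = list(bitmap_heap_scan(heap, pages, is_recent))
print(pages, [row["id"] for row in rows])
```

Page 1 is never read because the index shows it holds no matching rows; that skipping, combined with reading the remaining pages in physical order, is the point of the bitmap approach.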

Join methods
Nested Loop
Hash Join
Merge Join

Nested Loop
EXPLAIN ANALYZE select * from users u
inner join user_meta um on u.id = um.user_id
limit 10;
-> Nested Loop (cost=0.42..344484.57 rows=372933 width=4603) (actual time=0.034..0.115 rows=10 loops=1)
     -> Seq Scan on users u (cost=0.00..118833.35 rows=373535 width=2755) (actual time=0.011..0.033 rows=10 loops=1)
     -> Index Scan using user_id_unique_idx on user_meta um (cost=0.42..0.60 rows=1 width=1848) (actual time=0.006..0.006 rows=1 loops=10)
          Index Cond: (user_id = u.id)

Nested Loop
Complexity: O(N*M) when the inner side is rescanned in full for every outer row
Plan node variants:
Nested Loop
Nested Loop Left Join
Nested Loop Anti Join
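
A minimal sketch of the nested loop idea, assuming plain in-memory lists instead of tables and no index on the inner side. The sample rows and the function name are hypothetical.

```python
def nested_loop_join(outer, inner, outer_key, inner_key):
    """Compare every outer row with every inner row: O(N*M) comparisons."""
    joined = []
    for o in outer:
        for i in inner:
            if o[outer_key] == i[inner_key]:
                joined.append({**o, **i})
    return joined


users = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}, {"id": 3, "name": "c"}]
user_meta = [{"user_id": 1, "lang": "en"}, {"user_id": 3, "lang": "uk"}]

result = nested_loop_join(users, user_meta, "id", "user_id")
print([(r["id"], r["lang"]) for r in result])
```

In the plan for users and user_meta, the inner loop is an Index Scan rather than a full pass, so each outer row costs one index lookup instead of M comparisons.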

Hash Join
left join lessons l on u.id=l.student_id where u.created_at > date '2021-08-01';
Hash Right Join (cost=27633.34..277301.97 rows=50252 width=4097) (actual time=500.260..2343.089 rows=13147 loops=1)
  Hash Cond: (l.student_id = u.id)
  -> Seq Scan on lessons l (cost=0.00..244088.48 rows=2125748 width=1330) (actual time=0.011..1920.179 rows=2113972 loops=1)
  -> Hash (cost=27523.00..27523.00 rows=8827 width=2767) (actual time=40.093..40.094 rows=9096 loops=1)
       Buckets: 16384 Batches: 1 Memory Usage: 13284kB
       -> Bitmap Heap Scan on users u (cost=336.83..27523.00 rows=8827 width=2767) (actual time=2.883..20.949 rows=9096 loops=1)
            Recheck Cond: (created_at > '2021-08-01'::date)
            Heap Blocks: exact=7265
            -> Bitmap Index Scan on users_created_at_idx (cost=0.00..334.62 rows=8827 width=0) (actual time=1.924..1.924 rows=9120 loops=1)
                 Index Cond: (created_at > '2021-08-01'::date)

Hash Join
Complexity: O(N+M)
Plan node variants:
Hash Left Join
Hash Right Join
Hash Anti Join
Hash Full Join
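
The O(N+M) behaviour comes from splitting the work into a build phase and a probe phase. The sketch below shows an inner hash join over in-memory lists; it assumes the hash table fits in memory (a single batch), and the sample rows and function name are hypothetical.

```python
from collections import defaultdict


def hash_join(build_rows, probe_rows, build_key, probe_key):
    """Build a hash table on one input, then probe it once per row of the other."""
    table = defaultdict(list)
    for b in build_rows:                      # build phase: one pass, O(M)
        table[b[build_key]].append(b)
    joined = []
    for p in probe_rows:                      # probe phase: one pass, O(N)
        for b in table.get(p[probe_key], []):
            joined.append({**b, **p})
    return joined


users = [{"id": 1}, {"id": 2}]
lessons = [
    {"lesson_id": 10, "student_id": 1},
    {"lesson_id": 11, "student_id": 1},
    {"lesson_id": 12, "student_id": 3},
]

result = hash_join(users, lessons, "id", "student_id")
print([(r["id"], r["lesson_id"]) for r in result])
```

As in the plan for users and lessons, the smaller input is the natural choice for the build side, since it is the one that has to be held in memory.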

Merge Join
left join (select * from lessons where is_paid=0 order by student_id) l on u.id=l.student_id
order by u.id limit 100;
Limit (cost=246715.93..246851.60 rows=100 width=4592) (actual time=541.589..552.755 rows=100 loops=1)
  -> Merge Right Join (cost=246715.93..753755.40 rows=373727 width=4592) (actual time=541.588..552.746 rows=100 loops=1)
       Merge Cond: (lessons.student_id = u.id)
       -> Gather Merge (cost=245746.43..274355.33 rows=245202 width=1330) (actual time=541.557..552.607 rows=92 loops=1)
            Workers Planned: 2
            Workers Launched: 2
            -> Sort (cost=244746.41..245052.91 rows=122601 width=1330) (actual time=537.927..537.954 rows=104 loops=3)
                 Sort Key: lessons.student_id
                 Sort Method: quicksort Memory: 92752kB
                 Worker 0: Sort Method: quicksort Memory: 110826kB
                 -> Parallel Seq Scan on lessons (cost=0.00..234384.41 rows=122601 width=1330) (actual time=0.009..366.614 rows=96905 loops=3)
                      Filter: (is_paid = 0)
       -> Index Scan using users_pkey on users u (cost=0.42..472948.71 rows=373727 width=3262) (actual time=0.021..0.045 rows=10 loops=1)

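Merge Join
Complexity:
O(N+M) when both inputs are already sorted on the join key
O(N*log(N)+M*log(M)) when both inputs have to be sorted first
O(N+M*log(M)) when only one input has to be sorted
Plan node variants:
Merge Left Join
Merge Right Join
Merge Anti Join
Merge Full Join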
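
The linear case can be sketched as two cursors advancing over inputs that are already sorted on the join key. This is an inner merge join over in-memory lists; the sample rows and the function name are hypothetical.

```python
def merge_join(left, right, left_key, right_key):
    """Join two inputs that are already sorted on their join keys."""
    joined = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][left_key], right[j][right_key]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of right rows sharing this key, then pair it
            # with every left row that has the same key.
            j_end = j
            while j_end < len(right) and right[j_end][right_key] == lk:
                j_end += 1
            while i < len(left) and left[i][left_key] == lk:
                for r in right[j:j_end]:
                    joined.append({**left[i], **r})
                i += 1
            j = j_end
    return joined


users = [{"id": 1}, {"id": 2}, {"id": 4}]                      # sorted by id
lessons = [
    {"lesson_id": 10, "student_id": 1},
    {"lesson_id": 11, "student_id": 2},
    {"lesson_id": 12, "student_id": 2},
    {"lesson_id": 13, "student_id": 3},
]                                                              # sorted by student_id

result = merge_join(users, lessons, "id", "student_id")
print([(r["id"], r["lesson_id"]) for r in result])
```

Neither cursor ever moves backwards past a finished key, so the merge itself is a single pass over each input; any extra cost comes from sorting inputs that do not arrive in order.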

HashAggregate/GroupAggregate/Sort
EXPLAIN ANALYZE select reg_country_code, count(*) from users group by reg_country_code;
Finalize GroupAggregate (cost=118425.79..118456.45 rows=121 width=11) (actual time=166.130..167.678 rows=176 loops=1)
  Group Key: reg_country_code
  -> Gather Merge (cost=118425.79..118454.03 rows=242 width=11) (actual time=166.123..167.592 rows=468 loops=1)
       Workers Planned: 2
       Workers Launched: 2
       Sort Key: reg_country_code
       -> Partial HashAggregate (cost=117420.37..117421.58 rows=121 width=11) (actual time=163.769..163.790 rows=156 loops=3)
            Group Key: reg_country_code
            -> Parallel Seq Scan on users (cost=0.00..116646.25 rows=154825 width=3) (actual time=0.006..136.454 rows=124350 loops=3)
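
The difference between the two aggregation strategies named in this plan can be sketched as follows: HashAggregate keeps one counter per key in a hash table and accepts rows in any order, while GroupAggregate relies on sorted input so that equal keys arrive next to each other. The country codes and function names are hypothetical.

```python
from itertools import groupby


def hash_aggregate(values):
    """Count per key with a hash table; input order does not matter."""
    counts = {}
    for v in values:
        counts[v] = counts.get(v, 0) + 1
    return counts


def group_aggregate(sorted_values):
    """Count per key over sorted input; each run of equal keys is one group."""
    return {key: sum(1 for _ in run) for key, run in groupby(sorted_values)}


codes = ["UA", "PL", "UA", "DE", "PL", "UA"]
by_hash = hash_aggregate(codes)
by_group = group_aggregate(sorted(codes))
print(by_hash == by_group, by_hash)
```

Both approaches produce the same counts; the planner's choice depends on whether the input is already sorted and on how many distinct groups it expects to hold in memory.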

Unique
EXPLAIN ANALYZE select DISTINCT ON(u.id) * from users u
left join lessons l on u.id=l.student_id where email like 'webdev%' order by u.id, l.created_at limit 100;
-> Unique (cost=264201.52..264202.10 rows=20 width=4602) (actual time=154.775..155.281 rows=1 loops=1)
     Sort Key: u.id, l.created_at
     -> Gather (cost=1000.43..264197.59 rows=115 width=4602) (actual time=80.673..153.885 rows=111 loops=1)
          Workers Planned: 2
          Workers Launched: 2
          -> Nested Loop Left Join (cost=0.43..263186.09 rows=48 width=4602) (actual time=55.822..82.079 rows=37 loops=3)
               -> Parallel Seq Scan on users u (cost=0.00..117039.64 rows=8 width=3260) (actual time=55.809..55.811 rows=0 loops=3)
                    Filter: ((email)::text ~~ 'webdev%'::text)
               -> Index Scan using lessons_group_lesson_id_student_id_idx on lessons l (cost=0.43..18267.49 rows=82 width=1330) (actual time=0.029..78.700 rows=111 loops=1)
                    Index Cond: (u.id = student_id)
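
What the Unique node does for DISTINCT ON can be sketched as a single pass over sorted rows that keeps the first row of each run of equal keys. The sample rows and the function name are hypothetical.

```python
def unique_on(sorted_rows, key):
    """Keep the first row of each run of equal keys in sorted input."""
    previous = object()  # sentinel that equals no real key
    for row in sorted_rows:
        if row[key] != previous:
            yield row
            previous = row[key]


rows = [
    {"id": 1, "created_at": "2021-08-03"},
    {"id": 1, "created_at": "2021-08-01"},
    {"id": 2, "created_at": "2021-08-02"},
]
# Equivalent of: order by id, created_at
rows.sort(key=lambda r: (r["id"], r["created_at"]))

first_per_user = list(unique_on(rows, "id"))
print(first_per_user)
```

Because only adjacent rows are compared, the input must be sorted first, and the second sort column decides which row of each group survives (here the earliest created_at).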

Conclusion
The planner takes its statistics from the pg_statistic table; the pg_stats view is a more convenient way to look at them.
Tools:
pgBadger - generates reports from PostgreSQL logs
explain.depesz.com - helps read EXPLAIN output
pgMustard - suggests how to improve a query based on its EXPLAIN output

Thank you for your attention!
Any questions?

Links
https://postgrespro.ru/docs/postgrespro/9.5/using-explain
https://postgrespro.ru/docs/postgrespro/13/view-pg-stats
https://www.depesz.com/tag/unexplainable/
