3. Areas Already Looked Into…
• Code Optimization
• SQL Query Tuning
• Keeping the latest data in a separate table from the table holding historic data
4. Areas To Be Looked Into…
• DB Design
– Disk I/O
• Number of Columns in a Table
• Choose right Data Types for Columns
– Tune DB Buffer Size
• To improve Caching
– Standard recommended solution to deal with large data set tables
• Table Partitioning
• Application Design
– Tune JDBC Code and Design
– Apply Application Level Cache
5. How Postgres Stores Data
• Disk Files
– Files under path: /var/lib/pgsql/avvqdb/pgdata/base/16386/
• Page Size
– The size of a page is fixed at 8,192 bytes
– All disk I/O is performed page by page; selecting even a single row
from a table reads at least one full page
– Heap page & Index Page
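To make the 8 KB granularity concrete, here is a rough back-of-the-envelope page count (the page-header and per-tuple overhead constants are approximations; real layouts vary with alignment, NULL bitmaps, and fill factor):

```python
import math

PAGE_SIZE = 8192          # fixed PostgreSQL page size in bytes
PAGE_HEADER = 24          # approximate page header size
TUPLE_OVERHEAD = 28       # approximate per-row header + item pointer

def rows_per_page(row_payload_bytes: int) -> int:
    """Rough estimate of how many rows fit in one heap page."""
    usable = PAGE_SIZE - PAGE_HEADER
    return usable // (row_payload_bytes + TUPLE_OVERHEAD)

def pages_for(total_rows: int, row_payload_bytes: int) -> int:
    """Minimum number of 8 KB pages a table of this shape occupies."""
    return math.ceil(total_rows / rows_per_page(row_payload_bytes))

# The test table from slide 9 (~29.7M rows), assuming ~60-byte rows:
print(pages_for(29_756_342, 60))  # roughly 323K pages, ~2.6 GB of heap
```

Every page that is not already cached costs one physical read, which is why the cache-hit behaviour below matters so much.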
• Heap and Index Cache Hits
– Disk I/O is expensive
– Postgres itself tracks the access patterns of your data and
automatically keeps frequently accessed data in cache
– Caches heap and index pages
– Insertion order matters for effective caching
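Whether the cache is actually being hit can be checked from the pg_statio_user_tables statistics view; a minimal sketch (the ratio arithmetic runs standalone, the SQL string is what you would feed to psql):

```python
# SQL to inspect per-table cache behaviour (run in psql):
HIT_RATIO_SQL = """
SELECT relname,
       heap_blks_read, heap_blks_hit,
       idx_blks_read,  idx_blks_hit
FROM pg_statio_user_tables
ORDER BY heap_blks_read + idx_blks_read DESC;
"""

def hit_ratio(blks_hit: int, blks_read: int) -> float:
    """Fraction of block requests served from the buffer cache."""
    total = blks_hit + blks_read
    return blks_hit / total if total else 1.0

# Hypothetical counters: 9,000 cache hits vs 1,000 physical reads
print(round(hit_ratio(9_000, 1_000), 2))  # 0.9
```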
6. Minimize Disk I/O
– Normalize Database
• Remove unnecessary columns
– Choose Right Data Types
• Avoid larger data types when the values are small enough to fit
into smaller ones
• This has a direct impact on cache hits
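The cache-hit impact of column widths can be illustrated with the 8 KB page arithmetic from slide 5 (the overhead constants below are approximations): wider types pack fewer rows into each page, so every cached page carries less useful data.

```python
PAGE_SIZE, PAGE_HEADER, TUPLE_OVERHEAD = 8192, 24, 28  # bytes, approximate

def rows_per_page(row_bytes: int) -> int:
    """Rough estimate of rows fitting in one 8 KB heap page."""
    return (PAGE_SIZE - PAGE_HEADER) // (row_bytes + TUPLE_OVERHEAD)

wide   = rows_per_page(10 * 8)  # ten bigint (8-byte) columns
narrow = rows_per_page(10 * 4)  # same data as int (4-byte), if values fit
print(narrow, wide)  # the narrow row packs more rows into each cached page
```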
7. Tune DB Buffer Size
– Adjust the DB Buffer Size
• To improve heap block cache hits
• And to improve index block cache hits
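A commonly cited starting point (an assumption, not a measured recommendation for this workload) is shared_buffers at roughly 25% of RAM, with effective_cache_size telling the planner how much OS page cache it can also count on; for the 6 GB test machine from slide 9 that would look like:

```
# postgresql.conf -- illustrative starting values for a 6 GB host
shared_buffers = 1536MB        # ~25% of RAM; holds heap and index pages
effective_cache_size = 4GB     # planner hint: RAM likely usable as cache
```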
8. Handle Large Data Set Tables
• Table partitioning is the standard solution for large data set tables
• The average number of heap/index blocks you have to navigate to find
a row goes down
• Partitioning also helps the planner choose the right scan type for a query
• There are some maintenance advantages too. You can DROP an individual
partition, to erase all of the data from that range. This is a common
technique for pruning historical data out of a partitioned table, one that
avoids the VACUUM cleanup work that DELETE leaves behind.
• Dynamic partition rules can be set up, which minimizes maintenance
overhead and stays transparent to the application layer
• Tips:
– The choice of partition key column matters
– The number of partitions should not be too large
– Watch for race conditions when two separate transactions insert
concurrently
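The "dynamic partition rules" above presumably route each row to a day-based child table by its epoch timestamp; a minimal standalone sketch of such a routing rule (the testtable_YYYYMMDD naming scheme is hypothetical):

```python
from datetime import datetime, timezone

def child_table(epoch_seconds: int) -> str:
    """Map an epoch timestamp to its (hypothetical) daily child table."""
    day = datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)
    return f"testtable_{day:%Y%m%d}"

# 1396915200 is 2014-04-08 00:00:00 UTC, the range used in slide 9
print(child_table(1396915200))  # testtable_20140408
```

In inheritance-based partitioning (the pre-PostgreSQL-10 approach implied by the "master table with child tables" setup in slide 9), this logic lives in a BEFORE INSERT trigger on the master table; the race-condition tip applies when two transactions both try to create a missing child table at insert time.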
9. Handle Large Data Set Tables: Performance Improvement Matrix
Original Table: TestTable (Total Records: 29756342)
Master Partition Table: TestTable_Master (with 10 child tables)
OS: RHEL 6.5, CPU: 8 core, RAM: 6 GB
Query No. | Query Received Timestamp Range             | Total Records Found | Total Records in Table
----------|--------------------------------------------|---------------------|-----------------------
1st       | between 1396915200 and 1397001600 (2 days) | 12929330            | 29756342
2nd       | between 1396915200 and 1397001600 (1 day)  | 4320000             | 29757518
3rd       | between 1396915200 and 1397001600 (1 day)  | 4320000             | 29757518
10. Handle Large Data Set Tables: Performance Improvement Matrix
Query | Attempt | On Original Table | On Master Partition Table | On Master Partition Table with Parallel Queries
------|---------|-------------------|---------------------------|------------------------------------------------
1st   | 1st     | 949 sec           | 732 sec                   | 294 sec
1st   | 2nd     | 938 sec           | 549 sec                   | 290 sec
2nd   | 1st     |                   | 367 sec                   | 185 sec
3rd   | 1st     |                   | 457 sec                   | 128 sec
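The "parallel queries" column presumably means querying each child partition concurrently and merging the results in the application; a sketch with a placeholder run_count function standing in for a real JDBC/driver call:

```python
from concurrent.futures import ThreadPoolExecutor

# 10 child tables, matching the setup in slide 9 (names hypothetical)
PARTITIONS = [f"testtable_p{i}" for i in range(10)]

def run_count(table: str) -> int:
    """Placeholder for a real per-partition query such as
    SELECT count(*) FROM <table> WHERE ts BETWEEN ... AND ...;"""
    return 432_000  # stub result; a driver call would go here

with ThreadPoolExecutor(max_workers=len(PARTITIONS)) as pool:
    total = sum(pool.map(run_count, PARTITIONS))
print(total)  # 4320000, matching the "Total Records Found" for query 2
```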
11. Thank You
For any queries please reach out to me at
mchopker@gmail.com