SlideShare a Scribd company logo
In The Name Of God

large
1
Kazem Taghandiky
University of Isfahan
Taghandiky@gmail.com
Mobile:09335816144
1. Introduction
A very large database, or VLDB, is a database that contains an extremely high
number of tuples (database rows), or occupies an extremely large physical file
system storage space. The most common definition of VLDB is a database that
occupies more than 1 terabyte or contains several billion rows, although naturally
this definition changes over time

If I updated this list in 3-5 years, all of the fixed measurements would probably increase
by a factor of 10X to 100X, maybe more. Probably the most relevant description of a
“Large Database” is one that is causing you pain and forcing you to consider other
The definition of large database is always changing,
options
2

>>>See More
1. Introduction
Importan
ce

VLDB is a premier annual international forum for data management and database researchers, vendors, practitioners, application
developers, and users. The conference will feature research talks, tutorials, demonstrations, and workshops. It will cover current
issues in data management, database and information systems research. Data management and databases remain among the main
technological cornerstones of emerging applications of the twenty-first century

3
1. Introduction
The Factors that Define a

Large Database

Hardware

Throughput

Data volume

Software
4

>>>See More
2. DBMS vs. VLDB

5
2-1. Examples of VLDB

6
3. Challenges
Some of the top
challenges I hear:

 Loading a big database for a takes a long time.
 Adding rows to an existing database is slow.
 Adding or removing a column takes my table offline
for a long time.
 Adding an index takes my table offline long time.
 Backup is difficult. (Not in this talk.)
These activities are painless for small data, but can be
painful for big data.

7
3-1. Techniques for challenge
Partitioning
1.Partitioning addresses key issues in supporting very large tables and indexes
by letting you decompose them into smaller components called partitions. SQL
queries and DML statements do not need to be modified in order to access
partitioned tables.

Each partition of a table or index must have the same logical attributes, such as
column names,
1.Data types, and
2.constraints,
8
but each partition can have separate physical attributes such as table spaces.
3-1. Techniques for challenge
Overview of Partitioning Methods

Range partitioning divides a table into partitions based on a range of values. You can
specification for the partitions.

9
3-1. Techniques for challenge
Overview of Partitioning Methods
List Partitioning this by specifying a list of discrete values for the partitioning key in
You do
the description for each partition

10
3-1. Techniques for challenge
Overview of Partitioning Methods

HASH partitioning With this type of partitioning, a partition is selected based on the v
expression that operates on column values in rows to be inserted into the table

11
3-1. Techniques for challenge
SQL Server :

Oracle :

Tables can be partitioned into up to 64,000 separate partit
12
3-1. Techniques for challenge
Table partitioning delivers advantages in three main areas

• performance
• maintenance
• availability.
13

>>>See More
3-1. Techniques for challenge
Data Compression
You can use the data compression feature to help compress the data inside a
database, and to help reduce the size of the database. In addition to saving space,
data compression can help improve performance of I/O intensive workloads
because the data is stored in fewer pages and queries need to read fewer pages
from disk. However, extra CPU resources are required on the database server to
compress and decompress the data,

14
3-1. Techniques for challenge
Implementation
Row Compression

•It reduces the metadata overhead that is associated with the
record. This metadata is information about columns, their lengths
and offsets. In some cases, the metadata overhead might be larger
than the old storage format.
•It uses variable-length storage format for numeric types (for
example integer, decimal, and float) and the types that are
based on numeric (for example date time and money).
•It stores fixed character strings by using variable-length format by
not storing the blank characters

15
3-1. Techniques for challenge
Implementation
Page Compression

Compressing the leaf level of tables and indexes with page
compression consists of three operations in the following order:
1.Row compression
2.Prefix compression
3.Dictionary compression

16
3-1. Techniques for challenge

17
4. Practical

18

>>>See More
4. Practical

19

>>>See More
4. Practical

20

>>>See More
Thank You For Your Attention

21

More Related Content

What's hot

Database system concepts
Database system conceptsDatabase system concepts
Database system concepts
Kumar
 
Types of databases
Types of databasesTypes of databases
Types of databases
PAQUIAAIZEL
 
Relational database revised
Relational database revisedRelational database revised
Relational database revised
mnodalo
 

What's hot (20)

Database
DatabaseDatabase
Database
 
Database system concepts
Database system conceptsDatabase system concepts
Database system concepts
 
Database System Concepts and Architecture
Database System Concepts and ArchitectureDatabase System Concepts and Architecture
Database System Concepts and Architecture
 
DBMS languages/ Types of SQL Commands
DBMS languages/ Types of SQL CommandsDBMS languages/ Types of SQL Commands
DBMS languages/ Types of SQL Commands
 
Database Concepts and Components
Database Concepts and ComponentsDatabase Concepts and Components
Database Concepts and Components
 
data abstraction in DBMS
data abstraction in DBMSdata abstraction in DBMS
data abstraction in DBMS
 
The data mining query language
The data mining query languageThe data mining query language
The data mining query language
 
Relational Database Management System
Relational Database Management SystemRelational Database Management System
Relational Database Management System
 
Ds intro
Ds introDs intro
Ds intro
 
Database overview
Database overviewDatabase overview
Database overview
 
Data definition language
Data definition languageData definition language
Data definition language
 
Types of databases
Types of databasesTypes of databases
Types of databases
 
Data Mining: Data processing
Data Mining: Data processingData Mining: Data processing
Data Mining: Data processing
 
Introduction to data structures (ss)
Introduction to data structures (ss)Introduction to data structures (ss)
Introduction to data structures (ss)
 
RDBMS
RDBMS RDBMS
RDBMS
 
D B M S Animate
D B M S AnimateD B M S Animate
D B M S Animate
 
Good PPT for RDBMS starter
Good PPT for RDBMS starter Good PPT for RDBMS starter
Good PPT for RDBMS starter
 
Chapter25
Chapter25Chapter25
Chapter25
 
Relational database revised
Relational database revisedRelational database revised
Relational database revised
 
Big Data & Hadoop
Big Data & HadoopBig Data & Hadoop
Big Data & Hadoop
 

Similar to very large database

The High Performance DBA Optimizing Databases For High Performance
The High Performance DBA Optimizing Databases For High PerformanceThe High Performance DBA Optimizing Databases For High Performance
The High Performance DBA Optimizing Databases For High Performance
Embarcadero Technologies
 
Implementing the Databese Server session 02
Implementing the Databese Server session 02Implementing the Databese Server session 02
Implementing the Databese Server session 02
Guillermo Julca
 
1414Database DesignDatabase design is the process o.docx
 1414Database DesignDatabase design is the process o.docx 1414Database DesignDatabase design is the process o.docx
1414Database DesignDatabase design is the process o.docx
joyjonna282
 
The thinking persons guide to data warehouse design
The thinking persons guide to data warehouse designThe thinking persons guide to data warehouse design
The thinking persons guide to data warehouse design
Calpont
 

Similar to very large database (20)

Data warehouse physical design
Data warehouse physical designData warehouse physical design
Data warehouse physical design
 
Complete first chapter rdbm 17332
Complete first chapter rdbm 17332Complete first chapter rdbm 17332
Complete first chapter rdbm 17332
 
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
 
The High Performance DBA Optimizing Databases For High Performance
The High Performance DBA Optimizing Databases For High PerformanceThe High Performance DBA Optimizing Databases For High Performance
The High Performance DBA Optimizing Databases For High Performance
 
2nd chapter dbms.pptx
2nd chapter dbms.pptx2nd chapter dbms.pptx
2nd chapter dbms.pptx
 
A STUDY ON GRAPH STORAGE DATABASE OF NOSQL
A STUDY ON GRAPH STORAGE DATABASE OF NOSQLA STUDY ON GRAPH STORAGE DATABASE OF NOSQL
A STUDY ON GRAPH STORAGE DATABASE OF NOSQL
 
A Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQLA Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQL
 
A STUDY ON GRAPH STORAGE DATABASE OF NOSQL
A STUDY ON GRAPH STORAGE DATABASE OF NOSQLA STUDY ON GRAPH STORAGE DATABASE OF NOSQL
A STUDY ON GRAPH STORAGE DATABASE OF NOSQL
 
A Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQLA Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQL
 
Implementing the Databese Server session 02
Implementing the Databese Server session 02Implementing the Databese Server session 02
Implementing the Databese Server session 02
 
1414Database DesignDatabase design is the process o.docx
 1414Database DesignDatabase design is the process o.docx 1414Database DesignDatabase design is the process o.docx
1414Database DesignDatabase design is the process o.docx
 
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENTHYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
 
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENTHYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
 
The thinking persons guide to data warehouse design
The thinking persons guide to data warehouse designThe thinking persons guide to data warehouse design
The thinking persons guide to data warehouse design
 
MySQL 8 Server Optimization Swanseacon 2018
MySQL 8 Server Optimization Swanseacon 2018MySQL 8 Server Optimization Swanseacon 2018
MySQL 8 Server Optimization Swanseacon 2018
 
Open Source Datawarehouse
Open Source DatawarehouseOpen Source Datawarehouse
Open Source Datawarehouse
 
MySQL 8 Tips and Tricks from Symfony USA 2018, San Francisco
MySQL 8 Tips and Tricks from Symfony USA 2018, San FranciscoMySQL 8 Tips and Tricks from Symfony USA 2018, San Francisco
MySQL 8 Tips and Tricks from Symfony USA 2018, San Francisco
 
DDBMS Paper with Solution
DDBMS Paper with SolutionDDBMS Paper with Solution
DDBMS Paper with Solution
 
Cassandra data modelling best practices
Cassandra data modelling best practicesCassandra data modelling best practices
Cassandra data modelling best practices
 
Data Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File ManualData Mining And Data Warehousing Laboratory File Manual
Data Mining And Data Warehousing Laboratory File Manual
 

Recently uploaded

Recently uploaded (20)

Operations Management - Book1.p - Dr. Abdulfatah A. Salem
Operations Management - Book1.p  - Dr. Abdulfatah A. SalemOperations Management - Book1.p  - Dr. Abdulfatah A. Salem
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
 
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptxMatatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
 
Salient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptxSalient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptx
 
NCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdfNCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdf
 
Basic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
Basic Civil Engg Notes_Chapter-6_Environment Pollution & EngineeringBasic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
Basic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Danh sách HSG Bộ môn cấp trường - Cấp THPT.pdf
Danh sách HSG Bộ môn cấp trường - Cấp THPT.pdfDanh sách HSG Bộ môn cấp trường - Cấp THPT.pdf
Danh sách HSG Bộ môn cấp trường - Cấp THPT.pdf
 
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptxSolid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
Solid waste management & Types of Basic civil Engineering notes by DJ Sir.pptx
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
 
B.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdfB.ed spl. HI pdusu exam paper-2023-24.pdf
B.ed spl. HI pdusu exam paper-2023-24.pdf
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
 
NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptx
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
 
Forest and Wildlife Resources Class 10 Free Study Material PDF
Forest and Wildlife Resources Class 10 Free Study Material PDFForest and Wildlife Resources Class 10 Free Study Material PDF
Forest and Wildlife Resources Class 10 Free Study Material PDF
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 

very large database

  • 1. In The Name Of God large 1 Kazem Taghandiky University of Isfahan Taghandiky@gmail.com Mobile:09335816144
  • 2. 1. Introduction A very large database, or VLDB, is a database that contains an extremely high number of tuples (database rows), or occupies an extremely large physical file system storage space. The most common definition of VLDB is a database that occupies more than 1 terabyte or contains several billion rows, although naturally this definition changes over time If I updated this list in 3-5 years, all of the fixed measurements would probably increase by a factor of 10X to 100X, maybe more. Probably the most relevant description of a “Large Database” is one that is causing you pain and forcing you to consider other The definition of large database is always changing, options 2 >>>See More
  • 3. 1. Introduction Importan ce VLDB is a premier annual international forum for data management and database researchers, vendors, practitioners, application developers, and users. The conference will feature research talks, tutorials, demonstrations, and workshops. It will cover current issues in data management, database and information systems research. Data management and databases remain among the main technological cornerstones of emerging applications of the twenty-first century 3
  • 4. 1. Introduction The Factors that Define a Large Database Hardware Throughput Data volume Software 4 >>>See More
  • 5. 2. DBMS vs. VLDB 5
  • 7. 3. Challenges Some of the top challenges I hear:  Loading a big database for a takes a long time.  Adding rows to an existing database is slow.  Adding or removing a column takes my table offline for a long time.  Adding an index takes my table offline long time.  Backup is difficult. (Not in this talk.) These activities are painless for small data, but can be painful for big data. 7
  • 8. 3-1. Techniques for challenge Partitioning 1.Partitioning addresses key issues in supporting very large tables and indexes by letting you decompose them into smaller components called partitions. SQL queries and DML statements do not need to be modified in order to access partitioned tables. Each partition of a table or index must have the same logical attributes, such as column names, 1.Data types, and 2.constraints, 8 but each partition can have separate physical attributes such as table spaces.
  • 9. 3-1. Techniques for challenge Overview of Partitioning Methods Range partitioning divides a table into partitions based on a range of values. You can specification for the partitions. 9
  • 10. 3-1. Techniques for challenge Overview of Partitioning Methods List Partitioning this by specifying a list of discrete values for the partitioning key in You do the description for each partition 10
  • 11. 3-1. Techniques for challenge Overview of Partitioning Methods HASH partitioning With this type of partitioning, a partition is selected based on the v expression that operates on column values in rows to be inserted into the table 11
  • 12. 3-1. Techniques for challenge SQL Server : Oracle : Tables can be partitioned into up to 64,000 separate partit 12
  • 13. 3-1. Techniques for challenge Table partitioning delivers advantages in three main areas • performance • maintenance • availability. 13 >>>See More
  • 14. 3-1. Techniques for challenge Data Compression You can use the data compression feature to help compress the data inside a database, and to help reduce the size of the database. In addition to saving space, data compression can help improve performance of I/O intensive workloads because the data is stored in fewer pages and queries need to read fewer pages from disk. However, extra CPU resources are required on the database server to compress and decompress the data, 14
  • 15. 3-1. Techniques for challenge Implementation Row Compression •It reduces the metadata overhead that is associated with the record. This metadata is information about columns, their lengths and offsets. In some cases, the metadata overhead might be larger than the old storage format. •It uses variable-length storage format for numeric types (for example integer, decimal, and float) and the types that are based on numeric (for example date time and money). •It stores fixed character strings by using variable-length format by not storing the blank characters 15
  • 16. 3-1. Techniques for challenge Implementation Page Compression Compressing the leaf level of tables and indexes with page compression consists of three operations in the following order: 1.Row compression 2.Prefix compression 3.Dictionary compression 16
  • 17. 3-1. Techniques for challenge 17
  • 21. Thank You For Your Attention 21