SlideShare a Scribd company logo
1 of 22
DISTRIBUTED
DATABASE
Dr. Kamal Gulati
www.mybigdataanalytics.in
Database
Logical interrelated collection of
shared data, along with description
of data, physically distributed over a
computer network.
What is Distributed Database?
•A distributed database (DDB) is a collection of
multiple, logically interrelated databases
distributed over a computer network.
• A distributed database management system
(DDBMS) is the software that manages the DDB and
provides an access mechanism that makes this
distribution transparent to the users
Characteristics
All sites are interconnected.
Fragments can be replicated.
Logically related shared data can becollected.
Data at each and every site is controlled by the
DBMS.
Each Distributed Database Management System takes
part in at least one global application.
Functionality
1. Security
2. Keeping track of data
3. Replicated data management
4. System catalog management
5. Distributed transaction management
6. Distributed database recovery
A DDBMSmainly classified into two
types:
◦1. Homogeneous Distributed database
management systems
◦2. Heterogeneous Distributed database
management systems
Homogeneous DDBMS
1. In a homogeneous distributed database all sites have identical
software and are aware of each other and agree to cooperate in
processing user requests.
2. The homogeneous system is much easier to design and manage.
3. The operating system used, at each location must be same or
compatible.
4. The database application (or DBMS) used at each location must be
same or compatible.
Heterogeneous DDBMS
1. In a heterogeneous distributed database different sites may use different
schema and software.
2. In heterogeneous systems, different nodes may have different hardware &
software and data structures at various nodes or locations are also
incompatible.
3. Different computers and operating systems, database applications or data
models may be used at each of the locations.
4. On heterogeneous system, translations are required to allow
communication between different sites (or DBMS).
5. The heterogeneous system is often not technically or economically feasible.
In this system, a user at one location may be able to read but not update
the data at another location.
Advantages
1. Less danger of a single-point failure. When one of the computers fails, the
workload is picked up by other workstations.
2. Data are also distributed at multiple sites.
3. The end user is able to access any available copy of the data, and an end
user's request is processed by any processor at the data location.
4. Improved communications. Because local sites are smaller and located
closer to customers.
5. Reduced operating costs. It is more cost- effective to add workstations to a
network than to update a mainframe system.
6. Faster data access, faster data processing.
7. A distributed database system spreads out the systems workload by
processing data at several sites.
Disadvantages
1. Complexity of management and control.
2. Applications must recognize data location, and they must be
able to stitch together data from various sites.
3. Security.
4. Increased storage and infrastructure requirements.
5. Multiple copies of data has to be at different sites, thus an
additional disk storage space will be required.
6. The probability of security lapses increases when data are
located at multiple sites.
What is Parallel database...??
1. A parallel database system is to improve performance through
parallelization of various operations, such as loading data, building
indexes and evaluating queries.
2. The distribution is solely done on the bases of performance.
3. Parallel databases improve processing and input/output speeds
by using multiple CPUs and disks in parallel.
4. Many operations are performed simultaneously
5. Data may be stored in a distributed fashion.
Database and Parallel Database
Characteristics ParallelDatabase Distributed database
Definition It is a software system
where multiple
processors or machinesare used
to
execute and run queries in
parallel.
It is a software system that
manages multiple logically
interrelated databases distributed
over a computer network.
Geographical
Location
The nodes are locatedat
geographically
same location.
The nodes are usually located
at geographically different
locations.
Execution
Speed
Quicker Slower
Overhead Less More
Nodetypes Compulsorily Homogeneous Need not be homogeneous
Performance Lower reliability &
availability.
Higher reliability &
availability.
Scopeof
Expansion
Difficult to expand Easier to expand
Backup Backup at one site only Backup at multiple sites
Consistency Maintaining consistency is easier Maintaining consistency is
difficult.
Data fragmentation
1. Fragmentation is a process of division or the mapping of the
tables based on the columns and rows of data into the smallest
unit of data.
2. Data that has broken down is still possible to be combined again
with the intention to complete the data collection using
fragmentation.
3. Fragmentation is a database server feature that allows you to
control where data is stored at the table level.
4. Fragmentation enables you to define groups of rows or index keys
within a table.
Replication
Replication is that we store severalcopies of a relation or relation
fragment. An entire relation can be replicated at one or more
sites.
Similarly, one or more fragments of a relation can be replicated
at othersites.
For example, if a relation R is fragmented into R1,R2, and R3, there
might be just one copy of R1, whereas R2 is replicated at two other
sites and R3 is replicated at all sites.
Two Fold Replication
The motivation for replication is twofold:
1. Increased Availability of Data:Ifa site that contains a replica
goes down, wecan find the same data at other sites. Similarly,
if local copies of remote relations are available, we are less
vulnerable to failure of communication links.
2. Faster Query Evaluation: Queries can execute faster by using a
local copy of a relation instead of going to a remotesite.
Distributed Transaction
In a distributed DBMS, a given transaction is submitted at some one site, but it can
access data at other sites as well.
When a transaction is submitted at some site, the transaction manager at that
site breaks it up into a collection of one or more sub-transactions that execute at
different sites, submits them to transaction managers at the other sites, and
coordinates their activity.
Distributed Concurrency Control: How can locks for objects stored across several
sites be managed?
Distributed Recovery: Transaction atomicity must be ensured when a transaction
commits, all its actions, across all the sites at which it executes, must persist.
Similarly, when a transaction aborts, none of its actions must be allowed to persist.
Distributed Concurrency Control
• The choice of technique determines which objects are
to be locked. When locks are obtained and released is
determined by the concurrency control protocol. We
now consider how lock and unlock requests are
implemented in a distributed environment. Lock
management can be distributed across sites in many
ways:
Centralized : A single site is in charge of handling
lock and unlock requests for all objects.
Primary Copy: One copy of each object is designated
the primary copy. All requests to lock or unlock a copy
of this object are handled by the lock manager at the
site where the primary copy is stored, regardless of
where the copy itself is stored.
Fully Distributed : Requests to lock or unlock a copy
of an object stored at a site are handled by the lock
manager at the site where the copy is stored.
DISTRIBUTED RECOVERY
Recovery in a distributed DBMS is more complicated than in a
centralized DBMS for the following reasons:
◦ New kinds of failure can arise : Failure of communication
links and failure of a remote site at which a sub-transaction
is executing.
◦ Either all sub-transactions of a given transaction must commit
or none must commit, and this property must be guaranteed
despite any combination of site and link failures. This
guarantee is achieved using a commit protocol.
Concepts Of Locks
A lock is used when multiple users need to access a database
concurrently. This prevents data from being corrupted or invalidated
when multiple users try to write to the database.
Any single user can only modify those database records (that is, items in the
database) to which they have applied a lock that gives them exclusive
access to the record until the lock is released.
Locking not only provides exclusivity to write but also prevents (or controls)
reading of unfinished modifications.

More Related Content

What's hot

Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query Optimization
Ali Usman
 

What's hot (20)

11. dfs
11. dfs11. dfs
11. dfs
 
Network topology And Its Types
Network topology And Its Types Network topology And Its Types
Network topology And Its Types
 
Query Decomposition and data localization
Query Decomposition and data localization Query Decomposition and data localization
Query Decomposition and data localization
 
Distributed system notes unit I
Distributed system notes unit IDistributed system notes unit I
Distributed system notes unit I
 
Chapter 6-Consistency and Replication.ppt
Chapter 6-Consistency and Replication.pptChapter 6-Consistency and Replication.ppt
Chapter 6-Consistency and Replication.ppt
 
Distributed Database Management System
Distributed Database Management SystemDistributed Database Management System
Distributed Database Management System
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query Optimization
 
Operating Systems - Process Synchronization and Deadlocks
Operating Systems - Process Synchronization and DeadlocksOperating Systems - Process Synchronization and Deadlocks
Operating Systems - Process Synchronization and Deadlocks
 
Naming in Distributed System
Naming in Distributed SystemNaming in Distributed System
Naming in Distributed System
 
Computer Networks Unit 1 Introduction and Physical Layer
Computer Networks Unit 1 Introduction and Physical Layer Computer Networks Unit 1 Introduction and Physical Layer
Computer Networks Unit 1 Introduction and Physical Layer
 
Unit 2 data link control
Unit 2 data link controlUnit 2 data link control
Unit 2 data link control
 
Clock synchronization in distributed system
Clock synchronization in distributed systemClock synchronization in distributed system
Clock synchronization in distributed system
 
Process synchronization
Process synchronizationProcess synchronization
Process synchronization
 
Message passing in Distributed Computing Systems
Message passing in Distributed Computing SystemsMessage passing in Distributed Computing Systems
Message passing in Distributed Computing Systems
 
Distributed deadlock
Distributed deadlockDistributed deadlock
Distributed deadlock
 
Parallel Database
Parallel DatabaseParallel Database
Parallel Database
 
Memory management
Memory managementMemory management
Memory management
 
Replication in Distributed Systems
Replication in Distributed SystemsReplication in Distributed Systems
Replication in Distributed Systems
 
Transaction management and concurrency control
Transaction management and concurrency controlTransaction management and concurrency control
Transaction management and concurrency control
 
8. mutual exclusion in Distributed Operating Systems
8. mutual exclusion in Distributed Operating Systems8. mutual exclusion in Distributed Operating Systems
8. mutual exclusion in Distributed Operating Systems
 

Similar to Distributed Database

Chapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).pptChapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).ppt
latigudata
 
Distributed databases
Distributed databasesDistributed databases
Distributed databases
Suneel Dogra
 
AdvanceDatabaseChapter6Advance Dtabases.pptx
AdvanceDatabaseChapter6Advance Dtabases.pptxAdvanceDatabaseChapter6Advance Dtabases.pptx
AdvanceDatabaseChapter6Advance Dtabases.pptx
XanGwaps
 

Similar to Distributed Database (20)

Distributed database
Distributed databaseDistributed database
Distributed database
 
Distributed database
Distributed databaseDistributed database
Distributed database
 
DDBMS.pptx
DDBMS.pptxDDBMS.pptx
DDBMS.pptx
 
lecture-13.pptx
lecture-13.pptxlecture-13.pptx
lecture-13.pptx
 
nnnn.pptx
nnnn.pptxnnnn.pptx
nnnn.pptx
 
DBMS.pptx
DBMS.pptxDBMS.pptx
DBMS.pptx
 
Distributed database management system
Distributed database management  systemDistributed database management  system
Distributed database management system
 
Chapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).pptChapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).ppt
 
Distribution transparency and Distributed transaction
Distribution transparency and Distributed transactionDistribution transparency and Distributed transaction
Distribution transparency and Distributed transaction
 
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptxSantosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
 
Distributed D B
Distributed  D BDistributed  D B
Distributed D B
 
Introduction to distributed database
Introduction to distributed databaseIntroduction to distributed database
Introduction to distributed database
 
Distributed databases
Distributed databasesDistributed databases
Distributed databases
 
AdvanceDatabaseChapter6Advance Dtabases.pptx
AdvanceDatabaseChapter6Advance Dtabases.pptxAdvanceDatabaseChapter6Advance Dtabases.pptx
AdvanceDatabaseChapter6Advance Dtabases.pptx
 
1 ddbms jan 2011_u
1 ddbms jan 2011_u1 ddbms jan 2011_u
1 ddbms jan 2011_u
 
Distributed dbms (ddbms)
Distributed dbms (ddbms)Distributed dbms (ddbms)
Distributed dbms (ddbms)
 
INJRV01I10005.pdf
INJRV01I10005.pdfINJRV01I10005.pdf
INJRV01I10005.pdf
 
Distributed Database Management System.pptx
Distributed Database Management System.pptxDistributed Database Management System.pptx
Distributed Database Management System.pptx
 
ditributed databases
ditributed databasesditributed databases
ditributed databases
 
Unit-1 Introduction to Big Data.pptx
Unit-1 Introduction to Big Data.pptxUnit-1 Introduction to Big Data.pptx
Unit-1 Introduction to Big Data.pptx
 

More from Amity University | FMS - DU | IMT | Stratford University | KKMI International Institute | AIMA | DTU

All About DBMS - Interview Question and Answers
All About DBMS - Interview Question and AnswersAll About DBMS - Interview Question and Answers

More from Amity University | FMS - DU | IMT | Stratford University | KKMI International Institute | AIMA | DTU (20)

All About DBMS - Interview Question and Answers
All About DBMS - Interview Question and AnswersAll About DBMS - Interview Question and Answers
All About DBMS - Interview Question and Answers
 
Concept of Governance - Management of Operational Risk for IT Officers/Execut...
Concept of Governance - Management of Operational Risk for IT Officers/Execut...Concept of Governance - Management of Operational Risk for IT Officers/Execut...
Concept of Governance - Management of Operational Risk for IT Officers/Execut...
 
Emerging Technologies in IT
Emerging Technologies in ITEmerging Technologies in IT
Emerging Technologies in IT
 
Introduction to DBMS - Notes in Layman...
Introduction to DBMS - Notes in Layman...Introduction to DBMS - Notes in Layman...
Introduction to DBMS - Notes in Layman...
 
Fundamentals of DBMS
Fundamentals of DBMSFundamentals of DBMS
Fundamentals of DBMS
 
CASE (Computer Aided Software Design)
CASE (Computer Aided Software Design)CASE (Computer Aided Software Design)
CASE (Computer Aided Software Design)
 
SOFTWARE RELIABILITY AND QUALITY ASSURANCE
SOFTWARE RELIABILITY AND QUALITY ASSURANCESOFTWARE RELIABILITY AND QUALITY ASSURANCE
SOFTWARE RELIABILITY AND QUALITY ASSURANCE
 
Software Testing (Contd..) SDLC Model
Software Testing (Contd..) SDLC ModelSoftware Testing (Contd..) SDLC Model
Software Testing (Contd..) SDLC Model
 
Software Testing - SDLC Model
Software Testing - SDLC ModelSoftware Testing - SDLC Model
Software Testing - SDLC Model
 
Coding - SDLC Model
Coding - SDLC ModelCoding - SDLC Model
Coding - SDLC Model
 
Software Design - SDLC Model
Software Design - SDLC ModelSoftware Design - SDLC Model
Software Design - SDLC Model
 
Models of SDLC (Contd..) & Feasibility Study
Models of SDLC (Contd..)  & Feasibility StudyModels of SDLC (Contd..)  & Feasibility Study
Models of SDLC (Contd..) & Feasibility Study
 
Models of SDLC (Software Development Life Cycle / Program Development Life Cy...
Models of SDLC (Software Development Life Cycle / Program Development Life Cy...Models of SDLC (Software Development Life Cycle / Program Development Life Cy...
Models of SDLC (Software Development Life Cycle / Program Development Life Cy...
 
Introduction to Software Engineering
Introduction to Software EngineeringIntroduction to Software Engineering
Introduction to Software Engineering
 
CLOUD SECURITY IN INSURANCE INDUSTRY WITH RESPECT TO INDIAN MARKET
CLOUD SECURITY IN INSURANCE INDUSTRY WITH RESPECT TO INDIAN MARKETCLOUD SECURITY IN INSURANCE INDUSTRY WITH RESPECT TO INDIAN MARKET
CLOUD SECURITY IN INSURANCE INDUSTRY WITH RESPECT TO INDIAN MARKET
 
Application Software
Application SoftwareApplication Software
Application Software
 
Application Software – Horizontal & Vertical Software
Application Software – Horizontal & Vertical SoftwareApplication Software – Horizontal & Vertical Software
Application Software – Horizontal & Vertical Software
 
Software: Systems and Application Software
Software:  Systems and Application SoftwareSoftware:  Systems and Application Software
Software: Systems and Application Software
 
Programming Languages
Programming LanguagesProgramming Languages
Programming Languages
 
Number Codes and Registers
Number Codes and RegistersNumber Codes and Registers
Number Codes and Registers
 

Recently uploaded

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Recently uploaded (20)

Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 

Distributed Database

  • 2. Database Logical interrelated collection of shared data, along with description of data, physically distributed over a computer network.
  • 3. What is Distributed Database? •A distributed database (DDB) is a collection of multiple, logically interrelated databases distributed over a computer network. • A distributed database management system (DDBMS) is the software that manages the DDB and provides an access mechanism that makes this distribution transparent to the users
  • 4.
  • 5. Characteristics All sites are interconnected. Fragments can be replicated. Logically related shared data can becollected. Data at each and every site is controlled by the DBMS. Each Distributed Database Management System takes part in at least one global application.
  • 6. Functionality 1. Security 2. Keeping track of data 3. Replicated data management 4. System catalog management 5. Distributed transaction management 6. Distributed database recovery
  • 7. A DDBMSmainly classified into two types: ◦1. Homogeneous Distributed database management systems ◦2. Heterogeneous Distributed database management systems
  • 8. Homogeneous DDBMS 1. In a homogeneous distributed database all sites have identical software and are aware of each other and agree to cooperate in processing user requests. 2. The homogeneous system is much easier to design and manage. 3. The operating system used, at each location must be same or compatible. 4. The database application (or DBMS) used at each location must be same or compatible.
  • 9. Heterogeneous DDBMS 1. In a heterogeneous distributed database different sites may use different schema and software. 2. In heterogeneous systems, different nodes may have different hardware & software and data structures at various nodes or locations are also incompatible. 3. Different computers and operating systems, database applications or data models may be used at each of the locations. 4. On heterogeneous system, translations are required to allow communication between different sites (or DBMS). 5. The heterogeneous system is often not technically or economically feasible. In this system, a user at one location may be able to read but not update the data at another location.
  • 10. Advantages 1. Less danger of a single-point failure. When one of the computers fails, the workload is picked up by other workstations. 2. Data are also distributed at multiple sites. 3. The end user is able to access any available copy of the data, and an end user's request is processed by any processor at the data location. 4. Improved communications. Because local sites are smaller and located closer to customers. 5. Reduced operating costs. It is more cost- effective to add workstations to a network than to update a mainframe system. 6. Faster data access, faster data processing. 7. A distributed database system spreads out the systems workload by processing data at several sites.
  • 11. Disadvantages 1. Complexity of management and control. 2. Applications must recognize data location, and they must be able to stitch together data from various sites. 3. Security. 4. Increased storage and infrastructure requirements. 5. Multiple copies of data has to be at different sites, thus an additional disk storage space will be required. 6. The probability of security lapses increases when data are located at multiple sites.
  • 12. What is Parallel database...?? 1. A parallel database system is to improve performance through parallelization of various operations, such as loading data, building indexes and evaluating queries. 2. The distribution is solely done on the bases of performance. 3. Parallel databases improve processing and input/output speeds by using multiple CPUs and disks in parallel. 4. Many operations are performed simultaneously 5. Data may be stored in a distributed fashion.
  • 13. Database and Parallel Database Characteristics ParallelDatabase Distributed database Definition It is a software system where multiple processors or machinesare used to execute and run queries in parallel. It is a software system that manages multiple logically interrelated databases distributed over a computer network. Geographical Location The nodes are locatedat geographically same location. The nodes are usually located at geographically different locations. Execution Speed Quicker Slower Overhead Less More
  • 14. Nodetypes Compulsorily Homogeneous Need not be homogeneous Performance Lower reliability & availability. Higher reliability & availability. Scopeof Expansion Difficult to expand Easier to expand Backup Backup at one site only Backup at multiple sites Consistency Maintaining consistency is easier Maintaining consistency is difficult.
  • 15. Data fragmentation 1. Fragmentation is a process of division or the mapping of the tables based on the columns and rows of data into the smallest unit of data. 2. Data that has broken down is still possible to be combined again with the intention to complete the data collection using fragmentation. 3. Fragmentation is a database server feature that allows you to control where data is stored at the table level. 4. Fragmentation enables you to define groups of rows or index keys within a table.
  • 16. Replication Replication is that we store severalcopies of a relation or relation fragment. An entire relation can be replicated at one or more sites. Similarly, one or more fragments of a relation can be replicated at othersites. For example, if a relation R is fragmented into R1,R2, and R3, there might be just one copy of R1, whereas R2 is replicated at two other sites and R3 is replicated at all sites.
  • 17. Two Fold Replication The motivation for replication is twofold: 1. Increased Availability of Data:Ifa site that contains a replica goes down, wecan find the same data at other sites. Similarly, if local copies of remote relations are available, we are less vulnerable to failure of communication links. 2. Faster Query Evaluation: Queries can execute faster by using a local copy of a relation instead of going to a remotesite.
  • 18. Distributed Transaction In a distributed DBMS, a given transaction is submitted at some one site, but it can access data at other sites as well. When a transaction is submitted at some site, the transaction manager at that site breaks it up into a collection of one or more sub-transactions that execute at different sites, submits them to transaction managers at the other sites, and coordinates their activity. Distributed Concurrency Control: How can locks for objects stored across several sites be managed? Distributed Recovery: Transaction atomicity must be ensured when a transaction commits, all its actions, across all the sites at which it executes, must persist. Similarly, when a transaction aborts, none of its actions must be allowed to persist.
  • 19. Distributed Concurrency Control • The choice of technique determines which objects are to be locked. When locks are obtained and released is determined by the concurrency control protocol. We now consider how lock and unlock requests are implemented in a distributed environment. Lock management can be distributed across sites in many ways:
  • 20. Centralized : A single site is in charge of handling lock and unlock requests for all objects. Primary Copy: One copy of each object is designated the primary copy. All requests to lock or unlock a copy of this object are handled by the lock manager at the site where the primary copy is stored, regardless of where the copy itself is stored. Fully Distributed : Requests to lock or unlock a copy of an object stored at a site are handled by the lock manager at the site where the copy is stored.
  • 21. DISTRIBUTED RECOVERY Recovery in a distributed DBMS is more complicated than in a centralized DBMS for the following reasons: ◦ New kinds of failure can arise : Failure of communication links and failure of a remote site at which a sub-transaction is executing. ◦ Either all sub-transactions of a given transaction must commit or none must commit, and this property must be guaranteed despite any combination of site and link failures. This guarantee is achieved using a commit protocol.
  • 22. Concepts Of Locks A lock is used when multiple users need to access a database concurrently. This prevents data from being corrupted or invalidated when multiple users try to write to the database. Any single user can only modify those database records (that is, items in the database) to which they have applied a lock that gives them exclusive access to the record until the lock is released. Locking not only provides exclusivity to write but also prevents (or controls) reading of unfinished modifications.