SlideShare a Scribd company logo
1 of 27
Distributed Database
 Database:- Logical interrelated
collection of shared data, along with
description of data, physically
distributed over a computer network.
• A distributed database (DDB) is a collection of
multiple, logically interrelated databases distributed
over a computer network.
• A distributed database management system (DDBMS)
is the software that manages the DDB and provides an
access mechanism that makes this distribution
transparent to the users
 A DDBMS mainly classified into two
types:
◦ Homogeneous Distributed database
management systems
◦ Heterogeneous Distributed database
management systems
 All sites are interconnected.
 Fragments can be replicated.
 Logically related shared data can be collected.
 Data at each and every site is controlled by the
DBMS.
 Each Distributed Database Management System
takes part in at least one global application.
 Security
 Keeping track of data
 Replicated data management
 System catalog management
 Distributed transaction management
 Distributed database recovery
 In a homogeneous distributed database all sites
have identical software and are aware of each other
and agree to cooperate in processing user requests.
 The homogeneous system is much easier to design
and manage
 The operating system used, at each location must be
same or compatible.
 The database application (or DBMS) used at each
location must be same or compatible.
 In a heterogeneous distributed database different
sites may use different schema and software.
 In heterogeneous systems, different nodes may
have different hardware & software and data
structures at various nodes or locations are also
incompatible.
 Different computers and operating systems,
database applications or data models may be used
at each of the locations.
 On heterogeneous system, translations are required
to allow communication between different sites (or
DBMS).
 The heterogeneous system is often not technically
or economically feasible. In this system, a user at
one location may be able to read but not update the
data at another location.
 Less danger of a single-point failure. When one of
the computers fails, the workload is picked up by
other workstations.
 Data are also distributed at multiple sites.
 The end user is able to access any available copy of
the data, and an end user's request is processed by
any processor at the data location.
 Improved communications. Because local sites are
smaller and located closer to customers.
 Reduced operating costs. It is more cost-effective to
add workstations to a network than to update a
mainframe system.
 Faster data access, faster data processing.
 A distributed database system spreads out the
systems workload by processing data at several
sites.
 Complexity of management and control.
 Applications must recognize data location, and they
must be able to stitch together data from various
sites.
 Security.
 Increased storage and infrastructure requirements.
 Multiple copies of data has to be at different sites,
thus an additional disk storage space will be
required.
 The probability of security lapses increases when
data are located at multiple sites.
 A parallel database system is to improve
performance through parallelization of various
operations, such as loading data, building
indexes and evaluating queries.
 The distribution is solely done on the bases of
performance.
 Parallel databases improve processing and
input/output speeds by using multiple CPUs
and disks in parallel.
 Many operations are performed
simultaneously
 Data may be stored in a distributed fashion.
Characteristics Parallel Database Distributed database
Definition It is a software system
where multiple
processors or machines are
used to
execute and run queries in
parallel.
It is a software system that
manages multiple logically
interrelated databases
distributed over a computer
network.
Geographical
Location
The nodes are located at
geographically
same location.
The nodes are usually
located at geographically
different locations.
Execution
Speed
Quicker Slower
Overhead Less More
Node types Compulsorily Homogeneous Need not be homogeneous
Performance Lower reliability &
availability.
Higher reliability &
availability.
Scope of
Expansion
Difficult to expand Easier to expand
Backup Backup at one site only Backup at multiple sites
Consistency Maintaining consistency is
easier
Maintaining consistency is
difficult.
 Fragmentation is a process of division or the
mapping of the tables based on the columns
and rows of data into the smallest unit of data.
 Data that has broken down is still possible to
be combined again with the intention to
complete the data collection using
fragmentation.
 Fragmentation is a database server feature that
allows you to control where data is stored at
the table level.
 Fragmentation enables you to define groups of
rows or index keys within a table.
 Replication is that we store several copies of a
relation or relation fragment. An entire relation
can be replicated at one or more sites.
 Similarly, one or more fragments of a relation
can be replicated at other sites.
 For example, if a relation R is fragmented into
R1,R2, and R3, there might be just one copy of
R1, whereas R2 is replicated at two other sites
and R3 is replicated at all sites.
The motivation for replication is twofold:
1. Increased Availability of Data: If a site that contains
a replica goes down, we can find the same data at
other sites. Similarly, if local copies of remote
relations are available, we are less vulnerable to
failure of communication links.
2. Faster Query Evaluation: Queries can execute faster
by using a local copy of a relation instead of going
to a remote site.
 In a distributed DBMS, a given transaction is
submitted at some one site, but it can access data at
other sites as well.
 When a transaction is submitted at some site, the
transaction manager at that site breaks it up into a
collection of one or more sub-transactions that
execute at different sites, submits them to
transaction managers at the other sites, and
coordinates their activity.
 Distributed Concurrency Control: How can
locks for objects stored across several sites be
managed?
 Distributed Recovery: Transaction atomicity
must be ensured when a transaction commits,
all its actions, across all the sites at which it
executes, must persist. Similarly, when a
transaction aborts, none of its actions must be
allowed to persist.
 The choice of technique determines which objects
are to be locked. When locks are obtained and
released is determined by the concurrency control
protocol. We now consider how lock and unlock
requests are implemented in a distributed
environment. Lock management can be distributed
across sites in many ways:
 Centralized : A single site is in charge of handling
lock and unlock requests for all objects.
 Primary Copy: One copy of each object is designated
the primary copy. All requests to lock or unlock a
copy of this object are handled by the lock manager
at the site where the primary copy is stored,
regardless of where the copy itself is stored.
 Fully Distributed : Requests to lock or unlock a copy
of an object stored at a site are handled by the lock
manager at the site where the copy is stored.
 Recovery in a distributed DBMS is more
complicated than in a centralized DBMS for the
following reasons:
◦ New kinds of failure can arise : Failure of
communication links and failure of a remote site at
which a sub-transaction is executing.
◦ Either all sub-transactions of a given transaction
must commit or none must commit, and this property
must be guaranteed despite any combination of site
and link failures. This guarantee is achieved using a
commit protocol.
 A lock is used when multiple users need to
access a database concurrently. This prevents
data from being corrupted or invalidated when
multiple users try to write to the database.
 Any single user can only modify those database
records (that is, items in the database) to which
they have applied a lock that gives them exclusive
access to the record until the lock is released.
Locking not only provides exclusivity to write but
also prevents (or controls) reading of unfinished
modifications.
Distributed database

More Related Content

What's hot (20)

Distributed database
Distributed databaseDistributed database
Distributed database
 
Concurrency Control in Database Management System
Concurrency Control in Database Management SystemConcurrency Control in Database Management System
Concurrency Control in Database Management System
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Homogeneous ddbms
Homogeneous ddbmsHomogeneous ddbms
Homogeneous ddbms
 
Transaction processing ppt
Transaction processing pptTransaction processing ppt
Transaction processing ppt
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Distributed databases and dbm ss
Distributed databases and dbm ssDistributed databases and dbm ss
Distributed databases and dbm ss
 
2 phase locking protocol DBMS
2 phase locking protocol DBMS2 phase locking protocol DBMS
2 phase locking protocol DBMS
 
Distributed Database System
Distributed Database SystemDistributed Database System
Distributed Database System
 
Introduction to distributed database
Introduction to distributed databaseIntroduction to distributed database
Introduction to distributed database
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
File organization 1
File organization 1File organization 1
File organization 1
 
Database , 12 Reliability
Database , 12 ReliabilityDatabase , 12 Reliability
Database , 12 Reliability
 
Distributed Database
Distributed DatabaseDistributed Database
Distributed Database
 
Database Management System Introduction
Database Management System IntroductionDatabase Management System Introduction
Database Management System Introduction
 
Distributed data processing
Distributed data processingDistributed data processing
Distributed data processing
 
Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse Architectures
 
Distribution transparency and Distributed transaction
Distribution transparency and Distributed transactionDistribution transparency and Distributed transaction
Distribution transparency and Distributed transaction
 
Introduction to distributed file systems
Introduction to distributed file systemsIntroduction to distributed file systems
Introduction to distributed file systems
 
Data cubes
Data cubesData cubes
Data cubes
 

Similar to Distributed database

Distributed database
Distributed databaseDistributed database
Distributed databasesanjay joshi
 
lecture-13.pptx
lecture-13.pptxlecture-13.pptx
lecture-13.pptxlaiba29012
 
Distributed database management system
Distributed database management  systemDistributed database management  system
Distributed database management systemPooja Dixit
 
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptxSantosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptxSANTOSH KUMAR MEHER
 
Chapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).pptChapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).pptlatigudata
 
Distributed databases
Distributed databasesDistributed databases
Distributed databasesSuneel Dogra
 
Unit-1 Introduction to Big Data.pptx
Unit-1 Introduction to Big Data.pptxUnit-1 Introduction to Big Data.pptx
Unit-1 Introduction to Big Data.pptxAnkitChauhan817826
 
AdvanceDatabaseChapter6Advance Dtabases.pptx
AdvanceDatabaseChapter6Advance Dtabases.pptxAdvanceDatabaseChapter6Advance Dtabases.pptx
AdvanceDatabaseChapter6Advance Dtabases.pptxXanGwaps
 
DBMS - Distributed Databases
DBMS - Distributed DatabasesDBMS - Distributed Databases
DBMS - Distributed DatabasesMythiliMurugan3
 
DFS PPT.pptx
DFS PPT.pptxDFS PPT.pptx
DFS PPT.pptxVMahesh5
 
Lec 8 (distributed database)
Lec 8 (distributed database)Lec 8 (distributed database)
Lec 8 (distributed database)Sudarshan Mondal
 

Similar to Distributed database (20)

Distributed database
Distributed databaseDistributed database
Distributed database
 
DDBMS.pptx
DDBMS.pptxDDBMS.pptx
DDBMS.pptx
 
lecture-13.pptx
lecture-13.pptxlecture-13.pptx
lecture-13.pptx
 
nnnn.pptx
nnnn.pptxnnnn.pptx
nnnn.pptx
 
DBMS.pptx
DBMS.pptxDBMS.pptx
DBMS.pptx
 
Distributed database management system
Distributed database management  systemDistributed database management  system
Distributed database management system
 
Distributed D B
Distributed  D BDistributed  D B
Distributed D B
 
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptxSantosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
 
Chapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).pptChapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).ppt
 
1 ddbms jan 2011_u
1 ddbms jan 2011_u1 ddbms jan 2011_u
1 ddbms jan 2011_u
 
Distributed databases
Distributed databasesDistributed databases
Distributed databases
 
Distributed dbms (ddbms)
Distributed dbms (ddbms)Distributed dbms (ddbms)
Distributed dbms (ddbms)
 
Advance DBMS
Advance DBMSAdvance DBMS
Advance DBMS
 
Unit-1 Introduction to Big Data.pptx
Unit-1 Introduction to Big Data.pptxUnit-1 Introduction to Big Data.pptx
Unit-1 Introduction to Big Data.pptx
 
Dos unit3
Dos unit3Dos unit3
Dos unit3
 
AdvanceDatabaseChapter6Advance Dtabases.pptx
AdvanceDatabaseChapter6Advance Dtabases.pptxAdvanceDatabaseChapter6Advance Dtabases.pptx
AdvanceDatabaseChapter6Advance Dtabases.pptx
 
DBMS - Distributed Databases
DBMS - Distributed DatabasesDBMS - Distributed Databases
DBMS - Distributed Databases
 
DFS PPT.pptx
DFS PPT.pptxDFS PPT.pptx
DFS PPT.pptx
 
INJRV01I10005.pdf
INJRV01I10005.pdfINJRV01I10005.pdf
INJRV01I10005.pdf
 
Lec 8 (distributed database)
Lec 8 (distributed database)Lec 8 (distributed database)
Lec 8 (distributed database)
 

More from sanjay joshi

Array in c language
Array in c languageArray in c language
Array in c languagesanjay joshi
 
Introduction to c programming language
Introduction to c programming languageIntroduction to c programming language
Introduction to c programming languagesanjay joshi
 
Static and dynamic polymorphism
Static and dynamic polymorphismStatic and dynamic polymorphism
Static and dynamic polymorphismsanjay joshi
 
introduction to c programming language
introduction to c programming languageintroduction to c programming language
introduction to c programming languagesanjay joshi
 
Cascading Style Sheets
Cascading Style SheetsCascading Style Sheets
Cascading Style Sheetssanjay joshi
 
Data Structure And Queue
Data Structure And Queue Data Structure And Queue
Data Structure And Queue sanjay joshi
 
Introduction to java
Introduction to javaIntroduction to java
Introduction to javasanjay joshi
 
Static and dynamic polymorphism
Static and dynamic polymorphismStatic and dynamic polymorphism
Static and dynamic polymorphismsanjay joshi
 

More from sanjay joshi (20)

Ccna security
Ccna security Ccna security
Ccna security
 
Array in c language
Array in c languageArray in c language
Array in c language
 
Introduction to c programming language
Introduction to c programming languageIntroduction to c programming language
Introduction to c programming language
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Static and dynamic polymorphism
Static and dynamic polymorphismStatic and dynamic polymorphism
Static and dynamic polymorphism
 
Embeded system
Embeded systemEmbeded system
Embeded system
 
Vb and asp.net
Vb and asp.netVb and asp.net
Vb and asp.net
 
Angular js
Angular jsAngular js
Angular js
 
introduction to c programming language
introduction to c programming languageintroduction to c programming language
introduction to c programming language
 
Oops in php
Oops in phpOops in php
Oops in php
 
Cascading Style Sheets
Cascading Style SheetsCascading Style Sheets
Cascading Style Sheets
 
Css3 responsive
Css3 responsive Css3 responsive
Css3 responsive
 
Html ppt
Html pptHtml ppt
Html ppt
 
Java script
Java scriptJava script
Java script
 
Data Structure And Queue
Data Structure And Queue Data Structure And Queue
Data Structure And Queue
 
Introduction to java
Introduction to javaIntroduction to java
Introduction to java
 
Static and dynamic polymorphism
Static and dynamic polymorphismStatic and dynamic polymorphism
Static and dynamic polymorphism
 
Angularjs
AngularjsAngularjs
Angularjs
 
Visual basic
Visual basicVisual basic
Visual basic
 
Embeded system
Embeded systemEmbeded system
Embeded system
 

Recently uploaded

Quarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayQuarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayMakMakNepo
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationAadityaSharma884161
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxLigayaBacuel1
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Atmosphere science 7 quarter 4 .........
Atmosphere science 7 quarter 4 .........Atmosphere science 7 quarter 4 .........
Atmosphere science 7 quarter 4 .........LeaCamillePacle
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 

Recently uploaded (20)

Quarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayQuarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up Friday
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint Presentation
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Atmosphere science 7 quarter 4 .........
Atmosphere science 7 quarter 4 .........Atmosphere science 7 quarter 4 .........
Atmosphere science 7 quarter 4 .........
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 

Distributed database

  • 1.
  • 2. Distributed Database  Database:- Logical interrelated collection of shared data, along with description of data, physically distributed over a computer network.
  • 3. • A distributed database (DDB) is a collection of multiple, logically interrelated databases distributed over a computer network. • A distributed database management system (DDBMS) is the software that manages the DDB and provides an access mechanism that makes this distribution transparent to the users
  • 4.
  • 5.  A DDBMS mainly classified into two types: ◦ Homogeneous Distributed database management systems ◦ Heterogeneous Distributed database management systems
  • 6.  All sites are interconnected.  Fragments can be replicated.  Logically related shared data can be collected.  Data at each and every site is controlled by the DBMS.  Each Distributed Database Management System takes part in at least one global application.
  • 7.  Security  Keeping track of data  Replicated data management  System catalog management  Distributed transaction management  Distributed database recovery
  • 8.  In a homogeneous distributed database all sites have identical software and are aware of each other and agree to cooperate in processing user requests.  The homogeneous system is much easier to design and manage  The operating system used, at each location must be same or compatible.  The database application (or DBMS) used at each location must be same or compatible.
  • 9.  In a heterogeneous distributed database different sites may use different schema and software.  In heterogeneous systems, different nodes may have different hardware & software and data structures at various nodes or locations are also incompatible.  Different computers and operating systems, database applications or data models may be used at each of the locations.
  • 10.  On heterogeneous system, translations are required to allow communication between different sites (or DBMS).  The heterogeneous system is often not technically or economically feasible. In this system, a user at one location may be able to read but not update the data at another location.
  • 11.  Less danger of a single-point failure. When one of the computers fails, the workload is picked up by other workstations.  Data are also distributed at multiple sites.  The end user is able to access any available copy of the data, and an end user's request is processed by any processor at the data location.
  • 12.  Improved communications. Because local sites are smaller and located closer to customers.  Reduced operating costs. It is more cost-effective to add workstations to a network than to update a mainframe system.  Faster data access, faster data processing.  A distributed database system spreads out the systems workload by processing data at several sites.
  • 13.  Complexity of management and control.  Applications must recognize data location, and they must be able to stitch together data from various sites.  Security.
  • 14.  Increased storage and infrastructure requirements.  Multiple copies of data has to be at different sites, thus an additional disk storage space will be required.  The probability of security lapses increases when data are located at multiple sites.
  • 15.  A parallel database system is to improve performance through parallelization of various operations, such as loading data, building indexes and evaluating queries.  The distribution is solely done on the bases of performance.  Parallel databases improve processing and input/output speeds by using multiple CPUs and disks in parallel.  Many operations are performed simultaneously  Data may be stored in a distributed fashion.
  • 16. Characteristics Parallel Database Distributed database Definition It is a software system where multiple processors or machines are used to execute and run queries in parallel. It is a software system that manages multiple logically interrelated databases distributed over a computer network. Geographical Location The nodes are located at geographically same location. The nodes are usually located at geographically different locations. Execution Speed Quicker Slower Overhead Less More
  • 17. Node types Compulsorily Homogeneous Need not be homogeneous Performance Lower reliability & availability. Higher reliability & availability. Scope of Expansion Difficult to expand Easier to expand Backup Backup at one site only Backup at multiple sites Consistency Maintaining consistency is easier Maintaining consistency is difficult.
  • 18.  Fragmentation is a process of division or the mapping of the tables based on the columns and rows of data into the smallest unit of data.  Data that has broken down is still possible to be combined again with the intention to complete the data collection using fragmentation.  Fragmentation is a database server feature that allows you to control where data is stored at the table level.  Fragmentation enables you to define groups of rows or index keys within a table.
  • 19.  Replication is that we store several copies of a relation or relation fragment. An entire relation can be replicated at one or more sites.  Similarly, one or more fragments of a relation can be replicated at other sites.  For example, if a relation R is fragmented into R1,R2, and R3, there might be just one copy of R1, whereas R2 is replicated at two other sites and R3 is replicated at all sites.
  • 20. The motivation for replication is twofold: 1. Increased Availability of Data: If a site that contains a replica goes down, we can find the same data at other sites. Similarly, if local copies of remote relations are available, we are less vulnerable to failure of communication links. 2. Faster Query Evaluation: Queries can execute faster by using a local copy of a relation instead of going to a remote site.
  • 21.  In a distributed DBMS, a given transaction is submitted at some one site, but it can access data at other sites as well.  When a transaction is submitted at some site, the transaction manager at that site breaks it up into a collection of one or more sub-transactions that execute at different sites, submits them to transaction managers at the other sites, and coordinates their activity.
  • 22.  Distributed Concurrency Control: How can locks for objects stored across several sites be managed?  Distributed Recovery: Transaction atomicity must be ensured when a transaction commits, all its actions, across all the sites at which it executes, must persist. Similarly, when a transaction aborts, none of its actions must be allowed to persist.
  • 23.  The choice of technique determines which objects are to be locked. When locks are obtained and released is determined by the concurrency control protocol. We now consider how lock and unlock requests are implemented in a distributed environment. Lock management can be distributed across sites in many ways:
  • 24.  Centralized : A single site is in charge of handling lock and unlock requests for all objects.  Primary Copy: One copy of each object is designated the primary copy. All requests to lock or unlock a copy of this object are handled by the lock manager at the site where the primary copy is stored, regardless of where the copy itself is stored.  Fully Distributed : Requests to lock or unlock a copy of an object stored at a site are handled by the lock manager at the site where the copy is stored.
  • 25.  Recovery in a distributed DBMS is more complicated than in a centralized DBMS for the following reasons: ◦ New kinds of failure can arise : Failure of communication links and failure of a remote site at which a sub-transaction is executing. ◦ Either all sub-transactions of a given transaction must commit or none must commit, and this property must be guaranteed despite any combination of site and link failures. This guarantee is achieved using a commit protocol.
  • 26.  A lock is used when multiple users need to access a database concurrently. This prevents data from being corrupted or invalidated when multiple users try to write to the database.  Any single user can only modify those database records (that is, items in the database) to which they have applied a lock that gives them exclusive access to the record until the lock is released. Locking not only provides exclusivity to write but also prevents (or controls) reading of unfinished modifications.