Distributed Database Management System(DDMS)

www.folio3.com
Distributed Databases
Name: Mobeen Ahmed
Designation: Lead Software Engineer

www.folio3.com
www.folio3.com
Agenda
• Introduction to distributed databases
• Distributed DBMS (DDBMS)
• Types of DDBMS
• Issues in Distributed Database Design
• Replication
• Types of Replication
• The Publisher/Subscriber Metaphor
• Publication Limitations
• Push Subscriptions
• Pull Subscriptions
• Hands On Lab

www.folio3.com
www.folio3.com
Why distributed databases?
• Some initial motivations:
– The development of computer networks promotes
decentralization.
– In a company, the database organization might reflect
the organizational structure, which is distributed into
units. Each unit maintains its own database.
• Sharing of data can be achieved by developing a
distributed database system which:
– makes data accessible by all units
– stores data close to where it is most frequently used

www.folio3.com
www.folio3.com
Distributed Database
• A logically interrelated collection of shared data
(and a description of this data), physically
distributed over a computer network.

www.folio3.com
www.folio3.com
Distributed DBMS (DDBMS)
• Software system that permits the management of
the distributed database and makes the
distribution transparent to users.

www.folio3.com
www.folio3.com
Advantages of DDBMSs
• Reflects Organizational Structure
• Improved Sharing and Local Autonomy
• Improved Availability
– A failure does not make the entire system inoperable
• Improved Reliability Data may be replicated
• Improved Performance
– Data are local to the site of “greatest demand”
• Economics
– Many small computers cost less than a big one!
• Modular Growth
– easy to add new modules

www.folio3.com
www.folio3.com
Disadvantages of DDBMSs
• Complexity
• Cost
– Especially in system management
• Security
– network must be made secure
• Integrity Control More Difficult
• Lack of Standards
• Lack of Experience

www.folio3.com
www.folio3.com
Types of DDBMS
• Homogeneous DDBMS
– All sites use same DBMS product (eg. Sql Server or
Oracle)
– Fairly easy to design and manage.
• Heterogeneous DDBMS
– Sites may run different DBMS products (eg. Oracle and
Ingress)
– Possibly different underlying data models (eg. relational
DB and OO database)

www.folio3.com
www.folio3.com
Issues in Distributed Database Design
Three key issues we have to consider:
•Data Allocation:
– where are data placed? Data should be stored at site
with "optimal" distribution.
•Fragmentation:
– relation may be divided into a number of sub-relations
(called fragments) , which are stored in different sites.
•Replication:
– copy of fragment may be maintained at several sites

www.folio3.com
www.folio3.com
Data Allocation
• Four strategies regarding placement of data:
– Centralized
• Consists of single database stored at one site with
users distributed across the network. (This is not a
DDB but distributed processing!!)
– Partitioned (or Fragmented)
• Database partitioned into disjoint fragments, each
fragment assigned to one site.
– Complete Replication
• Consists of maintaining complete copy of database at
each site
– Selective Replication
• Combination of partitioning, replication, and
centralization.

www.folio3.com
www.folio3.com
Fragmentation
• A relation R is divided into fragments r1, r2, …rn,
which contain enough information to allow
reconstruction of R

www.folio3.com
www.folio3.com
Replication
• Replication is the process of copying and
maintaining database objects in multiple
databases that make up a distributed database
system.
• Replication uses a publishing industry metaphor to
represent the components in a replication
topology, which include Publisher, Distributor,
Subscribers, publications, articles, and
subscriptions.
• Replication can improve the performance and
protect the availability of applications because
alternate data access options exist.

www.folio3.com
www.folio3.com
Magazine Metaphor
• A magazine publisher produces one or more
publications
• A publication contains articles
• The publisher either distributes the magazine
directly or uses a distributor
• Subscribers receive publications to which they
have subscribed

www.folio3.com
www.folio3.com
Types of Replication
• Merge replication
• Snapshot replication
• Snapshot replication with updating subscribers
• Transactional replication
• Transactional replication with updating
subscribers

www.folio3.com
www.folio3.com
Merge Replication

www.folio3.com
www.folio3.com
Snapshot Replication

www.folio3.com
www.folio3.com
Snapshot Replication with Updating Subscribers

www.folio3.com
www.folio3.com
Transactional Replication

www.folio3.com
www.folio3.com
Transactional Replication with Updating
Subscribers
• Changes written on subscriber can be moved to
publisher
• Guaranteed transactional consistency
• The change will then be converged with other
updating subscribers and then sent back out to all
the subscription databases
• Example: include low-volume reservation systems.
– Subscriber can look through a schedule of availability
and then attempt to make a reservation. After the
reservation has been scheduled, it can be replicated
within a few minutes to all the other subscription
databases.

www.folio3.com
www.folio3.com
The Publisher/Subscriber Metaphor
• The publisher is the owner of the source database
information. The publisher will make data
available for replication and will send changes to
the published data to the distributor.
• The subscriber database receives copies of the
data (snapshot replication) or transactions held in
the distribution database.
• The distributor receives all changes made to
published data. It then stores the data and
forwards it to subscribers at the appropriate time.
A single distribution server can support multiple
publishers and multiple subscribers at the same
time.

www.folio3.com
www.folio3.com
• Article An individual collection of replicated data
usually associated with a table. Creating an
article from a table allows the administrator to
filter out columns or rows that they want to
exclude from the replication scenario.

www.folio3.com
www.folio3.com

www.folio3.com
www.folio3.com
Publication Limitations
• Tables must have a primary key to ensure
integrity. (The exception is when you are using snapshot replication.)
• You cannot replicate the following databases:
– Master
– model
– msdb
– tempdb
– distribution databases
• Publications might not span multiple databases.
Each publication can contain articles from one
database only.
• IMAGE, TEXT, and NTEXT data have limited
support.

www.folio3.com
www.folio3.com
• When you set up a subscription at the same time
that you create your publications, you are
essentially setting up for a push subscription. This
helps to centralize subscription administration
because the subscription is defined at the
publisher along with the subscribers'
synchronization schedule. All the administration
of the subscription is handled from the publisher.
The data is "pushed" to the subscriber when the
publisher decides to send it.
Push Subscriptions

www.folio3.com
www.folio3.com
• A pull subscription is set up from each individual
subscriber. The subscribers initiate the transfer of
information on a timely basis. This is useful for
applications that can allow for a lower level of
security. The publisher can allow certain
subscribers to pull information, or the publisher
can allow anonymous subscriptions. Pull
subscriptions are also useful in situations in which
there might be a large number of subscribers.
Internet-based solutions are good candidates for
pull subscriptions.
Pull Subscriptions

www.folio3.com
www.folio3.com
What to Publish
• What am I going to publish?
• Do the subscribers receive all the data or just
subsets of my data?
• Should my data be partitioned by region values or
zip codes?
• Should I allow subscribers of my data to send me
updates?
• If I do allow updates, how should they be
implemented?
• Who can have access to my data?
• Are these users online or offline?

www.folio3.com
www.folio3.com
• Are they across the country and connected with
expensive phone lines?
• How often should I synchronize my data with the
subscribers?
• How often do they get changes sent to them?
What to Publish

www.folio3.com
www.folio3.com
Hands On Lab

www.folio3.com
www.folio3.com
End

Distributed Database Management System(DDMS)

More Related Content

What's hot

Viewers also liked

Similar to Distributed Database Management System(DDMS)

Recently uploaded

Distributed Database Management System(DDMS)

Editor's Notes