12/16/20141
I’m:
Iftikhar Ahmad
Roll No: 13
From:
M-C-S (3rd sem)
AWKUM (shankar)2
Presentation Topic
īąDistributed Database & Issues + Challenges
īƒŧ Homogeneous Database
īƒŧ Heterogeneous Database
12/16/20143
Introduction:
A “Distributed Database”: is a logically interrelated collection
of shared data (and a description of this data), physically
distributed over a computer network.
A “Distributed DBMS” (DDBMS): is a Software system that
permits the management of the distributed database and
makes the distribution transparent to users.
Distributed Database
12/16/20144
īļ DDBMS has following characteristics:
o Collection of logically-related shared data.
o Data split into fragments.
o Sites linked by a communication network.
o Data at each site is under control of a DBMS.
o Each DBMS participates in at least one global application.
o Data sharing
o Data communication reliability and costs
o Database recovery
o Transaction and analytic processing
12/16/20145
Characteristics DDB
12/16/20146
Important difference between DDBMS and
distributed processing !
DDBMS Distributed processing of
centralised DBMS
12/16/20147
Why use a DDBMS? (!)
Advantages:
â€ĸ Reflects organizational structure.
â€ĸ Improved share ability.
â€ĸ Improved availability.
â€ĸ Improved reliability.
â€ĸ Improved performance.
â€ĸ Economics.
â€ĸ Modular growth.
Disadvantages:
â€ĸ Complexity.
â€ĸ Cost.
â€ĸ Security.
â€ĸ Integrity control more difficult.
â€ĸ Lack of standards.
â€ĸ Lack of experience.
â€ĸ Database design more complex.
12/16/20148
Types of Distributed Database System
DDBMS
Homogenous Heterogeneous
īļ Homogeneous
īļ Heterogeneous
The prefix “homo” – indicate sameness
ī€Ē All sites use same DBMS product.
â€ĸ
īƒŧIn this type of database has all data center have same
software.
īƒŧMuch easier to design and manage.
īƒŧIt appears to user as a single system.
īƒŧAllows increased performance.
īƒŧMuch easier to design and manage.
12/16/20149
Homogeneous DB
12/16/201410
Same software
Homogeneous DB
ī€Ē Advantages of Homogeneous DB:
īƒŧ Easy to use
īƒŧ Easy to mange
īƒŧ Easy to Design
ī€Ē Disadvantages of Homogeneous DB:
īƒŧ Difficult for most organizations to force a homogeneous environment.
12/16/201411
The prefix “hetero” – indicate difference
ī€Ē Sites may run different DBMS products, underlying data models.
ī€Ē
īƒŧ In this type of database , Different data center may run different
DBMS products.
īƒŧ Occurs when sites have implemented their own databases and
integration is considered later.
â€ĸ Translations required to allow for:
â€ĸ Different hardware.
â€ĸ Different DBMS products.
â€ĸ Different hardware and DBMS products.
12/16/201412
Heterogeneous DB
12/16/201413
Sql oracle
Heterogeneous DB
ī€Ē Advantages of Heterogeneous DB:
īƒŧ Huge data can be stored in one Global center from different data
center.
īƒŧ Remote access is done using the global schema.
īƒŧ Different DBMSs may be used at each node.
ī€Ē Disadvantages of Heterogeneous DB:
īƒŧ Difficult to mange.
īƒŧ Difficult to design. 12/16/201414
īą Problems & their solutions:
The challenges encountered in distributed
database transaction and proffered likely solutions. Few
of the mentioned challenges are; problems of
distribution of resources, search and updating of
resources.
12/16/201415
Problems of Distributed Database
(Transactions)
īƒŧ Distribution of resources problem:
This has to do with equal distribution of resources across
all servers. There are two approaches to solving this problem in a
distributed database environment; decentralize by function, or
decentralize by location.
(a) Decentralize by function: This works well with data that will be
accessed repeatedly by the same users. Example includes putting
manufacturing materials list at the appropriate manufacturing plants
and customer information at sales locations.
(b) Decentralize by location: Occurs when you partition customer
information on a node per region or other geographical based entity
within an organisation as seen in figure.
12/16/201416
12/16/201417
Each telephone company independently implements the
common protocols by the international phone network,
e.g. for dialling and billing as shown in figure 2, the
dialling and billing of customer vary from one country to
another. The only centralize function is the architecture
of these protocols. It is worthy to note that design and
architecture are typically centralize within each
company; while operation and control are delegated to
the operating countries which in turn delegate operation
and maintenance to individuals responsible for
maintaining their own hardware and software.
12/16/201418
īƒŧ SEARCHING:
Another notable problem of distributed database results
from lack of adequate knowledge of the entire database. For example, locating
where file x, y, z was stored. The solution to the challenge is using the concepts of
global & local data.
i. Global data involves information that is common to all sites and shared by all sites.
ii. Local data is information that is strictly meant for an individual site although it is
accessible to all sites.
Information is made available to the whole network by partitioning or replicating
the data files. Partitioning involves splitting a data file into records and then
distributing the records across networks while replication means duplicating
records at more than one location/node. Partitioning only works with local data
while global data are replicated.
Example
i. A typical example is found in bank transactions where servers are located in local
vicinity where customers can transact which often increases the speed of
transaction when traffics are eradicated.
12/16/201419
Advantages of partitioning a network:
i. It makes data available to all remote users
ii. It reduces traffics and increase speed of access
Advantages of replicating a network:
i. Replication improves data availability
ii. Replication improves response time.
12/16/201420
īƒŧ UPDATING:
As discussed above, partitioning of
data is most efficient when the data are kept current that is,
updated regularly. The single copy of each data item makes
updating an efficient process. However. nonlocal “read”
operations are more expensive, making partitioning less
efficient for data that is widely used but updated infrequently.
Replicated data is most efficient when multiple reads of the
data are expected, but updates are not as regular compared to
partitioning process. The data is duplicated at nodes where
high- volume reads are expected, producing high availability and
good response time. When replicated data must be updated,
however, an update to a record at one node should cause an
identical update at all other nodes where that record resides. If
any one replica is unavailable there could be problems.
12/16/201421
A variety of schemes can be employed for updating Replicated
data, even though the copy of the record may be temporarily
unavailable at one or more of the nodes. One technique requires
that a majority of the replicas be read and updated as part of each
transaction, though the definition of “majority” varies with the
application. This scheme has the advantage of tolerating some
nodal unavailability, but it is not practical for either very small or
very large networks. In a very small network of, say, two nodes,
having either node unavailable prevents an update of a majority of
the nodes. In larger networks, delays in completing the update
transaction are proportional to network size: As the network
grows, transactions will take longer to complete.
12/16/201422
īą Future work:
The studies on researching new update
strategies include the experimental distributed database
management system extension by the implementation of
the chosen update methods in the EDDBMS, work out of
the replicated data update method in the VSM
environment and its implementation. The next step will
concentrate on setting the assumptions on the use of this
algorithm in the other distributed programming
environments and on consideration of the possibility of
applying the system for building a data warehouse.
12/16/201423
The presentation has treated the proposed challenges
of distributed database system & basics. However, new
challenges show up daily in transactions. Therefore
there may be need for constant reviewers of these
challenges.
12/16/201424
CONCLUSION
Iftkr
iffi910
12/16/201425

Database 2 ddbms,homogeneous & heterognus adv & disadvan

  • 1.
  • 2.
    I’m: Iftikhar Ahmad Roll No:13 From: M-C-S (3rd sem) AWKUM (shankar)2
  • 3.
    Presentation Topic īąDistributed Database& Issues + Challenges īƒŧ Homogeneous Database īƒŧ Heterogeneous Database 12/16/20143
  • 4.
    Introduction: A “Distributed Database”:is a logically interrelated collection of shared data (and a description of this data), physically distributed over a computer network. A “Distributed DBMS” (DDBMS): is a Software system that permits the management of the distributed database and makes the distribution transparent to users. Distributed Database 12/16/20144
  • 5.
    īļ DDBMS hasfollowing characteristics: o Collection of logically-related shared data. o Data split into fragments. o Sites linked by a communication network. o Data at each site is under control of a DBMS. o Each DBMS participates in at least one global application. o Data sharing o Data communication reliability and costs o Database recovery o Transaction and analytic processing 12/16/20145 Characteristics DDB
  • 6.
    12/16/20146 Important difference betweenDDBMS and distributed processing ! DDBMS Distributed processing of centralised DBMS
  • 7.
    12/16/20147 Why use aDDBMS? (!) Advantages: â€ĸ Reflects organizational structure. â€ĸ Improved share ability. â€ĸ Improved availability. â€ĸ Improved reliability. â€ĸ Improved performance. â€ĸ Economics. â€ĸ Modular growth. Disadvantages: â€ĸ Complexity. â€ĸ Cost. â€ĸ Security. â€ĸ Integrity control more difficult. â€ĸ Lack of standards. â€ĸ Lack of experience. â€ĸ Database design more complex.
  • 8.
    12/16/20148 Types of DistributedDatabase System DDBMS Homogenous Heterogeneous īļ Homogeneous īļ Heterogeneous
  • 9.
    The prefix “homo”– indicate sameness ī€Ē All sites use same DBMS product. â€ĸ īƒŧIn this type of database has all data center have same software. īƒŧMuch easier to design and manage. īƒŧIt appears to user as a single system. īƒŧAllows increased performance. īƒŧMuch easier to design and manage. 12/16/20149 Homogeneous DB
  • 10.
  • 11.
    ī€Ē Advantages ofHomogeneous DB: īƒŧ Easy to use īƒŧ Easy to mange īƒŧ Easy to Design ī€Ē Disadvantages of Homogeneous DB: īƒŧ Difficult for most organizations to force a homogeneous environment. 12/16/201411
  • 12.
    The prefix “hetero”– indicate difference ī€Ē Sites may run different DBMS products, underlying data models. ī€Ē īƒŧ In this type of database , Different data center may run different DBMS products. īƒŧ Occurs when sites have implemented their own databases and integration is considered later. â€ĸ Translations required to allow for: â€ĸ Different hardware. â€ĸ Different DBMS products. â€ĸ Different hardware and DBMS products. 12/16/201412 Heterogeneous DB
  • 13.
  • 14.
    ī€Ē Advantages ofHeterogeneous DB: īƒŧ Huge data can be stored in one Global center from different data center. īƒŧ Remote access is done using the global schema. īƒŧ Different DBMSs may be used at each node. ī€Ē Disadvantages of Heterogeneous DB: īƒŧ Difficult to mange. īƒŧ Difficult to design. 12/16/201414
  • 15.
    īą Problems &their solutions: The challenges encountered in distributed database transaction and proffered likely solutions. Few of the mentioned challenges are; problems of distribution of resources, search and updating of resources. 12/16/201415 Problems of Distributed Database (Transactions)
  • 16.
    īƒŧ Distribution ofresources problem: This has to do with equal distribution of resources across all servers. There are two approaches to solving this problem in a distributed database environment; decentralize by function, or decentralize by location. (a) Decentralize by function: This works well with data that will be accessed repeatedly by the same users. Example includes putting manufacturing materials list at the appropriate manufacturing plants and customer information at sales locations. (b) Decentralize by location: Occurs when you partition customer information on a node per region or other geographical based entity within an organisation as seen in figure. 12/16/201416
  • 17.
  • 18.
    Each telephone companyindependently implements the common protocols by the international phone network, e.g. for dialling and billing as shown in figure 2, the dialling and billing of customer vary from one country to another. The only centralize function is the architecture of these protocols. It is worthy to note that design and architecture are typically centralize within each company; while operation and control are delegated to the operating countries which in turn delegate operation and maintenance to individuals responsible for maintaining their own hardware and software. 12/16/201418
  • 19.
    īƒŧ SEARCHING: Another notableproblem of distributed database results from lack of adequate knowledge of the entire database. For example, locating where file x, y, z was stored. The solution to the challenge is using the concepts of global & local data. i. Global data involves information that is common to all sites and shared by all sites. ii. Local data is information that is strictly meant for an individual site although it is accessible to all sites. Information is made available to the whole network by partitioning or replicating the data files. Partitioning involves splitting a data file into records and then distributing the records across networks while replication means duplicating records at more than one location/node. Partitioning only works with local data while global data are replicated. Example i. A typical example is found in bank transactions where servers are located in local vicinity where customers can transact which often increases the speed of transaction when traffics are eradicated. 12/16/201419
  • 20.
    Advantages of partitioninga network: i. It makes data available to all remote users ii. It reduces traffics and increase speed of access Advantages of replicating a network: i. Replication improves data availability ii. Replication improves response time. 12/16/201420
  • 21.
    īƒŧ UPDATING: As discussedabove, partitioning of data is most efficient when the data are kept current that is, updated regularly. The single copy of each data item makes updating an efficient process. However. nonlocal “read” operations are more expensive, making partitioning less efficient for data that is widely used but updated infrequently. Replicated data is most efficient when multiple reads of the data are expected, but updates are not as regular compared to partitioning process. The data is duplicated at nodes where high- volume reads are expected, producing high availability and good response time. When replicated data must be updated, however, an update to a record at one node should cause an identical update at all other nodes where that record resides. If any one replica is unavailable there could be problems. 12/16/201421
  • 22.
    A variety ofschemes can be employed for updating Replicated data, even though the copy of the record may be temporarily unavailable at one or more of the nodes. One technique requires that a majority of the replicas be read and updated as part of each transaction, though the definition of “majority” varies with the application. This scheme has the advantage of tolerating some nodal unavailability, but it is not practical for either very small or very large networks. In a very small network of, say, two nodes, having either node unavailable prevents an update of a majority of the nodes. In larger networks, delays in completing the update transaction are proportional to network size: As the network grows, transactions will take longer to complete. 12/16/201422
  • 23.
    īą Future work: Thestudies on researching new update strategies include the experimental distributed database management system extension by the implementation of the chosen update methods in the EDDBMS, work out of the replicated data update method in the VSM environment and its implementation. The next step will concentrate on setting the assumptions on the use of this algorithm in the other distributed programming environments and on consideration of the possibility of applying the system for building a data warehouse. 12/16/201423
  • 24.
    The presentation hastreated the proposed challenges of distributed database system & basics. However, new challenges show up daily in transactions. Therefore there may be need for constant reviewers of these challenges. 12/16/201424 CONCLUSION
  • 25.