Chapter - 7 Distributed Database System
Chapter - 7 Distributed Database System
1
Outline
1 Distributed Database Concepts
4 Query Processing
Site 5
Site 1
Site 3 Site 2
Con…
− Network transparency: Users do not have to worry about operational details of
the network.
− Location transparency: refers to freedom of issuing command from any location
without affecting its working.
− Naming transparency: allows access to any named object (files, relations, etc.)
from any location.
− Replication transparency: Allows to store copies of a data at multiple sites.
relation).
Advantages DDS
2. Increase reliability and availability:
− Reliability refers to system live time, that is, system is running efficiently most
of the time.
− Availability is the probability that the system is continuously available (usable
or accessible) during a time interval.
− A distributed database system has multiple nodes (computers) and if one fails
then others are available to do the job.
3. Improved performance:
− A DDBMS fragments the database to keep data closer to where it is needed
most.
− This reduces data management (access and modification) time significantly.
4. Easier expansion (scalability):
− Allows new nodes (computers) to be added anytime without chaining the entire
configuration.
Disadvantages DDS
– Complexity
– Cost
– Security
– Lack of standards
– Lack of experience
Network
Object DBMS
Oriented Site 3 Site 2 Relational
Linux Linux
Heterogeneous
Advantages
Huge data can be stored in one Global center from different data center
Remote access is done using the global schema.
Different DBMSs may be used at each node
Disadvantages
Difficult to mange
Difficult to design.
.
Federated Database Management Systems
− Client/Server architecture.
– Fully distributed DBMS with support for multiple DPs and
TPs at multiple sites.
– Homogeneous DDMS
Integrate only one type of centralized DBMS over the network.
– Heterogeneous DDBMS
Integrate different types of centralized DBMSs over a network.
Distributed DB Transparency
– Distribution transparency
– Transaction transparency
– Failure transparency
– Performance transparency
– Heterogeneity transparency
Distribution Transparency
• Distribution transparency allows us to manage a physically
dispersed database as though it were a centralized database.
– Fragmentation transparency
– Location transparency
• Related Concepts:
– Remote Requests
– Remote Transactions
– Distributed Transactions
– Distributed Requests
A Remote Request
Allows us to access data to be processed by a single remote database
processor.
A Remote Transaction
Composed of several requests, may access data at only a single
site.
Allows a transaction to reference several different (local or remote) DP
sites.
A Distributed Request
Reference data from several remote DP sites.
Allows a single request to reference a physically partitioned table.
Example2:
Distributed Request
Transaction Transparency
Two-Phase Commit Protocol
DO performs the operation and records the “before” and “after” values
in the transaction log.
UNDO reverses an operation, using the log entries written by the DO
portion of the sequence.
REDO redoes an operation, using the log entries written by DO
portion of the sequence.
Horizontal fragmentation
Vertical fragmentation
Mixed fragmentation
Data Fragmentation
Horizontal Fragmentation - Consists of a subset of the tuples
of a relation.
Fragment represents the equivalent of a SELECT statement, with
the WHERE clause on a single attribute.
Data Fragmentation
Vertical fragment Consists of a subset of the attributes of a
relation.
Equivalent to the PROJECT statement.
Data Fragmentation
Site 1
Client/Server vs. DDBMS
• Client/server architecture refers to the way in which computers
interact to form a system.
Reference architecture for a DDBMS
Questions ?