0% found this document useful (0 votes)
22 views32 pages

IFS02A1 Week 2

The document discusses data modeling, emphasizing its iterative nature and importance in database design, implementation, and management. It outlines key components of data models, including entities, attributes, relationships, and constraints, and introduces business rules as essential for defining data structures. Additionally, it covers the relational model, entity relationship model, and emerging data models like Big Data and NoSQL, along with the ANSI/SPARC framework for data abstraction levels.

Uploaded by

sibahletanaka999
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views32 pages

IFS02A1 Week 2

The document discusses data modeling, emphasizing its iterative nature and importance in database design, implementation, and management. It outlines key components of data models, including entities, attributes, relationships, and constraints, and introduces business rules as essential for defining data structures. Additionally, it covers the relational model, entity relationship model, and emerging data models like Big Data and NoSQL, along with the ANSI/SPARC framework for data abstraction levels.

Uploaded by

sibahletanaka999
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

INFORMATION SYSTEMS 2A

PRESENTED BY PROF TANIA PRINSLOO

WEEK 2
Database Systems:
Design,
Implementation, and
Management, 14e
Module 2: Data Models

Footnote Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023 Cengage. All Rights 2
Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. 2
Data Modeling and Data Models

Data modeling refers to the process of creating a specific data model for a determined problem domain

Data modeling is an iterative, progressive process

A data model is a relatively simple representation of more complex real-world data structures

Database designers make use of existing data-modeling constructs and powerful database design tools that
diminish the potential for errors in database modeling

Footnote 3
The Importance of Data Models

Data models are a communication tool

Applications are created to manage data and to help transform data into information,
but data is viewed in different ways by different people
A sound data environment requires an overall database blueprint based on an
appropriate data model
When a good database blueprint is available, it does not matter that an applications
programmer’s view of the data is different from that of the manager or the end user
You will be unlikely to create a good database without first creating an appropriate
data model

Footnote 4
Data Model Basic Building Blocks (1 of 2)

An entity is a person, place, thing, concept, or event about which data will be collected
and stored

An attribute is a characteristic of an entity

A relationship describes an association among entities

The following are three different types of relationships:

• One-to-many (1:M or 1..*) relationship


• Many-to-many (M:N or *..*) relationship
• One-to-one (1:1 or 1..1) relationship

Footnote 5
Data Model Basic Building Blocks (2 of 2)

A constraint is a restriction placed on the data

• Constraints help ensure data integrity

Constraints are normally expressed in the form of rules:

• An employee’s salary must have values that are between 6,000 and
350,000
• A student’s final mark must be between 50% and 74% to pass, higher
passes with a distinction, lower fails.
• Each class must have one and only one teacher

Footnote 6
Business Rules

A business rule is a brief, precise, and


They apply to any organization that stores and uses data to
unambiguous description of a policy, procedure, generate information
or principle within a specific organization

Business rules are used to define entities, They must be easy to understand and widely disseminated
attributes, relationships, and constraints

A customer may generate many invoices


Examples of business rules include the following:
An invoice is generated by only one customer

Footnote 7
Discovering Business Rules

The main sources of business rules are company managers, policy makers,
department managers, and written documentation such as company
procedures

Business rules are essential to database design due to the following reasons:
It allows the designer to develop
It allows the designer to appropriate relationship
It helps to standardize the It can be a communication tool It allows the designer to
understand the nature, role, and participation rules and constraints
company’s view of data between users and designers understand business processes
scope of the data and to create an accurate data
model

Footnote 8
Translating Business Rules into Data Model Components

For example, the business rule “a customer may generate many


invoices” contains two nouns and a verb that associates the nouns

Customer and invoice are objects of interest for the


From this business rule, you could environment and should be represented by their
respective entities
deduce the following: There is a generate relationship between customer and
invoice

The rule above is complemented by


the business rule “an invoice is The relationship is one-to-many (1:M)

generated by only one customer”

Footnote 9
Naming Conventions

Entity names should be descriptive of the objects in the business environment


and use technology that is familiar to the users

An attribute name should also be descriptive of the data represented

It is good practice to prefix the name of an attribute with the name or


abbreviation of the entity in which it occurs
• For example, in the CUSTOMER entity, customer’s credit limit may be called CUS_CREDIT_LIMIT

A proper naming convention can help make your model self-documenting

Footnote 10
The Relational Model (1 of 4)

A relation is a two-dimensional structure


The relational model’s foundation is a composed of intersecting rows and columns
mathematical concept known as a
Each row in a relation is called a tuple and each
relation column represents an attribute

The relational data model is implemented The RDBMS performs the same basic functions
through a very sophisticated relational provided by the hierarchical and network DBMS
database management system (RDBMS) systems

The RDBMS manages all of the details, while the users sees a collection of
tables in which the data is stored

Footnote 11
The Relational Model (2 of 4)

Footnote Figure 2.1 Linking Relational Tables 12


The Relational Model (3 of 4)

Footnote Figure 2.2 A Relational Diagram 13


The Relational Model (4 of 4)

• Any SQL-based relational database application involves the following three parts:
• The end user interface – the interface allows the end user to interact with the data
• A collection of tables stored in the database – the tables “present” the data to the end user in a way
that is easy to understand
• SQL engine – the SQL engine executes all queries or data requests

Footnote 14
The Entity Relationship Model (1 of 3)

Complex design activities require conceptual simplicity to yield successful results

Database designers prefer to use a graphical tool


The entity relationship (ER) model (ERM) was developed to do
in which entities and their relationships are just that
pictured

The relational data model and ERM combined to provide the foundation for tightly structured database
design

An entity relationship diagram (ERD) uses graphical representations to model database


components

Footnote 15
The Entity Relationship Model (2 of 3)

The ER model is based on the The following are three ER


following components: notations:
Entity – an entity is represented in the ERD by a Chen notation
rectangle (entity box) Crow’s Foot notation
Attributes – each entity consists of a set of Class diagram notation (part of the Unified
attributes that describes particular Modeling Language (UML))
characteristics of the entity
Relationships – relationships describe
associations among data

Footnote 16
The Entity Relationship Model (3 of 3)

Footnote Figure 2.3 The ER Model Notations 17


Emerging Data Models: Big Data and NoSQL (1 of 3)

Internet of Things (IoT) is a web The IoT has accelerated the rate of data
of Internet-connected devices growth so that about 2.5 quintillion bytes of
exchanging and collecting data data are created daily

Big Data refers to a movement to find new and better ways to manage
large amounts of web- and sensor-generated data and derive business
insight from it

A basic characteristic of Big Data databases can be described as


volume, velocity, and variety, or the 3 Vs

Footnote 18
Emerging Data Models: Big Data and NoSQL (2 of 3)

• Some of the most frequently used Big Data technologies are Hadoop and NoSQL databases:
• Hadoop is a Java-based, open-source, high-speed, fault-tolerant distributed storage and
computational framework
• Hadoop Distributed File System (HDFS) is a highly distributed, fault-tolerant file storage system
designed to manage large amounts of data at high speeds
• MapReduce is an open-source application programming interface (API) that provides fast data
analytics services
• NoSQL is a large-scale distributed database system that stores structured and unstructured data in
efficient ways

Footnote 19
Emerging Data Models: Big Data and NoSQL (3 of 3)

• NoSQL databases have the following general characteristics:


• They are not based on the relational model and SQL
• They support highly distributed database architectures
• They provide high scalability, high availability, and fault tolerance
• They support very large amounts of sparse data
• They are geared toward performance rather than transaction consistency

Footnote 20
Data Models: A Summary

Footnote Figure 2.5 The Evolution of Data Models 21


Degrees of Data Abstraction (1 of 2)

The American National Standards Institute (ANSI) Standards


Planning and Requirements Committee (SPARC) defined a
framework for data modeling based on degrees of data abstraction
The three levels of data abstraction are external, conceptual, and
internal

In Figure 2.6, on the following slide, the ANSI/SPARC framework has


been expanded with the addition of a physical model to explicitly
address physical-level implementation details of the internal model

Footnote 22
Degrees of Data Abstraction (2 of 2)

Footnote Figure 2.6 Data Abstraction Levels 23


The External Model (1 of 3)

The external model is the end users’ view of the data environment

End users usually operate in an environment in which an application has a specific


business unit focus
• End users within those business units view their data subsets as separate from or eternal to other units
within the organization

ER diagrams will be used to represent the external views

A specific representation of an external view is known as an external schema

Footnote 24
The External Model (2 of 3)

Footnote Figure 2.7 External Models for Tiny College 25


The External Model (3 of 3)

• The use of external views that represent subsets of the database has some important advantages:
• It is easy to identify specific data required to support each business unit
• It makes the designer’s job easy by providing feedback about the model’s adequacy
• It helps to ensure security constraints in the database design
• It makes application program development much simpler

Footnote 26
The Conceptual Model (1 of 2)

• The conceptual model represents a global view of the entire database by the entire organization
• Also known as a conceptual schema, it is the basis for the identification and high-level description of
the main data objects
• The most widely used conceptual model is the ER model
• The following are advantages of the conceptual model:
• It provides a bird’s-eye view of the data environment that is easy to understand
• The conceptual model is independent of both software and hardware
• The term logical design refers to creating a conceptual data model

Footnote 27
The Conceptual Model (2 of 2)

Footnote Figure 2.8 Conceptual Model for Tiny College 28


The Internal Model (1 of 2)
The internal model requires a designer to match the conceptual
The internal model is the representation of model’s characteristics and constraints to those of the selected
the database as “seen” by the DBMS implementation model

An internal schema depicts a specific representation of an internal model, using the database
constructs supported by the chosen database

Because the internal model depends on specific database software, it is said to be software
dependent

When you change the internal model without affecting the conceptual model, you have logical
independence

However, the internal model is still hardware independent

Footnote 29
The Internal Model (2 of 2)

Footnote Figure 2.9 Internal Model for Tiny College 30


The Physical Model

The physical model operates at the


lowest level of abstraction, describing
the way data is saved on storage media

The physical model requires the


definition of both the physical storage
devices and the (physical) access
• This means the physical model is both
methods required to reach the data software and hardware dependent
within those storage devices

When you can change the physical


model without affecting the internal
model, you have physical
independence

Footnote 31
Questions???

Footnote 32

You might also like