0% found this document useful (0 votes)

5 views17 pages

Functional Dependency and Normalization

Uploaded by

aditidagar00

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views17 pages

Functional Dependency and Normalization

Uploaded by

aditidagar00

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Functional Dependency

The functional dependency is a relationship that exists between two attributes. It typically
exists between the primary key and non-key attribute within a table.

1. X → Y

The left side of FD is known as a determinant, the right side of the production is known
as a dependent.

For example:

Assume we have an employee table with attributes: Emp_Id, Emp_Name, Emp_Address.

Here Emp_Id attribute can uniquely identify the Emp_Name attribute of employee table
because if we know the Emp_Id, we can tell that employee name associated with it.

Functional dependency can be written as:

Emp_Id → Emp_Name

We can say that Emp_Name is functionally dependent on Emp_Id.

Types of Functional dependency

1. Trivial functional dependency

o A → B has trivial functional dependency if B is a subset of A.

o The following dependencies are also trivial like: A → A, B → B

Example:

1. Consider a table with two columns Employee_Id and Employee_Name.

2. {Employee_id, Employee_Name} → Employee_Id is a trivial functional depende
ncy as
3. Employee_Id is a subset of {Employee_Id, Employee_Name}.
4. Also, Employee_Id → Employee_Id and Employee_Name → Employee_Name ar
e trivial dependencies too.

2. Non-trivial functional dependency

o A → B has a non-trivial functional dependency if B is not a subset of A.

o When A intersection B is NULL, then A → B is called as complete non-trivial.

Example:

1. ID → Name,
2. Name → DOB
Inference Rule (IR):
o The Armstrong's axioms are the basic inference rule.
o Armstrong's axioms are used to conclude functional dependencies on a relational
database.
o The inference rule is a type of assertion. It can apply to a set of FD(functional
dependency) to derive other FD.
o Using the inference rule, we can derive additional functional dependency from the
initial set.

The Functional dependency has 6 types of inference rule:

1. Reflexive Rule (IR1)

In the reflexive rule, if Y is a subset of X, then X determines Y.

If X ⊇ Y then X → Y

Example:

X = {a, b, c, d, e}
Y = {a, b, c}

2. Augmentation Rule (IR2)

The augmentation is also called as a partial dependency. In augmentation, if X determines
Y, then XZ determines YZ for any Z.

If X → Y then XZ → YZ

Example:

For R(ABCD), if A → B then AC → BC

3. Transitive Rule (IR3)

In the transitive rule, if X determines Y and Y determine Z, then X must also determine Z.
If X → Y and Y → Z then X → Z

4. Union Rule (IR4)

Union rule says, if X determines Y and X determines Z, then X must also determine Y and
Z.

If X → Y and X → Z then X → YZ

5. Decomposition Rule (IR5)

Decomposition rule is also known as project rule. It is the reverse of union rule.

This Rule says, if X determines Y and Z, then X determines Y and X determines Z separately.

If X → YZ then X → Y and X → Z

6. Pseudo transitive Rule (IR6)

In Pseudo transitive Rule, if X determines Y and YZ determines W, then XZ determines W.

If X → Y and YZ → W then XZ → W
Normalization
A large database defined as a single relation may result in data duplication. This repetition
of data may result in:

o Making relations very large.

o It isn't easy to maintain and update data as it would involve searching many records
in relation.
o Wastage and poor utilization of disk space and resources.
o The likelihood of errors and inconsistencies increases.

So to handle these problems, we should analyze and decompose the relations with
redundant data into smaller, simpler, and well-structured relations that are satisfy
desirable properties. Normalization is a process of decomposing the relations into
relations with fewer attributes.

What is Normalization?
o Normalization is the process of organizing the data in the database.
o Normalization is used to minimize the redundancy from a relation or set of
relations. It is also used to eliminate undesirable characteristics like Insertion,
Update, and Deletion Anomalies.
o Normalization divides the larger table into smaller and links them using
relationships.
o The normal form is used to reduce redundancy from the database table.

Why do we need Normalization?

The main reason for normalizing the relations is removing these anomalies. Failure to
eliminate anomalies leads to data redundancy and can cause data integrity and other
problems as the database grows. Normalization consists of a series of guidelines that
helps to guide you in creating a good database structure.

• Update anomalies − If data items are scattered and are not linked to each
other properly, then it could lead to strange situations. For example, when
we try to update one data item having its copies scattered over several
places, a few instances get updated properly while a few others are left
with old values. Such instances leave the database in an inconsistent state.
• Deletion anomalies − We tried to delete a record, but parts of it was left
undeleted because of unawareness, the data is also saved somewhere
else.
• Insert anomalies − We tried to insert data in a record that does not exist
at all.

First Normal Form

First Normal Form is defined in the definition of relations (tables) itself. This rule defines
that all the attributes in a relation must have atomic domains. The values in an atomic
domain are indivisible units.

We re-arrange the relation (table) as below, to convert it to First Normal Form.

Each attribute must contain only a single value from its pre-defined domain.

Second Normal Form

Before we learn about the second normal form, we need to understand the following −
• Prime attribute − An attribute, which is a part of the candidate-key, is
known as a prime attribute.
• Non-prime attribute − An attribute, which is not a part of the prime-key, is
said to be a non-prime attribute.
If we follow second normal form, then every non-prime attribute should be fully
functionally dependent on prime key attribute. That is, if X → A holds, then there should
not be any proper subset Y of X, for which Y → A also holds true.

We see here in Student_Project relation that the prime key attributes are Stu_ID and
Proj_ID. According to the rule, non-key attributes, i.e. Stu_Name and Proj_Name must
be dependent upon both and not on any of the prime key attribute individually. But we
find that Stu_Name can be identified by Stu_ID and Proj_Name can be identified by
Proj_ID independently. This is called partial dependency, which is not allowed in
Second Normal Form.

We broke the relation in two as depicted in the above picture. So there exists no partial
dependency.

Third Normal Form

For a relation to be in Third Normal Form, it must be in Second Normal form and the
following must satisfy −

• No non-prime attribute is transitively dependent on prime key attribute.

• For any non-trivial functional dependency, X → A, then either −
o X is a superkey or,

o A is prime attribute.

We find that in the above Student_detail relation, Stu_ID is the key and only prime key
attribute. We find that City can be identified by Stu_ID as well as Zip itself. Neither Zip
is a superkey nor is City a prime attribute. Additionally, Stu_ID → Zip → City, so there
exists transitive dependency.
To bring this relation into third normal form, we break the relation into two relations as
follows −

Boyce-Codd Normal Form

Boyce-Codd Normal Form (BCNF) is an extension of Third Normal Form on strict terms.
BCNF states that −

• For any non-trivial functional dependency, X → A, X must be a super-key.

In the above image, Stu_ID is the super-key in the relation Student_Detail and Zip is the
super-key in the relation ZipCodes. So,
Stu_ID → Stu_Name, Zip
and
Zip → City
Which confirms that both the relations are in BCNF.
Joins
Join is a combination of a Cartesian product followed by a selection process. A Join
operation pairs two tuples from different relations, if and only if a given join condition is
satisfied.
We will briefly describe various join types in the following sections.

Theta (θ) Join

Theta join combines tuples from different relations provided they satisfy the theta
condition. The join condition is denoted by the symbol θ.

Notation
R1 ⋈θ R2
R1 and R2 are relations having attributes (A1, A2, .., An) and (B1, B2,.. ,Bn) such that the
attributes don’t have anything in common, that is R1 ∩ R2 = Φ.
Theta join can use all kinds of comparison operators.

Student

SID Name Std

101 Alex 10

102 Maria 11

Subjects

Class Subject

10 Math

10 English

11 Music

11 Sports

Student_Detail −

STUDENT ⋈Student.Std = Subject.Class SUBJECT

Student_detail

SID Name Std Class Subject

101 Alex 10 10 Math

101 Alex 10 10 English

102 Maria 11 11 Music

102 Maria 11 11 Sports

Equijoin
When Theta join uses only equality comparison operator, it is said to be equijoin. The
above example corresponds to equijoin.

Natural Join (⋈)

Natural join does not use any comparison operator. It does not concatenate the way a
Cartesian product does. We can perform a Natural Join only if there is at least one
common attribute that exists between two relations. In addition, the attributes must
have the same name and domain.
Natural join acts on those matching attributes where the values of attributes in both the
relations are same.

Courses

CID Course Dept

CS01 Database CS

ME01 Mechanics ME

EE01 Electronics EE

HoD

Dept Head
CS Alex

ME Maya

EE Mira

Courses ⋈ HoD

Dept CID Course Head

CS CS01 Database Alex

ME ME01 Mechanics Maya

EE EE01 Electronics Mira

Outer Joins
Theta Join, Equijoin, and Natural Join are called inner joins. An inner join includes only
those tuples with matching attributes and the rest are discarded in the resulting relation.
Therefore, we need to use outer joins to include all the tuples from the participating
relations in the resulting relation. There are three kinds of outer joins − left outer join,
right outer join, and full outer join.

Left Outer Join(R S)

All the tuples from the Left relation, R, are included in the resulting relation. If there are
tuples in R without any matching tuple in the Right relation S, then the S-attributes of
the resulting relation are made NULL.

Left

A B

100 Database

101 Mechanics

102 Electronics

Right
A B

100 Alex

102 Maya

104 Mira

Courses HoD

A B C D

100 Database 100 Alex

101 Mechanics --- ---

102 Electronics 102 Maya

Right Outer Join: ( R S)

All the tuples from the Right relation, S, are included in the resulting relation. If there
are tuples in S without any matching tuple in R, then the R-attributes of resulting relation
are made NULL.

Courses HoD

A B C D

100 Database 100 Alex

102 Electronics 102 Maya

--- --- 104 Mira

Full Outer Join: ( R S)

All the tuples from both participating relations are included in the resulting relation. If
there are no matching tuples for both relations, their respective unmatched attributes
are made NULL.
Courses HoD

A B C D

100 Database 100 Alex

101 Mechanics --- ---

102 Electronics 102 Maya

--- --- 104 Mira

Relational Decomposition
o When a relation in the relational model is not in appropriate normal form then the
decomposition of a relation is required.
o In a database, it breaks the table into multiple tables.
o If the relation has no proper decomposition, then it may lead to problems like loss
of information.
o Decomposition is used to eliminate some of the problems of bad design like
anomalies, inconsistencies, and redundancy.

Types of Decomposition

Lossless Decomposition

o If the information is not lost from the relation that is decomposed, then the
decomposition will be lossless.
o The lossless decomposition guarantees that the join of relations will result in the
same relation as it was decomposed.
o The relation is said to be lossless decomposition if natural joins of all the
decomposition give the original relation.

Example:
EMPLOYEE_DEPARTMENT table:

EMP_ID EMP_NAME EMP_AGE EMP_CITY DEPT_ID DEPT_NAME

22 Denim 28 Mumbai 827 Sales

33 Alina 25 Delhi 438 Marketing

46 Stephan 30 Bangalore 869 Finance

52 Katherine 36 Mumbai 575 Production

60 Jack 40 Noida 678 Testing

The above relation is decomposed into two relations EMPLOYEE and DEPARTMENT

EMPLOYEE table:

EMP_ID EMP_NAME EMP_AGE EMP_CITY

22 Denim 28 Mumbai

33 Alina 25 Delhi

46 Stephan 30 Bangalore

52 Katherine 36 Mumbai

60 Jack 40 Noida

DEPARTMENT table

DEPT_ID EMP_ID DEPT_NAME

827 22 Sales

438 33 Marketing

869 46 Finance
575 52 Production

678 60 Testing

Now, when these two relations are joined on the common column "EMP_ID", then the
resultant relation will look like:

Employee ⋈ Department

EMP_ID EMP_NAME EMP_AGE EMP_CITY DEPT_ID DEPT_NAME

22 Denim 28 Mumbai 827 Sales

33 Alina 25 Delhi 438 Marketing

46 Stephan 30 Bangalore 869 Finance

52 Katherine 36 Mumbai 575 Production

60 Jack 40 Noida 678 Testing

Hence, the decomposition is Lossless join decomposition.

Dependency Preserving

o It is an important constraint of the database.

o In the dependency preservation, at least one decomposed table must satisfy every
dependency.
o If a relation R is decomposed into relation R1 and R2, then the dependencies of R
either must be a part of R1 or R2 or must be derivable from the combination of
functional dependencies of R1 and R2.
o For example, suppose there is a relation R (A, B, C, D) with functional dependency
set (A->BC). The relational R is decomposed into R1(ABC) and R2(AD) which is
dependency preserving because FD A->BC is a part of relation R1(ABC).

Dbms 3rd Unit..
No ratings yet
Dbms 3rd Unit..
51 pages
204 - SQLUnit 3
No ratings yet
204 - SQLUnit 3
11 pages
Unit 3
No ratings yet
Unit 3
23 pages
DBMS Lecture of Unit 3 H
No ratings yet
DBMS Lecture of Unit 3 H
8 pages
Database Normalization PDF
No ratings yet
Database Normalization PDF
3 pages
DBMS Unit 3
No ratings yet
DBMS Unit 3
20 pages
Unit-Iii Normalization Functional Dependency: For Example
No ratings yet
Unit-Iii Normalization Functional Dependency: For Example
18 pages
UNIT-3 Functional Dependency
No ratings yet
UNIT-3 Functional Dependency
30 pages
DBMS Unit 3
No ratings yet
DBMS Unit 3
16 pages
Functional Dependency & Normalization
No ratings yet
Functional Dependency & Normalization
10 pages
Mod 4 DBMS
No ratings yet
Mod 4 DBMS
48 pages
Database Management: Functional Dependencies
No ratings yet
Database Management: Functional Dependencies
12 pages
18CS53 - 2022 - 23 - Module4 - DBMS
No ratings yet
18CS53 - 2022 - 23 - Module4 - DBMS
53 pages
DBMS - Unit - 3 - Chapter - 2 - Relationl Database Design
No ratings yet
DBMS - Unit - 3 - Chapter - 2 - Relationl Database Design
45 pages
DBMS Unit-2
No ratings yet
DBMS Unit-2
39 pages
Unit 4
No ratings yet
Unit 4
33 pages
NORMALISATION
No ratings yet
NORMALISATION
15 pages
Understanding Functional Dependency and Normalization
No ratings yet
Understanding Functional Dependency and Normalization
42 pages
Module 3 Part 1
No ratings yet
Module 3 Part 1
14 pages
Semantics of The Relation Attributes: Each Tuple in A Relation Should Represent One Entity or Relationship Instance
No ratings yet
Semantics of The Relation Attributes: Each Tuple in A Relation Should Represent One Entity or Relationship Instance
36 pages
Dbms Unit III
No ratings yet
Dbms Unit III
14 pages
Unit 3 NEP DBMS
No ratings yet
Unit 3 NEP DBMS
27 pages
NORMALIZATION
No ratings yet
NORMALIZATION
51 pages
Normalization 1
No ratings yet
Normalization 1
10 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
17 pages
Database Design for Students
No ratings yet
Database Design for Students
4 pages
Functional Dependency
No ratings yet
Functional Dependency
17 pages
Mod 2
No ratings yet
Mod 2
79 pages
Dbms Unit III Normalforms
No ratings yet
Dbms Unit III Normalforms
20 pages
Normalization
No ratings yet
Normalization
145 pages
Database Systems Notes1 2
No ratings yet
Database Systems Notes1 2
2 pages
DBMS Unit 3.0 Functional Dependencies
No ratings yet
DBMS Unit 3.0 Functional Dependencies
44 pages
MYSQL DAY - 20 (Normalization)
No ratings yet
MYSQL DAY - 20 (Normalization)
13 pages
Functional Dependency
No ratings yet
Functional Dependency
11 pages
ADBMS Functional Dependency & Normalization
No ratings yet
ADBMS Functional Dependency & Normalization
12 pages
Normalization DBMS
No ratings yet
Normalization DBMS
10 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
12 pages
Mca DBMS3
No ratings yet
Mca DBMS3
19 pages
Unit 3
No ratings yet
Unit 3
42 pages
DBMS Design Guidelines for KTU Students
No ratings yet
DBMS Design Guidelines for KTU Students
4 pages
DBMS Unit-3
No ratings yet
DBMS Unit-3
88 pages
Relational Database Design
No ratings yet
Relational Database Design
52 pages
A Functional Dependency (FD) Is A Constraint Between Two Sets
No ratings yet
A Functional Dependency (FD) Is A Constraint Between Two Sets
2 pages
DBMS 1
No ratings yet
DBMS 1
30 pages
Normalization Unit 4
No ratings yet
Normalization Unit 4
34 pages
Bcs403 Dbms m3 Notes
No ratings yet
Bcs403 Dbms m3 Notes
12 pages
Functional Dependancy and Normalization
No ratings yet
Functional Dependancy and Normalization
33 pages
Presentation 3
No ratings yet
Presentation 3
23 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
56 pages
DBMS Unit 3
No ratings yet
DBMS Unit 3
25 pages
CH-4 DBMS Normalisation
No ratings yet
CH-4 DBMS Normalisation
38 pages
Unit-6 Note
No ratings yet
Unit-6 Note
5 pages
2019-Cpe-27 DBMS Assignment 2
No ratings yet
2019-Cpe-27 DBMS Assignment 2
9 pages
Understanding Functional Dependency in DBMS
No ratings yet
Understanding Functional Dependency in DBMS
45 pages
SQL Modules
No ratings yet
SQL Modules
52 pages
Understanding Database Normalization
No ratings yet
Understanding Database Normalization
44 pages
Unit 3 Dic
No ratings yet
Unit 3 Dic
22 pages
Unit 4 Dic
No ratings yet
Unit 4 Dic
45 pages
Unit 5 Dic
No ratings yet
Unit 5 Dic
37 pages
Unit 2 Dic
No ratings yet
Unit 2 Dic
54 pages
DBMS R Project
No ratings yet
DBMS R Project
13 pages
Latestlog 1
No ratings yet
Latestlog 1
28 pages
WWD Products 25182 AG
No ratings yet
WWD Products 25182 AG
81 pages
C0604238513 MTBL BS6 Diagnostic v3.4.1 Setup SOP V1
No ratings yet
C0604238513 MTBL BS6 Diagnostic v3.4.1 Setup SOP V1
10 pages
DICOM Connection Guide
100% (1)
DICOM Connection Guide
3 pages
Capstone 101
No ratings yet
Capstone 101
14 pages
Riser Bond Model 3300: Metallic Time Domain Reflectometer Cable Fault Locator
No ratings yet
Riser Bond Model 3300: Metallic Time Domain Reflectometer Cable Fault Locator
2 pages
Microsoft OneNote Step by Step
No ratings yet
Microsoft OneNote Step by Step
321 pages
TiVo Stream4k Manual
No ratings yet
TiVo Stream4k Manual
33 pages
CV&SKCK
No ratings yet
CV&SKCK
2 pages
001-2022-1114 DLAPITE02 Course Book
No ratings yet
001-2022-1114 DLAPITE02 Course Book
142 pages
Reet Task Booklet 03
No ratings yet
Reet Task Booklet 03
81 pages
Cobas 6000 Analyzer Series Host Interfac
No ratings yet
Cobas 6000 Analyzer Series Host Interfac
104 pages
USB Driver Upgrade Manual: Revision: 1.000 Date: 3 Aug, 2004
No ratings yet
USB Driver Upgrade Manual: Revision: 1.000 Date: 3 Aug, 2004
14 pages
DWDM Unit1
No ratings yet
DWDM Unit1
93 pages
Foreign Admission Prediction Report
No ratings yet
Foreign Admission Prediction Report
39 pages
Hazard and Operability (Hazop) Guideline
No ratings yet
Hazard and Operability (Hazop) Guideline
34 pages
Odisha Graduate Level Exam 2021 Recruitment
No ratings yet
Odisha Graduate Level Exam 2021 Recruitment
11 pages
B.Sc1Sem2 NEP Exam Result 2022, Mahatma Jyotiba Phule Rohilkhand University, Uttar Pradesh
No ratings yet
B.Sc1Sem2 NEP Exam Result 2022, Mahatma Jyotiba Phule Rohilkhand University, Uttar Pradesh
2 pages
PG Syllabi Vol 02 720 782
No ratings yet
PG Syllabi Vol 02 720 782
63 pages
VAE Molecular Graphs Niloy AAAI19
No ratings yet
VAE Molecular Graphs Niloy AAAI19
8 pages
Content
No ratings yet
Content
8 pages
Omanarp International Journal of Library and Information Science
No ratings yet
Omanarp International Journal of Library and Information Science
10 pages
Revenue Recognition Scenarios Analysis
No ratings yet
Revenue Recognition Scenarios Analysis
1 page
Generating Functions
No ratings yet
Generating Functions
27 pages
Proposal For Uniforms and Event Shirts Business Website
No ratings yet
Proposal For Uniforms and Event Shirts Business Website
3 pages
Copier Error Troubleshooting Guide
No ratings yet
Copier Error Troubleshooting Guide
182 pages
ITS323Y13S1E02 Final Exam Answers
No ratings yet
ITS323Y13S1E02 Final Exam Answers
18 pages
OpenAir Manual
No ratings yet
OpenAir Manual
287 pages
RecyclerView View Unhide Error
No ratings yet
RecyclerView View Unhide Error
2 pages
Tenfoot Polish
No ratings yet
Tenfoot Polish
373 pages

Functional Dependency and Normalization

Uploaded by

Functional Dependency and Normalization

Uploaded by

Functional Dependency

Assume we have an employee table with attributes: Emp_Id, Emp_Name, Emp_Address.

Functional dependency can be written as:

We can say that Emp_Name is functionally dependent on Emp_Id.

Types of Functional dependency

o A → B has trivial functional dependency if B is a subset of A.

1. Consider a table with two columns Employee_Id and Employee_Name.

2. Non-trivial functional dependency

o A → B has a non-trivial functional dependency if B is not a subset of A.

The Functional dependency has 6 types of inference rule:

1. Reflexive Rule (IR1)

2. Augmentation Rule (IR2)

For R(ABCD), if A → B then AC → BC

3. Transitive Rule (IR3)

4. Union Rule (IR4)

5. Decomposition Rule (IR5)

6. Pseudo transitive Rule (IR6)

o Making relations very large.

Why do we need Normalization?

First Normal Form

We re-arrange the relation (table) as below, to convert it to First Normal Form.

Second Normal Form

Third Normal Form

• No non-prime attribute is transitively dependent on prime key attribute.

Boyce-Codd Normal Form

• For any non-trivial functional dependency, X → A, X must be a super-key.

Theta (θ) Join

SID Name Std

STUDENT ⋈Student.Std = Subject.Class SUBJECT

SID Name Std Class Subject

101 Alex 10 10 Math

101 Alex 10 10 English

102 Maria 11 11 Music

102 Maria 11 11 Sports

Natural Join (⋈)

CID Course Dept

Dept CID Course Head

CS CS01 Database Alex

ME ME01 Mechanics Maya

EE EE01 Electronics Mira

Left Outer Join(R S)

100 Database 100 Alex

101 Mechanics --- ---

102 Electronics 102 Maya

Right Outer Join: ( R S)

100 Database 100 Alex

102 Electronics 102 Maya

--- --- 104 Mira

Full Outer Join: ( R S)

100 Database 100 Alex

101 Mechanics --- ---

102 Electronics 102 Maya

--- --- 104 Mira

EMP_ID EMP_NAME EMP_AGE EMP_CITY DEPT_ID DEPT_NAME

22 Denim 28 Mumbai 827 Sales

33 Alina 25 Delhi 438 Marketing

46 Stephan 30 Bangalore 869 Finance

52 Katherine 36 Mumbai 575 Production

60 Jack 40 Noida 678 Testing

EMP_ID EMP_NAME EMP_AGE EMP_CITY

DEPT_ID EMP_ID DEPT_NAME

EMP_ID EMP_NAME EMP_AGE EMP_CITY DEPT_ID DEPT_NAME

22 Denim 28 Mumbai 827 Sales

33 Alina 25 Delhi 438 Marketing

46 Stephan 30 Bangalore 869 Finance

52 Katherine 36 Mumbai 575 Production

60 Jack 40 Noida 678 Testing

Hence, the decomposition is Lossless join decomposition.

o It is an important constraint of the database.

You might also like