Graphs and Algorithms
Scribes: Romil Verma, Juliana Cook (2015), Seth Hildick-Smith (2016), G. Valiant (2017), M. Wootters (2017)
Date: Oct 22, 2019
Adapted from Virginia Williams’ lecture notes
1 Graphs
A graph is a set of vertices and edges connecting those vertices. Formally, we define a graph G as
G = (V, E) where E ⊆ V × V . For ease of analysis, the variables n and m typically stand for the number
of vertices and edges, respectively. Graphs can come in two flavors, directed or undirected. If a graph
is undirected, it must satisfy the property that (i, j) ∈ E iff (j, i) ∈ E (i.e., all edges are bidirectional). In
undirected graphs, m ≤ n(n − 1)/2. In directed graphs, m ≤ n(n − 1). Thus, m = O(n²) and log m = O(log n).
A connected graph is a graph in which for any two nodes u and v there exists a path from u to v. For an
undirected connected graph, m ≥ n − 1. A sparse graph is a graph with few edges (for example, Θ(n) edges)
while a dense graph is a graph with many edges (for example, m = Θ(n²)).
1.1 Representation
A common issue is the topic of how to represent a graph’s edges in memory. There are two standard methods
for this task.
An adjacency matrix uses an arbitrary ordering of the vertices from 1 to |V|. It is an n × n binary
matrix in which the (i, j)th entry is 1 if (i, j) is an edge in the graph and 0 otherwise.
An adjacency list consists of an array A of |V | lists, such that A[u] contains a linked list of vertices v
such that (u, v) ∈ E (the neighbors of u). In the case of a directed graph, it’s also helpful to distinguish
between outgoing and ingoing edges by storing two different lists at A[u]: a list of v such that (u, v) ∈ E
(the out-neighbors of u) as well as a list of v such that (v, u) ∈ E (the in-neighbors of u).
What are the tradeoffs between these two methods? To help our analysis, let deg(v) denote the degree
of v, or the number of vertices connected to v. In a directed graph, we can distinguish between out-degree
and in-degree, which respectively count the number of outgoing and incoming edges.
• The adjacency matrix can check if (i, j) is an edge in G in constant time, whereas the adjacency list
representation must iterate through up to deg(i) list entries.
• The adjacency matrix takes Θ(n²) space, whereas the adjacency list takes Θ(m + n) space.
• The adjacency matrix takes Θ(n) operations to enumerate the neighbors of a vertex v, since it must
iterate across an entire row of the matrix. The adjacency list takes only Θ(deg(v)) time.
What’s a good rule of thumb for picking the implementation? One useful property is the sparsity of the
graph’s edges. If the graph is sparse, and the number of edges is considerably less than the maximum (m ≪ n²),
then the adjacency list is a good idea. If the graph is dense and the number of edges is nearly n², then
the matrix representation makes sense, because it speeds up lookups without too much space overhead. Of
course, some applications will have lots of space to spare, making the matrix feasible no matter the structure
of the graphs. Other applications may prefer adjacency lists even for dense graphs. Choosing the appropriate
structure is a balancing act of requirements and priorities.
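As a concrete illustration, here is a minimal Python sketch of both representations for a small, hypothetical undirected graph; the vertex count and edge list are arbitrary choices for the example:

```python
# Hypothetical example graph: 4 vertices, 4 undirected edges.
n = 4
edges = [(0, 1), (0, 2), (1, 2), (2, 3)]

# Adjacency matrix: Theta(n^2) space, O(1) edge lookup.
matrix = [[0] * n for _ in range(n)]
for u, v in edges:
    matrix[u][v] = 1
    matrix[v][u] = 1  # undirected: store both directions

# Adjacency list: Theta(n + m) space, Theta(deg(v)) neighbor enumeration.
adj = [[] for _ in range(n)]
for u, v in edges:
    adj[u].append(v)
    adj[v].append(u)

print(matrix[1][2])  # edge lookup in O(1): prints 1
print(adj[2])        # neighbors of vertex 2: [0, 1, 3]
```

Note that for the directed case one would drop the symmetric inserts, and could keep two lists per vertex (out-neighbors and in-neighbors) as described above.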
2 Depth-First Search (DFS)
Depth-first search explores the graph by following paths as deeply as it can, backtracking only when it hits a dead end or an already-explored section of the graph. DFS by itself
is fairly simple, so we introduce some augmentations to the basic algorithm.
• To prevent loops, DFS keeps track of a “color” attribute for each vertex. Unvisited vertices are white
by default. Vertices that have been visited but still may be backtracked to are colored gray. Vertices
which are completely processed are colored black. The algorithm can then prevent loops by skipping
non-white vertices.
• Instead of just marking visited vertices, the algorithm also keeps track of the tree generated by the
depth-first traversal. It does so by recording the “parent” of each visited vertex, i.e., the vertex
from which DFS first discovered it.
• The augmented DFS also marks two auto-incrementing timestamps d and f to indicate when a node
was first discovered and finished.
The algorithm takes as input a start vertex s and a starting timestamp t, and returns the timestamp at
which the algorithm finishes. Let N(s) denote the neighbors of s; for a directed graph, let Nout(s) denote
the out-neighbors of s.
Algorithm 1: init(G)
  foreach v ∈ G do
    color(v) ← white
    d(v), f(v) ← ∞
    p(v) ← nil
Algorithm 4: DFS(G) \\ DFS on an entire graph G
  init(G);
  t ← 1;
  foreach v ∈ G do
    if color(v) = white then
      t ← DFS(v, t);
      t++
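The pseudocode above can be rendered in Python roughly as follows. This is a sketch, assuming the graph is given as a dict mapping each vertex to a list of its (out-)neighbors; it also fills in the per-vertex routine DFS(v, t) that Algorithm 4 invokes:

```python
WHITE, GRAY, BLACK = 0, 1, 2

def dfs_visit(G, s, t, color, d, f, p):
    """Visit s with discovery timestamp t; return s's finish timestamp."""
    color[s] = GRAY
    d[s] = t                      # discovery time
    for v in G[s]:
        if color[v] == WHITE:     # skip non-white vertices to avoid loops
            p[v] = s              # s is v's parent in the DFS tree
            t = dfs_visit(G, v, t + 1, color, d, f, p)
    color[s] = BLACK
    f[s] = t + 1                  # finish time
    return f[s]

def dfs(G):
    """Run DFS on the whole graph, restarting from each unvisited vertex."""
    color = {v: WHITE for v in G}
    d = {v: None for v in G}      # discovery timestamps
    f = {v: None for v in G}      # finish timestamps
    p = {v: None for v in G}      # DFS-tree parents
    t = 1
    for v in G:
        if color[v] == WHITE:
            t = dfs_visit(G, v, t, color, d, f, p) + 1
    return d, f, p

# Example on a small hypothetical graph with two components.
d, f, p = dfs({'a': ['b'], 'b': [], 'c': []})
```

Note the recursion depth equals the length of the longest DFS path, so for very deep graphs an explicit stack (or a raised recursion limit) would be needed in practice.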
We mark all of the nodes as unvisited and start at a white node, in our case node a.
From node a we will visit all of a’s children, namely node b.
Node c has two children that we must visit. When we try to visit node a we find that node a has already
been visited (and would be colored gray, as we are in the process of searching a’s children), so we do not
continue searching down that path. We will next search c’s second child, node d.
Since node d has no children, we return back to its parent node, c, and continue to go back up the path
we took, marking nodes with a finish time when we have searched all of their children.
Once we reach our first source node a we find that we have searched all of its children, so we look in the
graph to see if there are any unvisited nodes remaining. For our example, we start with a new source node
e and run DFS to completion.
3 Breadth-First Search (BFS)
BFS(s) computes sets Li, where Li is the set of nodes at distance i from s, as seen in the diagram below.
Algorithm 5: BFS(s)
  Set vis[v] ← false for all v;
  Set Li ← ∅ for i = 1 to n − 1;
  L0 ← {s};
  vis[s] ← true;
  for i = 0 to n − 1 do
    if Li = ∅ then
      exit
    while Li ≠ ∅ do
      u ← Li.pop();
      \\ In the loop below, replace N with Nout for a directed graph.
      foreach x ∈ N(u) do
        if vis[x] is false then
          vis[x] ← true;
          Li+1.insert(x);
          p(x) ← u
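A Python sketch of Algorithm 5, under the assumption that the graph is a dict of adjacency lists, might look like this:

```python
def bfs(G, s):
    """BFS from s. Returns levels, where levels[i] lists the vertices
    at distance i from s, and the BFS-tree parent p of each vertex."""
    n = len(G)
    vis = {v: False for v in G}
    p = {v: None for v in G}
    levels = [[] for _ in range(n)]        # L_0, ..., L_{n-1}
    levels[0] = [s]
    vis[s] = True
    for i in range(n - 1):
        if not levels[i]:                  # no vertices at distance i: done
            break
        for u in levels[i]:
            # For a directed graph, iterate over out-neighbors here.
            for x in G[u]:
                if not vis[x]:             # test the neighbor x, not u
                    vis[x] = True
                    levels[i + 1].append(x)
                    p[x] = u
    return levels, p

# Example on a hypothetical 4-cycle: distances from 0 are 0, 1, 1, 2.
levels, p = bfs({0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2]}, 0)
```

The sketch iterates over each level in place rather than popping from it, but visits vertices in the same level-by-level order as the pseudocode.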
3.2 Correctness
We will now show that BFS correctly computes the shortest path between the source node and all other
nodes in the graph. Recall that Li is the set of nodes that BFS calculates to be distance i from the source
node.
Claim 1. For all i, Li = {x|d(s, x) = i}.
Proof of Claim 1. We will prove this by (strong) induction on i. Base case (i = 0): L0 = {s} = {x | d(s, x) = 0}.
Suppose that Lj = {x|d(s, x) = j} for every j ≤ i (induction hypothesis for i).
We will show two things: (1) if y was added to Li+1 , then d(s, y) = i + 1, and (2) if d(s, y) = i + 1, then
y is added to Li+1 . After proving (1) and (2) we can conclude that Li+1 = {y|d(s, y) = i + 1} and complete
the induction.
Let’s prove (1). First, if y is added to Li+1 , it was added by traversing an edge (x, y) where x ∈ Li , so
that there is a path from s to y taking the shortest path from s to x followed by the edge (x, y), and so
d(s, y) ≤ d(s, x) + 1. Since x ∈ Li, by the induction hypothesis, d(s, x) = i, so that d(s, y) ≤ i + 1. However,
since y ∉ Lj for any j ≤ i, by the induction hypothesis, d(s, y) > i, and so d(s, y) = i + 1.
Let’s prove (2). If d(s, y) = i + 1, then by the inductive hypothesis y ∉ Lj for j ≤ i. Let x be the node
before y on a shortest s → y path P. Since d(s, y) = i + 1, the portion of P from s to x is a shortest
path of length exactly i. Thus, by the induction hypothesis, x ∈ Li. Thus, when x was scanned, edge
(x, y) was scanned as well. If y had not been visited when (x, y) was scanned, then y will be added to Li+1 .
Hence assume that y was visited before (x, y) was scanned. However, since y ∉ Lj for any j ≤ i, y must
have been visited by scanning another edge out of a node in Li, and hence again y is added to Li+1.
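Claim 1 can also be sanity-checked on a concrete (hypothetical) graph: compute the distances d(s, x) independently by Bellman-Ford-style edge relaxation, and verify that each BFS level Li equals {x | d(s, x) = i}:

```python
# Hypothetical test graph (undirected, given as adjacency lists).
G = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2, 4], 4: [3]}
s = 0

# Independent distance computation: relax every edge n - 1 times.
INF = float('inf')
dist = {v: INF for v in G}
dist[s] = 0
for _ in range(len(G) - 1):
    for u in G:
        for x in G[u]:
            dist[x] = min(dist[x], dist[u] + 1)

# BFS levels, as computed by Algorithm 5.
vis = {v: v == s for v in G}
levels = [{s}]
while levels[-1]:
    nxt = {x for u in levels[-1] for x in G[u] if not vis[x]}
    for x in nxt:
        vis[x] = True
    levels.append(nxt)

# Claim 1: level i is exactly the set of vertices at distance i.
for i, Li in enumerate(levels[:-1]):
    assert Li == {x for x in G if dist[x] == i}
```

On this graph the levels come out as {0}, {1, 2}, {3}, {4}, matching distances 0, 1, 2, 3.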