Network Routing: &: Algorithms
Network Routing: &: Algorithms
5
• c(x,x’) = cost of link (x,x’)
3
v w 5
2 - e.g., c(w,z) = 5
u 2 1 z
3 • cost could always be 1, or
1 2 inversely related to bandwidth,
x 1
y
or inversely related to
congestion
5/9/05 2 CS118/Spring05
Dijkstra’s algorithm
Assume net topology, link costs 1 Initialization:
is known
2 N' = {A}
computes least cost paths from
one node to all other nodes 3 for all nodes v
Create forwarding table for that 4 if v adjacent to A
node 5 then D(v) = c(A,v)
6 else D(v) =
Notation: 7
c(i,j): link cost from node i to j 8 Loop
(∞ if not known) 9 find w not in N' such that D(w) is
D(v): current value of cost of minimum
path from source to dest. V 10 add w to N'
p(v): predecessor node along 11 update D(v) for all v adjacent to w
path from source to v, (neighbor and not in N':
of v) 12 D(v) = min( D(v), D(w) + c(w,v) )
N': set of nodes whose least cost 13 /* new cost to v is either the old
path already known cost, or known shortest path cost to
w plus cost from w to v */
14 until all nodes in N'
5/9/05 3 CS118/Spring05
Dijkstra’s algorithm: example
Step start N' D(B),p(B) D(C),p(C) D(D),p(D) D(E),p(E) D(F),p(F)
0 A 2,A 5,A 1,A infinity infinity
1 AD 2,A 4,D 2,D infinity
2 ADE 2,A 3,E 4,E
3 ADEB 3,E 4,E
4 ADEBC 4,E
5 ADEBCF
5
B 3 C
2 5
A 2 1 F
3
1 2
D 1 E
5/9/05 4 CS118/Spring05
Dijkstra’s algorithm: example
Step start N D(B),p(B) D(C),p(C) D(D),p(D) D(E),p(E) D(F),p(F)
0 A 2,A 5,A 1,A infinity infinity
1 AD 2,A 4,D 2,D infinity
2 ADB 4,D 2,D infinity
3 ADBE 3,E 4,E
4 ADBEC 4,E
5 ADEBCF
Resulting forwarding table at A:
Resulting shortest-path tree for A: destination link
5 B (A, B)
3 D (A, D)
B C 5
2 E (A, D)
A 2 1 F C (A, D)
3
1 2 F (A, D)
D E
1
5/9/05 5 CS118/Spring05
Dijkstra’s algorithm, discussion
Algorithm complexity: n nodes
each iteration: need to check all nodes, w, not in N
n(n+1)/2 comparisons: O(n2)
more efficient implementations possible: O(nlogn)
Oscillations possible:
e.g., link cost = amount of carried traffic
1 A A A A
1+e 2+e 0 0 2+e 2+e 0
D 0 0 B D 1+e 1 B D B D 1+e 1 B
0 0
0 e 0 0 1 1+e 0 e
1
C C C C
1
e
… recompute … recompute … recompute
initially
routing
5/9/05 6 CS118/Spring05
Bellman-Ford Equation
Define: Dx(y) := cost of least-cost path from x to y
Then Dx(y) = min {c(x,v) + Dv(y) }
where min is taken over all neighbors v of x
5/9/05 8 CS118/Spring05
Distance Table: example
1 cost to destination via
B C E Outgoing
7 D() A B D DE link
A 8 2
1 A 1 14 5 A A,1
E D
2
B 7 8 5 B D,5
C 6 9 4 C D,4
D 4 11 2 D D,2
forwarding
table
5/9/05 9 CS118/Spring05
Distance Vector Protocol (2)
5/9/05 10 CS118/Spring05
Distance Vector: an example
Y
2 1
X Z X Z
7 D (Y,Z) = c(X,Z) + minw{D (Y,w)}
= 7+1 = 8
X Y
D (Z,Y) = c(X,Y) + minw {D (Z,w)}
= 2+1 = 3
5/9/05 11 CS118/Spring05
Distance Vector: link cost changes
Link cost changes:
node detects local link cost change
updates distance table (line 15) 1
if cost change in least cost path, notify Y
4 1
neighbors (lines 23,24)
X Z
50
algorithm
“good terminates
news
travels
fast”
5/9/05 12 CS118/Spring05
Distance Vector: link cost changes (2)
Link cost changes:
bad news travels slow - “count to infinity”
problem! 60
Y
4 1
X Z
50
algorithm
continues
on!
5/9/05 13 CS118/Spring05
Distance Vector: poisoned reverse
If Z routes through Y to get to X : 60
Z tells Y its (Z’s) distance to X is Y
4 1
infinite (so Y won’t route to X via Z) X Z
50
algorithm
terminates
5/9/05 17 CS118/Spring05
Routing in the Internet
The Global Internet: a large number of
Autonomous Systems (AS) interconnected with
each other:
Stub AS: end user networks (corporations, campuses)
• Multihomed AS: stub ASes that are connected to multiple
service providers
Transit AS: Internet service provider
Two-level routing hierarchy:
Intra-AS
Inter-AS
5/9/05 18 CS118/Spring05
Internet Hierarchical Routing
Inter-AS border (exterior gateway) routers
Intra-AS
(interior
gateway)
routers autonomous system (AS): a set of routers under the same
administrative domain
Each AS makes its own decision on internal routing
protocol (IGP) to use
All routers in one AS run the same IGP
border routers also run BGP
5/9/05 19 CS118/Spring05
Intra-AS and Inter-AS routing
C.b
Border routers:
B.a • perform inter-AS
A.a routing across AS
b A.c c boundaries
a C a
b • perform intra-AS
a B
d c
routing with other
A b routers in each's own
AS
intra-AS inter-AS network layer
routing routing
protocol protocol
inter-AS, intra-AS link layer
routing in
gateway A.c physical layer
5/9/05 20 CS118/Spring05
Intra-AS and Inter-AS routing
Inter-AS
C.b routing
between B.a
A.a A and B
Host
b A.c c 18.2.4.157
a C a
b
a B
Host-1 d c Intra-AS routing
A b within AS B
Intra-AS routing Forwarding table
within AS A
131.179.0.0 outf-1
18.0.0.0 outf-2
23.0.0.0 outf-2
157.34.128.0 outf-3
222.8.192.0 outf-4
5/9/05 21 CS118/Spring05
Intra-AS Routing:
Interior Gateway Protocols (IGP)
Most commonly used IGPs:
IS-IS:Intermediate System to Intermediate System
Routing protocol
OSPF: Open Shortest Path First
IGRP: Interior Gateway Routing Protocol (Cisco
proprietary)
RIP: Routing Information Protocol
5/9/05 22 CS118/Spring05
RIP ( Routing Information Protocol)
Distance vector algorithm
Distance metric: # of hops (max = 15 hops)
z
w x y
A D B
C
Destination Network Next Router Num. of hops to dest.
w A 2
y B 2
z B 7
x -- 1
…. …. ....
Routing table in D
5/9/05 24 CS118/Spring05
RIP: Example
Dest. distance
w 1 Advertisement
x 1 from A to D
z 4
…. ...
z
w x y
A D B
C
Destination Network Next Router Num. of hops to dest.
w A 2
y B 2
z BA 75
x -- 1
…. …. ....
Routing table in D
5/9/05 25 CS118/Spring05
RIP Implementation
route-d (daemon): an application-level process that
manages RIP routing table and generates periodic RIP
routing updates
Process updates from neighbors
send updates periodically to neighbors (if detect a failure, send
right away)
Keeps the resulting routing table only (not all the updates)
routed routed
Transport Transport
(UDP) (UDP)
network forwarding forwarding network
(IP) table table (IP)
link link
physical physical
5/9/05 26 CS118/Spring05
OSPF (Open Shortest Path First)
A Link State protocol
each node knows its directly connected neighbors & the link
distance to each (link-state)
each node periodically broadcasts its link-state to the entire
network
Link-State Packet: one entry per neighbor router
ID of the node that created the LSP
a list of direct neighbors, with link cost to each
sequence number for this LSP message (SEQ)
time-to-live (TTL) for information carried in this LSP
Use raw IP packet (protocol ID = 89)
5/9/05 27 CS118/Spring05
Building a complete map using Link State
Everyone broadcasts a piece of the topology
Put all the pieces together, you get the complete
map
Then each node carries out its own routing calculation independently
5/9/05 28 CS118/Spring05
Link-State Routing Protocol
The routing daemon running at each node: Builds
and maintains topology map at each node
Stores and forwards most recent LSP from all other
nodes
• decrement TTL of stored LSP; discard info when TTL=0
Compute routes using Dijkstra’s algorithm
generates its own LSP periodically with increasing
SEQ
5/9/05 29 CS118/Spring05
Reliable Flooding of LSP
forward each received LSP to all neighbor nodes
but the one that sent it
each ISP is reliably delivered over each link
use the source-ID and SEQ in a LSP to detect
duplicates
LSPs sent both periodically and event-driven
X A X A X A X A
C B D C B D C B D C B D
5/9/05 30 CS118/Spring05
Advanced features supported by OSPF
Security: all OSPF messages authenticated
Multiple same-cost paths allowed
For each link, multiple cost metrics for different
TOS (eg, satellite link cost set “low” for best
effort; high for real time)
Integrated uni- and multicast support:
Multicast
OSPF (MOSPF) uses same topology data
base as OSPF
Hierarchical OSPF in large domains.
5/9/05 31 CS118/Spring05
Hierarchical OSPF
5/9/05 32 CS118/Spring05
Hierarchical OSPF
Two-level hierarchy: local area, backbone.
Link-state advertisements only in area
each nodes has detailed area topology; only know direction
(shortest path) to nets in other areas.
Area border routers: “summarize” distances to nets in own
area, advertise to other Area Border routers.
Backbone routers: run OSPF routing limited to backbone.
Boundary routers: connect to other AS’s.
5/9/05 33 CS118/Spring05
Inter-AS routing
5/9/05 34 CS118/Spring05
Example:
Forwarding Table in Router d of AS A
Suppose AS A learns from the inter-AS protocol that
subnet x is reachable from AS B (gateway A.c) but not
from AS C.
Inter-AS protocol propagates reachability info to all
internal routers.
Router d determines from intra-AS routing info that its
interface I is on the least cost path to c.
Puts in forwarding table entry (x, I).
5/9/05 35 CS118/Spring05
Choosing among multiple ASes
Now suppose AS1 learns from the inter-AS protocol
that subnet x is reachable from AS3 and from AS2.
To configure forwarding table, router 1d must determine
towards which gateway it should forward packets for
dest x.
This is also the job on inter-AS routing protocol!
Hot potato routing: send packet towards closest of two
routers.
5/9/05 36 CS118/Spring05
Internet inter-AS routing: BGP
BGP (Border Gateway Protocol): the de facto standard
BGP provides each AS a means to:
1. Obtain subnet reachability information from neighboring ASs.
2. Propagate the reachability information to all routers internal to
the AS.
3. Determine “good” routes to subnets based on reachability
information and policy.
Allows a subnet to advertise its existence to rest of the
Internet: “I am here”
5/9/05 37 CS118/Spring05
BGP basics
Pairs of routers (BGP peers) exchange routing info over a
TCP connection: BGP sessions
BGP sessions do not necessarily correspond to physical links.
When AS2 advertises a prefix to AS1, AS2 is promising it
will forward any datagrams destined to that prefix
towards the prefix.
3c
3a 2c
3b AS3 2a
1c 2b
AS2
1a 1b
AS1 1d
eBGP session
iBGP session
5/9/05 38 CS118/Spring05
Distributing reachability info
With eBGP session between 3a and 1c, AS3 sends prefix
reachability info to AS1.
1c can then use iBGP to distribute this new prefix reach info to all
routers in AS1
1b can then re-advertise the new reach info to AS2 over the 1b-to-
2a eBGP session
When router learns about a new prefix, it creates an entry for the
prefix in its forwarding table.
3c P
3a 2c
3b AS3 2a
1c 2b
AS2
1a 1b
AS1 1d eBGP session
5/9/05 40 CS118/Spring05
BGP route selection
Router may learn about more than 1 route to some prefix. Router
must select route.
Elimination rules:
1. Local preference value attribute: policy decision
2. Shortest AS-PATH
3. Closest NEXT-HOP router: hot potato routing
4. Additional criteria
5/9/05 41 CS118/Spring05
BGP messages
5/9/05 42 CS118/Spring05
BGP routing policy
legend: provider
B network
X
W A
customer
C network:
legend: provider
B network
X
W A
customer
C network:
5/9/05 45 CS118/Spring05