Multicore Computer Architecture - Storage and Interconnects
Lecture 5
Block Replacement Techniques & Write Strategy
Dr. John Jose
Assistant Professor
Department of Computer Science & Engineering
Indian Institute of Technology Guwahati, Assam.
Processor Memory Performance Gap
Memory Hierarchy
Four cache memory design choices
Where can a block be placed in the cache?
– Block Placement
How is a block found if it is in cache memory?
– Block Identification
Which block should be replaced on a miss?
– Block Replacement
What happens on a write?
– Write Strategy
Block Replacement
Cache has finite size. What do we do when it is full?
Direct mapped is easy: there is only one candidate line
In a set-associative cache, which block in the selected set should be evicted?
Block Replacement Algorithms
Random
First In First Out (FIFO)
Least Recently Used, pseudo-LRU
Last In First Out (LIFO)
Not Recently Used (NRU)
Least Frequently Used (LFU)
Re-Reference Interval Prediction (RRIP)
Optimal
Random Replacement Policy
Random policy needs a pseudo-random number generator
Overhead is an O(1) amount of work per block replacement
Makes no attempt to exploit temporal or spatial locality
FIFO Replacement Policy
First-In, First-Out (FIFO) evicts the block that has been in
the cache the longest
It requires a queue Q per set to record insertion order
Blocks are enqueued in Q on insertion; a dequeue operation on Q
determines which block to evict.
Overhead is an O(1) amount of work per block replacement
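A sketch of one FIFO-managed set (Python; the class and method names are illustrative, not from any real simulator). Note that a hit does not reorder the queue, which is what distinguishes FIFO from LRU:

```python
from collections import deque

class FIFOSet:
    """One set of a set-associative cache with FIFO replacement.
    A deque records insertion order; hits do NOT reorder it."""
    def __init__(self, ways):
        self.ways = ways
        self.queue = deque()          # oldest block at the left

    def access(self, tag):
        if tag in self.queue:         # hit: FIFO order unchanged
            return True
        if len(self.queue) == self.ways:
            self.queue.popleft()      # evict the longest-resident block
        self.queue.append(tag)        # newest block enters at the right
        return False
```

With a 2-way set and the accesses A, B, A, C, the final miss on C evicts A, even though A was just referenced: FIFO only tracks insertion time, not recency.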
Optimal Replacement Policy
Evict block with longest reuse distance
i.e. next reference to block is farthest in future
Requires knowledge of the future!
Can’t build it, but can model it with trace
Useful, since it reveals opportunity
Optimal beats LRU
Example: for the reference pattern (X,A,B,C,D,X) on a 4-way set, LRU evicts X when D arrives, so the second X misses; optimal evicts a never-reused block and the second X hits
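The effect can be checked with a tiny single-set simulator (a sketch; the function name and structure are made up for illustration):

```python
def misses(trace, ways, policy):
    """Count misses in one fully associative set under 'lru' or 'opt'."""
    cache, n = [], 0
    for i, blk in enumerate(trace):
        if blk in cache:
            if policy == "lru":            # move to MRU position on hit
                cache.remove(blk)
                cache.append(blk)
            continue
        n += 1
        if len(cache) == ways:
            if policy == "lru":
                cache.pop(0)               # head of list is the LRU block
            else:                          # opt: evict farthest next use
                def next_use(b):
                    rest = trace[i + 1:]
                    return rest.index(b) if b in rest else float("inf")
                cache.remove(max(cache, key=next_use))
        cache.append(blk)
    return n

trace = ["X", "A", "B", "C", "D", "X"]
# LRU: 6 misses (the second X misses); OPT: 5 misses (the second X hits)
```

On the miss for D, LRU evicts X (the least recently used), while optimal evicts one of A, B, C because they are never referenced again.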
Least-Recently Used Policy
For associativity a = 2, LRU is equivalent to NMRU (Not Most Recently Used)
Single bit per line indicates LRU/MRU
Set/clear on each access
For a > 2, true LRU is difficult/expensive
Timestamps? How many bits? Must find the minimum timestamp on each eviction
Sorted list? Re-sort on every access?
List overhead: log2(a) bits per block
Shift register implementation
Random vs FIFO vs LRU
Random policy: the old block to evict is chosen at random
FIFO policy: the old block to evict is the one present in the cache longest
(insert times: 8:00am 7:48am 9:05am 7:10am 7:30am 10:10am 8:45am; the 7:10am block is evicted)
LRU policy: the old block to evict is the least recently used
(last used: 7:25am 8:12am 9:22am 6:50am 8:20am 10:02am 9:50am; the 6:50am block is evicted)
LRU Implementation
Recency stack of an 8-line cache (top = LRU, bottom = MRU):

         Initial | Cycle 1     | Cycle 2     | Cycle 3     | Cycle 4
                 | Hit in CL 0 | Hit in CL 4 | Hit in CL 7 | Miss: replace CL 6
LRU ->   4       | 4           | 6           | 6           | 3
         6       | 6           | 3           | 3           | 1
         3       | 3           | 1           | 1           | 5
         1       | 1           | 7           | 5           | 2
         0       | 7           | 5           | 2           | 0
         7       | 5           | 2           | 0           | 4
         5       | 2           | 0           | 4           | 7
MRU ->   2       | 0           | 4           | 7           | 6

On a hit the referenced line moves to the MRU position; on a miss the LRU line (CL 6 in Cycle 4) is refilled and moves to the MRU position.
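The stack updates above can be reproduced directly (a sketch; a miss is modeled as refilling the LRU line, which then becomes MRU):

```python
def lru_update(stack, line, hit):
    """Move 'line' to the MRU (end) position of the recency stack.
    On a miss, the LRU line at the front is the one being replaced."""
    if hit:
        stack.remove(line)
    else:
        assert line == stack.pop(0)   # victim must be the LRU line
    stack.append(line)
    return stack

stack = [4, 6, 3, 1, 0, 7, 5, 2]      # LRU ... MRU, as in the table
for line, hit in [(0, True), (4, True), (7, True), (6, False)]:
    lru_update(stack, line, hit)
# stack is now [3, 1, 5, 2, 0, 4, 7, 6], matching the Cycle 4 column
```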
Practical Pseudo-LRU
[Binary tree diagram: leaves, top (older) to bottom (newer): J, F, C, B, X, Y, A, Z; each internal node stores one bit (0/1) recording which half of its subtree is older]
Rather than true LRU, use binary tree
Each node records which half is older/newer
Update nodes on each reference
Follow older pointers to find LRU victim
Practical Pseudo-LRU
Refs (oldest first): J, Y, X, Z, B, C, F, A
[Tree diagram: following the 'older' bits (path 011) reaches the PLRU block, B; following the 'newer' bits (path 110) reaches the MRU block, A]
Partial order encoded in tree: Z<A, Y<X, B<C, J<F; A>X, C<F; A>F
Practical Pseudo-LRU
Refs (oldest first): J, Y, X, Z, B, C, F, A
[Tree diagram: node bits after the references; path 011 through the 'older' bits reaches the PLRU block B; path 110 through the 'newer' bits reaches the MRU block A]
Binary tree encodes PLRU partial order
At each level point to LRU half of subtree
Each access: flip nodes along path to block
Eviction: follow LRU path
Overhead: (a-1)/a bits per block
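A minimal Python model of the tree (way indices 0-7 stand in for the slide's letters; the class and method names are illustrative):

```python
class TreePLRU:
    """Tree pseudo-LRU for one set of an a-way cache (a = power of 2).
    bits[i] = 0 means the LEFT subtree of node i holds the older half;
    bits[i] = 1 means the RIGHT subtree is older. Nodes are stored as a
    heap-style array: children of node i are 2i+1 and 2i+2."""
    def __init__(self, ways=8):
        self.ways = ways
        self.bits = [0] * (ways - 1)       # (a-1) bits for the whole set

    def access(self, way):
        """Flip the bits along the path so they point AWAY from 'way'."""
        node, lo, hi = 0, 0, self.ways
        while hi - lo > 1:
            mid = (lo + hi) // 2
            if way < mid:                  # accessed block in left half,
                self.bits[node] = 1        # so the right half is now older
                node, hi = 2 * node + 1, mid
            else:                          # accessed block in right half
                self.bits[node] = 0
                node, lo = 2 * node + 2, mid

    def victim(self):
        """Follow the 'older' bits down to the pseudo-LRU way."""
        node, lo, hi = 0, 0, self.ways
        while hi - lo > 1:
            mid = (lo + hi) // 2
            if self.bits[node] == 0:       # left half is older
                node, hi = 2 * node + 1, mid
            else:
                node, lo = 2 * node + 2, mid
        return lo
```

After accessing ways 0 through 7 in order, the victim is way 0, matching true LRU; but one further access to way 0 makes the victim way 4 rather than the true LRU way 1, which is why this is only pseudo-LRU.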
Not Recently Used (NRU)
Keep NRU state in 1 bit/block
The bit is cleared to 0 when the block is installed or re-referenced
The bit is set to 1 when the block is not referenced while another block in the same set is referenced
Evictions favor NRU=1 blocks
If all blocks are NRU=0 (or all NRU=1), pick a victim at random
Provides some scan and thrash resistance by randomizing evictions rather than following strict LRU order
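A sketch of one common NRU formulation (when no NRU=1 victim exists, all bits are aged to 1 and the victim is chosen at random among them; the aging step is one of several variants, and all names are illustrative):

```python
import random

class NRUSet:
    """One cache set with NRU replacement: one hint bit per block,
    where nru = 0 means recently used and nru = 1 means not."""
    def __init__(self, ways, rng=random):
        self.tags = [None] * ways
        self.nru = [1] * ways
        self.rng = rng

    def access(self, tag):
        if tag in self.tags:
            self.nru[self.tags.index(tag)] = 0   # re-referenced
            return True
        if 1 not in self.nru:                    # no candidate: age all
            self.nru = [1] * len(self.tags)
        candidates = [i for i, b in enumerate(self.nru) if b == 1]
        way = self.rng.choice(candidates)        # random among NRU=1
        self.tags[way] = tag
        self.nru[way] = 0                        # installed: recently used
        return False
```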
Re-reference Interval Prediction
RRIP
Extends NRU to multiple bits
Start in the middle
promote on hit
demote over time
Can predict near-immediate, intermediate, and distant re-reference
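A sketch of a 2-bit static RRIP set (the insertion value and counter width vary across designs; the names here are illustrative):

```python
class RRIPSet:
    """One cache set with 2-bit RRIP. The re-reference prediction value
    (RRPV) ranges from 0 (re-reference expected soon) to 3 (expected far
    in the future). New blocks start in the middle, are promoted to 0 on
    a hit, and all blocks are demoted (aged) when no victim exists."""
    MAX_RRPV = 3

    def __init__(self, ways):
        self.tags = [None] * ways
        self.rrpv = [self.MAX_RRPV] * ways

    def access(self, tag):
        if tag in self.tags:
            self.rrpv[self.tags.index(tag)] = 0          # promote on hit
            return True
        while self.MAX_RRPV not in self.rrpv:            # age everyone
            self.rrpv = [v + 1 for v in self.rrpv]       # until a victim
        way = self.rrpv.index(self.MAX_RRPV)             # evict "distant"
        self.tags[way] = tag
        self.rrpv[way] = 2                               # insert mid-range
        return False
```

Because new blocks enter with a high RRPV while hit blocks drop to 0, a block that is re-referenced (like a hot line) survives a scan of blocks that are touched only once.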
Least Frequently Used
Counter per block, incremented on reference
Evictions choose lowest count
Logic is not trivial (on the order of a² comparisons to sort the counters)
Storage overhead: 1 bit per block is the same as NRU; how many bits are helpful?
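A sketch with saturating per-block counters (the counter width is a design choice; the names are illustrative):

```python
class LFUSet:
    """One cache set with LFU replacement: a saturating reference
    counter per block; evict the block with the lowest count."""
    def __init__(self, ways, counter_bits=3):
        self.tags = [None] * ways
        self.count = [0] * ways
        self.max_count = (1 << counter_bits) - 1

    def access(self, tag):
        if tag in self.tags:
            w = self.tags.index(tag)
            self.count[w] = min(self.count[w] + 1, self.max_count)
            return True
        w = self.count.index(min(self.count))   # lowest-count victim
        self.tags[w] = tag
        self.count[w] = 1                       # fresh block: one reference
        return False
```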
Write strategy
Write through: the information is written to both the block in the cache and the block in the next-level memory
Write through advantage: read misses never write back evicted line contents
Write back: the information is written only to the block in the cache; the modified cache block is written to main memory only when it is replaced
Needs a dirty bit per block: is the block clean or dirty?
Write back advantage: repeated writes to a block need no repeated memory writes
What About Write Miss?
Write allocate: the block is loaded into the cache on a write miss
No-write allocate: the block is modified in the next-level memory but not brought into the cache
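The write-back + write-allocate combination can be illustrated with a tiny sketch (the victim here is just the oldest-inserted line; the replacement policy is orthogonal to the write strategy, and all names are hypothetical):

```python
class WriteBackCache:
    """Tiny fully associative write-back, write-allocate cache in front
    of a dict-backed 'memory'. Each line carries a dirty bit."""
    def __init__(self, ways, memory):
        self.ways, self.memory = ways, memory
        self.lines = {}                    # addr -> (value, dirty)

    def _evict_if_full(self):
        if len(self.lines) == self.ways:
            addr, (value, dirty) = next(iter(self.lines.items()))
            if dirty:                      # write back only dirty victims
                self.memory[addr] = value
            del self.lines[addr]

    def write(self, addr, value):
        if addr not in self.lines:         # write miss: allocate the line
            self._evict_if_full()
        self.lines[addr] = (value, True)   # mark dirty; memory untouched

    def read(self, addr):
        if addr not in self.lines:         # read miss: fetch from memory
            self._evict_if_full()
            self.lines[addr] = (self.memory.get(addr, 0), False)
        return self.lines[addr][0]
```

Writing to the cache leaves memory stale until the dirty line is evicted, which is exactly the behavior the dirty bit exists to track; a write-through cache would update memory on every write instead.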
Types of Cache Misses
Compulsory
Very first access to a block
Will occur even in an infinite cache
Capacity
If cache cannot contain all the blocks needed
Would occur even in a fully associative cache of the same size
Conflict
If too many blocks map to the same set
Occurs only in set-associative or direct-mapped caches, not fully associative ones
johnjose@[Link]
[Link]