Lecture 8 (Cont.): Cache Memory

William Stallings

Computer Organization
and Architecture
8th Edition
Chapter 4
Cont. Cache Memory

Book: Computer Organization and Architecture, 8th Edition, by William Stallings


Original Slides by : Adrian J Pullin
Cont. Cache Memory
Lecture Outcomes
Understanding of:
• Replacement Algorithms
• Write Policy
• Cache Performance
• Locality of Reference
• Pentium 4 Cache Organization
• ARM Cache Organization
Replacement Algorithms (1): Direct Mapping

• No choice
• Each block only maps to one line
• Replace that line
Replacement Algorithms (2): Associative & Set Associative
• Hardware-implemented algorithms (for speed)
• Least Recently Used (LRU)
– e.g., in 2-way set associative: which of the 2 blocks is the LRU one?
• First In First Out (FIFO)
– replace the block that has been in the cache longest
• Least Frequently Used (LFU)
– replace the block that has had the fewest hits
• Random
A minimal simulation of these policies is sketched below.
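A minimal Python sketch of LRU within a single set (the function name and the access trace are illustrative, not from the slides; real caches implement this in hardware):

```python
from collections import OrderedDict

def simulate_lru(accesses, ways=2):
    """Count hits for one cache set under LRU replacement."""
    lines = OrderedDict()                  # insertion order tracks recency
    hits = 0
    for tag in accesses:
        if tag in lines:
            hits += 1
            lines.move_to_end(tag)         # hit: mark most recently used
        else:
            if len(lines) >= ways:
                lines.popitem(last=False)  # miss: evict least recently used
            lines[tag] = True              # fill the freed line
    return hits

print(simulate_lru([1, 2, 1, 3, 2, 1]))    # -> 1 (only the 3rd access hits)
```

FIFO is the same loop with the move_to_end call removed (eviction order then equals arrival order); LFU would track a hit counter per line instead.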
Write Policy
• Must not overwrite a cache block unless main memory is
up to date
• Multiple CPUs may have individual caches
• I/O may address main memory directly
Write Through
• All writes go to main memory as well as to the cache
• Multiple CPUs can monitor main-memory traffic to keep their local (per-CPU) caches up to date
• Generates lots of memory traffic
• Slows down writes
• Remember bogus write-through caches!
Write Back
• Updates are initially made in the cache only
• An update (dirty) bit for the cache slot is set when an update occurs
• When a block is to be replaced, it is written to main memory only if the update bit is set
• Other caches can get out of sync
• I/O must access main memory through the cache
• N.B. about 15% of memory references are writes
The sketch below contrasts the memory traffic of the two policies.
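A minimal sketch (hypothetical single-line workload, not modeling any particular hardware) of how much main-memory write traffic each policy generates:

```python
def memory_writes(ops, policy):
    """Count main-memory writes for one cache line under a write policy."""
    dirty = False
    mem_writes = 0
    for op in ops:
        if op == "write":
            if policy == "write-through":
                mem_writes += 1    # every write also goes to memory
            else:                  # write-back: defer, set the update bit
                dirty = True
        elif op == "evict":
            if policy == "write-back" and dirty:
                mem_writes += 1    # write back only if the update bit is set
                dirty = False
    return mem_writes

ops = ["write"] * 4 + ["evict"]
print(memory_writes(ops, "write-through"))  # 4 -- lots of traffic
print(memory_writes(ops, "write-back"))     # 1 -- one deferred write
```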
Multilevel Caches
• High logic density enables caches on chip
– Faster than bus access
– Frees bus for other transfers
• Common to use both on and off chip cache
– L1 on chip, L2 off chip in static RAM
– L2 access much faster than DRAM or ROM
– L2 often uses separate data path
– L2 may now be on chip
– Resulting in L3 cache
• L3 accessed via the bus, or now also on chip
Measuring Cache Performance
• No cache: Often about 10 cycles per memory access
• Simple cache:
– tave = hC + (1 − h)M, where h = hit rate, C = cache access time, M = miss penalty
– C is often 1 clock cycle
– Assume M is 17 cycles (time to load an entire cache line)
– Assume h is about 90%
– tave = 0.9(1) + 0.1(17) = 2.6 cycles/access
– What happens when h is 95%? (see the sketch below)
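Evaluating the formula at both hit rates (a quick check of the slide's numbers):

```python
def t_ave(h, C=1, M=17):
    """Average access time: hit rate h, hit time C, miss penalty M (cycles)."""
    return h * C + (1 - h) * M

print(t_ave(0.90))  # 2.6 cycles/access, as above
print(t_ave(0.95))  # 1.8 cycles/access -- 5 more points of hit rate cut t_ave by ~30%
```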

Multi-Level Cache Performance
• tave = h1C1 + (1 − h1)h2C2 + (1 − h1)(1 − h2)M
– h1 = hit rate in the primary cache
– h2 = hit rate in the secondary cache (for accesses that miss in the primary)
– C1 = time to access the primary cache
– C2 = time to access the secondary cache
– M = miss penalty (time to load an entire cache line from main memory)
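The same model in code, with assumed example values (C1 = 1, C2 = 5, M = 17 cycles, and h1 = h2 = 0.9 are illustrative, not from the slide):

```python
def t_ave_2level(h1, h2, C1, C2, M):
    """Two-level average access time, per the formula above."""
    return h1 * C1 + (1 - h1) * h2 * C2 + (1 - h1) * (1 - h2) * M

print(t_ave_2level(0.90, 0.90, 1, 5, 17))  # 1.52 cycles/access
```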
Processor Performance Without Cache
• 5 GHz processor, cycle time = 0.2 ns
• Memory access time = 100 ns = 500 cycles
• Ignoring memory access, Clocks Per Instruction (CPI) = 1
• Assuming no memory data accesses (instruction fetch only):
CPI = 1 + # stall cycles
= 1 + 500 = 501

Performance with Level 1 Cache
• Assume hit rate h1 = 0.95
• 5 GHz processor, cycle time = 0.2 ns
• Memory access time = 100 ns = 500 cycles
• L1 access time = 0.2 ns = 1 processor cycle
• CPI = 1 + # stall cycles
= 1 + 0.05 × 500
= 26
• Processor speedup due to the cache
= 501/26 ≈ 19.3×

Performance with L1 and L2 Caches
• Assume:
– L1 hit rate, h1 = 0.95
– L2 hit rate, h2 = 0.90 (this is very optimistic!)
– L2 access time = 5 ns = 25 cycles
• CPI = 1 + # stall cycles
= 1 + 0.05 × (25 + 0.10 × 500)
= 1 + 3.75 = 4.75
• Processor speedup due to both caches
= 501/4.75 ≈ 105.5×
• Additional speedup due to the L2 cache
= 26/4.75 ≈ 5.5×
The sketch below reproduces these three CPI figures.
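A short check of the running example's numbers (constants follow the slides):

```python
MEM, L2_CYCLES = 500, 25     # 100 ns memory, 5 ns L2, at a 0.2 ns cycle time
H1, H2 = 0.95, 0.90          # L1 and L2 hit rates

cpi_no_cache = 1 + MEM                                     # 501
cpi_l1 = 1 + (1 - H1) * MEM                                # 26.0
cpi_l1_l2 = 1 + (1 - H1) * (L2_CYCLES + (1 - H2) * MEM)    # 4.75

print(cpi_no_cache / cpi_l1)     # ~19.3x speedup from L1 alone
print(cpi_no_cache / cpi_l1_l2)  # ~105.5x speedup from L1 + L2
print(cpi_l1 / cpi_l1_l2)        # ~5.5x additional speedup from L2
```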

[Figure: Total hit ratio (L1 and L2) for 8-KByte and 16-KByte L1 caches]
Unified vs. Split Caches
• One cache for data and instructions, or two: one for data and one for instructions
• Advantages of a unified cache:
– Higher hit rate: balances the load between instruction and data fetches
– Only one cache to design & implement
• Advantages of a split cache:
– Eliminates cache contention between the instruction fetch/decode unit and the execution unit
– Important in pipelining
Pentium 4 Cache
• 80386 – no on-chip cache
• 80486 – 8 KBytes, using 16-byte lines and a four-way set-associative organization
• Pentium (all versions) – two on-chip L1 caches
– Data & instructions
• Pentium III – L3 cache added off chip
• Pentium 4
– L1 caches
• 8 KBytes
• 64-byte lines
• Four-way set associative
– L2 cache
• Feeds both L1 caches
• 256 KBytes
• 128-byte lines
• Eight-way set associative
– L3 cache on chip
Pentium 4 Design Reasoning
• Decodes instructions into RISC-like micro-ops before the L1 cache
• Micro-ops are fixed length
– Enables superscalar pipelining and scheduling
• Pentium instructions are long & complex
• Performance is improved by separating decoding from scheduling & pipelining
– (More later – ch. 14)
• Data cache is write-back
– Can be configured to write-through
• L1 cache controlled by 2 bits in a control register
– CD = cache disable
– NW = not write-through
– 2 instructions to invalidate (flush) the cache, or write back and then invalidate (x86 INVD and WBINVD)
• L2 and L3 caches are 8-way set-associative
– Line size 128 bytes
ARM Cache Features

| Core | Cache Type | Cache Size (kB) | Cache Line Size (words) | Associativity | Location | Write Buffer Size (words) |
|------|------------|-----------------|--------------------------|---------------|----------|---------------------------|
| ARM720T | Unified | 8 | 4 | 4-way | Logical | 8 |
| ARM920T | Split | 16/16 D/I | 8 | 64-way | Logical | 16 |
| ARM926EJ-S | Split | 4-128/4-128 D/I | 8 | 4-way | Logical | 16 |
| ARM1022E | Split | 16/16 D/I | 8 | 64-way | Logical | 16 |
| ARM1026EJ-S | Split | 4-128/4-128 D/I | 8 | 4-way | Logical | 8 |
| Intel StrongARM | Split | 16/16 D/I | 4 | 32-way | Logical | 32 |
| Intel Xscale | Split | 32/32 D/I | 8 | 32-way | Logical | 32 |
| ARM1136-JF-S | Split | 4-64/4-64 D/I | 8 | 4-way | Physical | 32 |
ARM Cache Organization
• Small FIFO write buffer
– Enhances memory write performance
– Sits between the cache and main memory
– Small compared with the cache
– Data is put into the write buffer at processor clock speed
– The processor continues execution
– External memory writes proceed in parallel until the buffer is empty
– If the buffer is full, the processor stalls
– Data in the write buffer is not available until written to memory
• So keep the buffer small (a minimal sketch follows)
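A minimal Python sketch of such a FIFO write buffer (class and method names are hypothetical; the real ARM hardware is considerably more involved):

```python
from collections import deque

class WriteBuffer:
    """Small FIFO write buffer between cache and main memory."""
    def __init__(self, size=4):
        self.size = size
        self.queue = deque()

    def write(self, addr, data):
        if len(self.queue) == self.size:
            self.drain_one()                # buffer full: processor stalls
        self.queue.append((addr, data))     # buffered at processor clock speed

    def drain_one(self):
        # Models one slow external memory write completing.
        self.queue.popleft()

    def pending(self, addr):
        # Buffered data is not yet visible in main memory, so a read of the
        # same address must check the buffer (newest entry first).
        for a, d in reversed(self.queue):
            if a == addr:
                return d
        return None                         # not pending: read main memory
```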
[Figure: ARM cache and write buffer organization]
Review Questions

❑What are the differences among sequential access, direct access, and random
access?
❑What is the general relationship among access time, memory cost, and capacity?
❑How does the principle of locality relate to the use of multiple memory levels?
❑What is the distinction between spatial locality and temporal locality?
❑In general, what are the strategies for exploiting spatial locality and temporal
locality?
Thank you
