CIS775: Computer Architecture: Chapter 1: Fundamentals of Computer Design

The document provides an overview of the CIS775: Computer Architecture course. The key topics covered include:
- Course objectives such as evaluating instruction set design, advanced pipelining techniques, and memory system design.
- Defining computer architecture as the functional operation and information flow within a computer system.
- Major topics that will be covered, like instruction set architecture, pipelining, memory hierarchy, multiprocessors, and performance evaluation methods.
- How computer systems have changed dramatically over the past decades due to advances in technology, computer architecture, and a drop in costs.

Uploaded by

padma

CIS775: Computer Architecture

Chapter 1: Fundamentals of Computer Design

Course Objectives
To evaluate the issues involved in choosing and designing an instruction set.
To learn the concepts behind advanced pipelining techniques.
To understand the "memory wall" problem and the current state of the art in memory system design.
To understand the qualitative and quantitative tradeoffs in the design of modern computer systems.

What is Computer Architecture?

Functional operation of the individual hardware units within a computer system, and the flow of information and control among them.

[Figure: Computer architecture (hardware organization, interface design/ISA) shown in relation to technology, parallelism, measurement and evaluation, programming languages, applications, and the OS.]

Computer Architecture Topics

Input/Output and Storage: disks, WORM, tape, RAID
Emerging technologies, interleaving memories
Memory hierarchy: DRAM, L2 cache, L1 cache
  Coherence, bandwidth, latency
VLSI
Instruction Set Architecture: addressing, protection, exception handling
Pipelining and Instruction Level Parallelism: pipelining, hazard resolution, superscalar, reordering, prediction, speculation, vector, DSP

Computer Architecture Topics

Processor-Memory-Switch
Multiprocessors: shared memory, message passing, data parallelism
Networks and interconnections: network interfaces
Interconnection network: topologies, routing, bandwidth, latency, reliability

Measurement and Evaluation

Architecture is an iterative process:
  Searching the space of possible designs
  At all levels of computer systems

[Figure: Iterative loop of design and analysis. Creativity generates ideas; cost/performance analysis sorts them into good, mediocre, and bad ideas.]

Issues for a Computer Designer

Functional requirements analysis (target)
  Scientific computing: high-performance floating point
  Business: transactional support, decimal arithmetic
  General purpose: balanced performance for a range of tasks

Level of software compatibility
  Programming-language level: flexible, needs a new compiler, portability is an issue
  Binary level (x86 architecture): little flexibility, minimal portability requirements

OS requirements
  Address space issues, memory management, protection

Conformance to standards
  Languages, OS, networks, I/O, IEEE floating point

Computer Systems: Technology Trends

1988
  Supercomputers
  Massively parallel processors
  Mini-supercomputers
  Minicomputers
  Workstations
  PCs

2002
  Powerful PCs and SMP workstations
  Network of SMP workstations
  Mainframes
  Supercomputers
  Embedded computers

Why Such Change in 10 years?

Performance
  Technology advances
    CMOS (complementary metal-oxide-semiconductor) VLSI dominates older technologies like TTL (transistor-transistor logic) in cost AND performance
  Computer architecture advances improve the low end
    RISC, pipelining, superscalar, RAID, ...

Price: lower costs due to
  Simpler development
    CMOS VLSI: smaller systems, fewer components
  Higher volumes
  Lower margins by class of computer, due to fewer services

Function: rise of networking and local interconnection technology

[Figure: Growth in Microprocessor Performance]

[Figure: Six Generations of DRAMs]

Updated Technology Trends (Summary)

                      Capacity         Speed (latency)
Logic                 4x in 4 years    2x in 3 years
DRAM                  4x in 3 years    2x in 10 years
Disk                  4x in 2 years    2x in 10 years
Network (bandwidth)   10x in 5 years

Updates during your study period?
  BS (4 yrs)
  MS (2 yrs)
  PhD (5 yrs)
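
One way to answer the study-period question is to compound each capacity trend over the length of a degree. The sketch below assumes smooth exponential growth between the quoted data points; the function name `growth_over` and the dictionaries are illustrative, not part of the course material.

```python
# Compound a trend of the form "factor x every period_years" over a span.
# Assumption: growth is smooth and exponential between the quoted points.

def growth_over(factor: float, period_years: float, span_years: float) -> float:
    """Improvement after span_years for a trend of `factor`x per `period_years`."""
    return factor ** (span_years / period_years)


# Capacity trends as quoted in the summary: (factor, period in years).
CAPACITY_TRENDS = {
    "Logic": (4, 4),
    "DRAM": (4, 3),
    "Disk": (4, 2),
}

# Degree lengths as quoted in the summary.
DEGREES = {"BS": 4, "MS": 2, "PhD": 5}

if __name__ == "__main__":
    for degree, years in DEGREES.items():
        for tech, (factor, period) in CAPACITY_TRENDS.items():
            gain = growth_over(factor, period, years)
            print(f"{degree} ({years} yrs): {tech} capacity grows {gain:.1f}x")
```

For instance, a trend of 4x every 2 years compounds to 16x over a 4-year degree.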

Integrated Circuits Costs

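$$ \text{IC cost} = \frac{\text{Die cost} + \text{Testing cost} + \text{Packaging cost}}{\text{Final test yield}} $$

$$ \text{Die cost} = \frac{\text{Wafer cost}}{\text{Dies per wafer} \times \text{Die yield}} $$

$$ \text{Dies per wafer} = \frac{\pi \times (\text{Wafer\_diam}/2)^2}{\text{Die\_Area}} - \frac{\pi \times \text{Wafer\_diam}}{\sqrt{2 \times \text{Die\_Area}}} - \text{Test dies} $$

$$ \text{Die yield} = \text{Wafer yield} \times \left( 1 + \frac{\text{Defects\_per\_unit\_area} \times \text{Die\_Area}}{\alpha} \right)^{-\alpha} $$

Here α is a process-dependent parameter of the yield model.

Die cost goes roughly with (die area)^4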
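
The cost model can be tried out numerically. The sketch below assumes the textbook form of the yield model with a process parameter `alpha` (defaulting to 4.0 here purely as an assumption), and every input value in the demo is hypothetical rather than taken from the slides.

```python
import math


def dies_per_wafer(wafer_diam_cm: float, die_area_cm2: float, test_dies: int = 0) -> float:
    """Wafer area over die area, minus dies lost at the round edge and test dies."""
    wafer_area = math.pi * (wafer_diam_cm / 2) ** 2
    edge_loss = math.pi * wafer_diam_cm / math.sqrt(2 * die_area_cm2)
    return wafer_area / die_area_cm2 - edge_loss - test_dies


def die_yield(wafer_yield: float, defects_per_cm2: float, die_area_cm2: float,
              alpha: float = 4.0) -> float:
    """Fraction of good dies; alpha is an assumed process parameter."""
    return wafer_yield * (1 + defects_per_cm2 * die_area_cm2 / alpha) ** (-alpha)


def die_cost(wafer_cost: float, wafer_diam_cm: float, die_area_cm2: float,
             wafer_yield: float, defects_per_cm2: float, alpha: float = 4.0) -> float:
    """Wafer cost spread over the good dies on the wafer."""
    good_dies = (dies_per_wafer(wafer_diam_cm, die_area_cm2)
                 * die_yield(wafer_yield, defects_per_cm2, die_area_cm2, alpha))
    return wafer_cost / good_dies


def ic_cost(die: float, testing: float, packaging: float, final_test_yield: float) -> float:
    """Total cost per shipped part."""
    return (die + testing + packaging) / final_test_yield


if __name__ == "__main__":
    # Hypothetical inputs: $5000 wafer, 30 cm diameter, 0.6 defects per cm^2.
    for area in (0.5, 1.0, 2.0, 4.0):
        cost = die_cost(5000, 30, area, wafer_yield=1.0, defects_per_cm2=0.6)
        print(f"die area {area:.1f} cm^2 -> die cost ${cost:.2f}")
```

Running the loop shows die cost rising much faster than linearly with die area, since larger dies both fit fewer per wafer and yield worse.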


Performance Trends (Summary)
Workstation performance (measured in SPECmarks) improves roughly 50% per year (about 2X every 18 months)
Improvement in cost-performance is estimated at 70% per year
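
The relationship between an annual improvement rate and a doubling time can be checked with a short calculation. This is a sketch assuming steady compound growth; the function names are illustrative.

```python
import math


def doubling_time_years(annual_growth: float) -> float:
    """Years needed to double at a steady annual growth rate (0.5 means 50%)."""
    return math.log(2) / math.log(1 + annual_growth)


def annual_growth_for_doubling(months: float) -> float:
    """Annual growth rate implied by doubling every `months` months."""
    return 2 ** (12 / months) - 1


if __name__ == "__main__":
    years = doubling_time_years(0.50)
    print(f"50% per year doubles every {years * 12:.1f} months")
    print(f"2X every 18 months is {annual_growth_for_doubling(18) * 100:.0f}% per year")
    print(f"70% per year doubles every {doubling_time_years(0.70) * 12:.1f} months")
```

Under this assumption, 50% per year and 2X every 18 months are close but not identical rates, which is consistent with both figures being quoted as rough estimates.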

Computer Engineering Methodology

[Figure: Design cycle. Evaluate existing systems for bottlenecks (using benchmarks), simulate new designs and organizations (using workloads and technology trends), implement the next-generation system (subject to implementation complexity), then repeat.]

How to Quantify Performance?

Plane              DC to Paris   Speed      Passengers   Throughput (pmph)
Boeing 747         6.5 hours     610 mph    470          286,700
BAC/Sud Concorde   3 hours       1350 mph   132          178,200

(pmph = passenger-miles per hour)

Time to run the task (ExTime)
  Execution time, response time, latency
Tasks per day, hour, week, sec, ns (Performance)
  Throughput, bandwidth

The Bottom Line: Performance and Cost or Cost and Performance?

"X is n times faster than Y" means

$$ n = \frac{\mathrm{ExTime}(Y)}{\mathrm{ExTime}(X)} = \frac{\mathrm{Performance}(X)}{\mathrm{Performance}(Y)} $$

Speed of Concorde vs. Boeing 747
Throughput of Boeing 747 vs. Concorde
Cost is also an important parameter in the equation, which is why Concordes are being put out to pasture!
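
The two comparisons can be worked out directly from the figures in the plane table. The sketch below is illustrative: the `Plane` class and `times_faster` helper are names introduced here, and throughput is taken to be passengers multiplied by speed, which reproduces the pmph values in the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plane:
    name: str
    trip_hours: float  # DC to Paris flight time
    speed_mph: float
    passengers: int

    @property
    def throughput_pmph(self) -> float:
        """Passenger-miles per hour: passengers carried times speed."""
        return self.passengers * self.speed_mph


def times_faster(extime_y: float, extime_x: float) -> float:
    """Return n such that X is n times faster than Y: ExTime(Y) / ExTime(X)."""
    return extime_y / extime_x


boeing = Plane("Boeing 747", trip_hours=6.5, speed_mph=610, passengers=470)
concorde = Plane("Concorde", trip_hours=3.0, speed_mph=1350, passengers=132)

if __name__ == "__main__":
    n_latency = times_faster(boeing.trip_hours, concorde.trip_hours)
    n_throughput = boeing.throughput_pmph / concorde.throughput_pmph
    print(f"Concorde is {n_latency:.2f} times faster by trip time")
    print(f"Boeing 747 has {n_throughput:.2f} times the throughput")
```

Which plane "wins" depends on whether the metric is latency (time for one trip) or throughput (passenger-miles delivered per hour).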

Measurement Tools
Benchmarks, Traces, Mixes
Hardware: Cost, delay, area, power estimation
Simulation (many levels)
ISA, RT, Gate, Circuit

Queuing Theory
Rules of Thumb
Fundamental Laws/Principles
Understanding the limitations of any
measurement tool is crucial.

Metrics of Performance
Application: answers per month, operations per second
Programming language
Compiler
ISA: (millions of) instructions per second (MIPS); (millions of) floating-point operations per second (MFLOP/s)
Datapath, control: megabytes per second
Function units
Transistors, wires, pins: cycles per second (clock rate)

Cases of Benchmark Engineering

The motivation is to tune the system to the benchmark to achieve peak performance.
At the architecture level
  Specialized instructions
At the compiler level (compiler flags)
  Blocking in SPEC89: factor of 9 speedup
  Incorrect compiler optimizations/reordering: would work fine on the benchmark but not on other programs
At the I/O level
  SPEC92 spreadsheet program (sp)
  Companies noticed that the output was always written to a file, so they stored the results in a memory buffer and wrote it out at the end (which was not measured).
  One company eliminated the I/O altogether.

After putting in a blazing performance on the benchmark test, Sun issued a glowing press release claiming that it had outperformed Windows NT systems on the test. Pendragon president Ivan Phillips cried foul, saying the results weren't representative of real-world Java performance and that Sun had gone so far as to duplicate the test's code within Sun's Just-In-Time compiler. That's cheating, says Phillips, who claims that benchmark tests and real-world applications aren't the same thing.

Did Sun issue a denial or a mea culpa? Initially, Sun neither denied optimizing for the benchmark test nor apologized for it. "If the test results are not representative of real-world Java applications, then that's a problem with the benchmark," Sun's Brian Croll said.

After taking a beating in the press, though, Sun retreated and issued an apology for the optimization. [Excerpted from PC Online221997]

Issues with Benchmark Engineering
Motivated by the bottom dollar: good performance on classic suites means more customers and better sales.
Benchmark engineering limits the longevity of benchmark suites.
Technology and applications also limit the longevity of benchmark suites.

SPEC: System Performance Evaluation Cooperative

First round, 1989
  10 programs yielding a single number (SPECmarks)
Second round, 1992
  SPECint92 (6 integer programs) and SPECfp92 (14 floating-point programs)
  Compiler flags unlimited. March 93
New set of programs: SPECint95 (8 integer programs) and SPECfp95 (10 floating-point programs)
  Benchmarks useful for 3 years
  Single flag setting for all programs: SPECint_base95, SPECfp_base95
SPEC CPU2000: 11 integer benchmarks (CINT2000) and 14 floating-point benchmarks (CFP2000)

[Figure: SPEC 2000 (CINT2000) results]

[Figure: SPEC 2000 (CFP2000) results]

Reporting Performance Results

Reproducibility
  Apply them on publicly available benchmarks.
Pecking/picking order
  Real programs
  Real kernels
  Toy benchmarks
  Synthetic benchmarks

How to Summarize Performance
Arithmetic mean (weighted arithmetic mean) tracks execution time: sum(Ti)/n or sum(Wi*Ti)
Harmonic mean (weighted harmonic mean) of rates (e.g., MFLOPS) tracks execution time: n/sum(1/Ri) or 1/sum(Wi/Ri)
Normalized execution time is handy for scaling performance (e.g., X times faster than SPARCstation 10)
But do not take the arithmetic mean of normalized execution times; use the geometric mean = (Product(Ri))^(1/n)
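
The warning about normalized execution times is easiest to see with numbers. The sketch below implements the means named above and applies them to two hypothetical machines, A and B, running two hypothetical programs; all timings are made up for illustration.

```python
import math


def arithmetic_mean(times):
    """sum(Ti) / n"""
    return sum(times) / len(times)


def weighted_arithmetic_mean(times, weights):
    """sum(Wi * Ti), with weights expected to sum to 1."""
    return sum(w * t for w, t in zip(weights, times))


def harmonic_mean(rates):
    """n / sum(1 / Ri), appropriate for rates such as MFLOPS."""
    return len(rates) / sum(1 / r for r in rates)


def geometric_mean(ratios):
    """(Product(Ri)) ** (1 / n)"""
    return math.prod(ratios) ** (1 / len(ratios))


# Hypothetical execution times in seconds for two programs.
times_a = [1.0, 1000.0]
times_b = [10.0, 100.0]

# Normalize each machine to the other as the reference.
b_relative_to_a = [b / a for a, b in zip(times_a, times_b)]  # [10, 0.1]
a_relative_to_b = [a / b for a, b in zip(times_a, times_b)]  # [0.1, 10]

if __name__ == "__main__":
    print("Arithmetic mean, B normalized to A:", arithmetic_mean(b_relative_to_a))
    print("Arithmetic mean, A normalized to B:", arithmetic_mean(a_relative_to_b))
    print("Geometric mean, B normalized to A:", geometric_mean(b_relative_to_a))
    print("Geometric mean, A normalized to B:", geometric_mean(a_relative_to_b))
```

With these inputs the arithmetic mean of normalized times makes each machine look slower than the other depending on the reference, while the geometric mean gives the same answer either way. Note that the geometric mean does not track total execution time, so it is a consistency fix rather than a replacement for reporting time.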

Performance Evaluation
For better or worse, benchmarks shape a field.
Good products are created when we have:
  Good benchmarks
  Good ways to summarize performance
Since sales are in part a function of performance relative to the competition, companies invest in improving the product as reported by the performance summary.
If the benchmarks or summary are inadequate, a company must choose between improving the product for real programs and improving the product to get more sales; sales almost always wins!
Execution time is the measure of computer performance!

Simulations
When are simulations useful?
What are their limitations, i.e., what real-world phenomena do they not account for?
The larger the simulation trace, the less tractable the post-processing analysis.

Queueing Theory
What are the distributions of arrival rates
and values for other parameters?
Are they realistic?
What happens when the parameters or
distributions are changed?

Quantitative Principles of Computer Design
Make the common case fast
  Amdahl's Law
CPU performance equation
  Clock cycle time
  CPI
  Instruction count
Principle of locality
Take advantage of parallelism

Amdahl's Law
Speedup due to enhancement E:

$$ \mathrm{Speedup}(E) = \frac{\text{ExTime w/o E}}{\text{ExTime w/ E}} = \frac{\text{Performance w/ E}}{\text{Performance w/o E}} $$

Suppose that enhancement E accelerates a fraction F of the task by a factor S, and the remainder of the task is unaffected.

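Amdahl's Law

$$ \mathrm{ExTime}_{new} = \mathrm{ExTime}_{old} \times \left[ (1 - \mathrm{Fraction}_{enhanced}) + \frac{\mathrm{Fraction}_{enhanced}}{\mathrm{Speedup}_{enhanced}} \right] $$

$$ \mathrm{Speedup}_{overall} = \frac{\mathrm{ExTime}_{old}}{\mathrm{ExTime}_{new}} = \frac{1}{(1 - \mathrm{Fraction}_{enhanced}) + \dfrac{\mathrm{Fraction}_{enhanced}}{\mathrm{Speedup}_{enhanced}}} $$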
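
Amdahl's Law is short enough to express as a function. The sketch below is a direct transcription of the overall-speedup formula; the function name and the example numbers (40% of the task sped up 10x) are hypothetical.

```python
def amdahl_speedup(fraction_enhanced: float, speedup_enhanced: float) -> float:
    """Overall speedup when `fraction_enhanced` of the original execution time
    is accelerated by `speedup_enhanced` and the rest is unaffected."""
    if not 0.0 <= fraction_enhanced <= 1.0:
        raise ValueError("fraction_enhanced must be between 0 and 1")
    if speedup_enhanced <= 0.0:
        raise ValueError("speedup_enhanced must be positive")
    return 1.0 / ((1.0 - fraction_enhanced) + fraction_enhanced / speedup_enhanced)


def amdahl_limit(fraction_enhanced: float) -> float:
    """Upper bound on overall speedup as the enhancement becomes infinitely fast."""
    return 1.0 / (1.0 - fraction_enhanced)


if __name__ == "__main__":
    # Hypothetical case: 40% of execution time sped up by a factor of 10.
    print(f"Overall speedup: {amdahl_speedup(0.4, 10):.4f}")
    print(f"Best possible with 40% enhanced: {amdahl_limit(0.4):.4f}")
```

The limit function makes the "make the common case fast" point concrete: the unenhanced fraction caps the overall speedup no matter how large the local speedup is.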

Amdahl's Law
Floating-point instructions are improved to run 2X as fast, but only 10% of actual instructions are FP
ExTime_new =
Speedup_overall =

CPU Performance Equation

$$ \text{CPU time} = \frac{\text{Seconds}}{\text{Program}} = \frac{\text{Instructions}}{\text{Program}} \times \frac{\text{Cycles}}{\text{Instruction}} \times \frac{\text{Seconds}}{\text{Cycle}} $$

                Inst Count   CPI    Clock Rate
Program         X
Compiler        X            (X)
Inst. Set       X            X
Organization                 X      X
Technology                          X
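
The CPU performance equation can be used to compare design trade-offs numerically. The sketch below assumes the three-factor form (instruction count, CPI, clock rate); the workload numbers and the compiler trade-off are hypothetical.

```python
def cpu_time_seconds(instruction_count: float, cpi: float, clock_rate_hz: float) -> float:
    """CPU time = instructions/program x cycles/instruction x seconds/cycle."""
    return instruction_count * cpi / clock_rate_hz


if __name__ == "__main__":
    # Hypothetical baseline: 1e9 instructions, CPI of 1.5, 500 MHz clock.
    base = cpu_time_seconds(1e9, 1.5, 500e6)

    # Hypothetical compiler change: 10% fewer instructions, 5% higher CPI.
    tuned = cpu_time_seconds(0.9e9, 1.5 * 1.05, 500e6)

    print(f"Baseline CPU time: {base:.3f} s")
    print(f"Tuned CPU time:    {tuned:.3f} s")
    print(f"Speedup:           {base / tuned:.3f}")
```

Because the three factors multiply, a change that improves one factor only helps if it does not worsen the others by a larger proportion.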

Cycles Per Instruction

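Average cycles per instruction:

$$ \mathrm{CPI} = \frac{\text{CPU time} \times \text{Clock rate}}{\text{Instruction count}} = \frac{\text{Cycles}}{\text{Instruction count}} $$

$$ \text{CPU time} = \text{CycleTime} \times \sum_{i=1}^{n} \mathrm{CPI}_i \times I_i $$

Instruction frequency:

$$ \mathrm{CPI} = \sum_{i=1}^{n} \mathrm{CPI}_i \times F_i \quad \text{where} \quad F_i = \frac{I_i}{\text{Instruction count}} $$

Invest resources where time is spent!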

Example: Calculating CPI

Base Machine (Reg / Reg), Typical Mix

Op       Freq   Cycles   CPI(i)   (% Time)
ALU      50%    1        .5       (33%)
Load     20%    2        .4       (27%)
Store    10%    2        .2       (13%)
Branch   20%    2        .4       (27%)
Total                    1.5
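
The CPI example can be reproduced in a few lines. The sketch below uses the instruction mix from the example (frequency and cycles per operation class); the function names are illustrative.

```python
# Instruction mix from the example: op -> (frequency, cycles per instruction).
MIX = {
    "ALU": (0.50, 1),
    "Load": (0.20, 2),
    "Store": (0.10, 2),
    "Branch": (0.20, 2),
}


def average_cpi(mix):
    """CPI = sum over classes of CPI_i * F_i."""
    return sum(freq * cycles for freq, cycles in mix.values())


def time_share(mix):
    """Fraction of execution time spent in each instruction class."""
    total = average_cpi(mix)
    return {op: freq * cycles / total for op, (freq, cycles) in mix.items()}


if __name__ == "__main__":
    print(f"Average CPI: {average_cpi(MIX):.2f}")
    for op, share in time_share(MIX).items():
        print(f"{op:<7} {share * 100:.0f}% of time")
```

The time shares show where to invest: ALU operations are half the instructions but only about a third of the time, while loads and branches together account for over half.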

Chapter Summary, #1
Designing to last through trends
  Logic: 2x in 3 years
  DRAM: capacity 4x in 3 years; speed 2x in 10 years
  Disk: capacity 4x in 3 years; speed 2x in 10 years
6 yrs to graduate => 16X CPU speed, DRAM/disk size

Time to run the task
  Execution time, response time, latency
Tasks per day, hour, week, sec, ns, ...
  Throughput, bandwidth

"X is n times faster than Y" means

$$ n = \frac{\mathrm{ExTime}(Y)}{\mathrm{ExTime}(X)} = \frac{\mathrm{Performance}(X)}{\mathrm{Performance}(Y)} $$

Chapter Summary, #2
Amdahl's Law:

$$ \mathrm{Speedup}_{overall} = \frac{\mathrm{ExTime}_{old}}{\mathrm{ExTime}_{new}} = \frac{1}{(1 - \mathrm{Fraction}_{enhanced}) + \dfrac{\mathrm{Fraction}_{enhanced}}{\mathrm{Speedup}_{enhanced}}} $$

CPI Law:

$$ \text{CPU time} = \frac{\text{Seconds}}{\text{Program}} = \frac{\text{Instructions}}{\text{Program}} \times \frac{\text{Cycles}}{\text{Instruction}} \times \frac{\text{Seconds}}{\text{Cycle}} $$

Execution time is the REAL measure of computer performance!
Good products are created when we have good benchmarks and good ways to summarize performance.
Die cost goes roughly with (die area)^4

Food for thought

Two companies report results on two benchmarks: one on a Fortran benchmark suite and the other on a C++ benchmark suite.
Company A's product outperforms Company B's on the Fortran suite; the reverse holds true for the C++ suite. Assume the performance differences are similar in both cases.
Do you have enough information to compare the two products? What information will you need?

Food for Thought II

In the CISC vs. RISC debate, a key argument of the RISC movement was that because of its simplicity, RISC would always remain ahead.
If there were enough transistors to implement a CISC on chip, then those same transistors could implement a pipelined RISC.
If there were enough to allow for a pipelined CISC, there would be enough to have an on-chip cache for RISC. And so on.
After 20 years of this debate, what do you think?
Hint: Think of commercial PCs, Moore's law, and some of the data in the first chapter of the book (and on these slides).

Amdahl's Law (answer)

Floating-point instructions are improved to run 2X as fast, but only 10% of actual instructions are FP

$$ \mathrm{ExTime}_{new} = \mathrm{ExTime}_{old} \times (0.9 + 0.1/2) = 0.95 \times \mathrm{ExTime}_{old} $$

$$ \mathrm{Speedup}_{overall} = \frac{1}{0.95} = 1.053 $$
