MULTIPLE PROCESSOR
ORGANIZATION
• Single instruction, single data stream - SISD
• Single instruction, multiple data stream - SIMD
• Multiple instruction, single data stream - MISD
• Multiple instruction, multiple data stream - MIMD
SINGLE INSTRUCTION,
SINGLE DATA STREAM -
SISD
• Single processor
• Single instruction stream
• Data stored in single memory
• Uni-processor
SINGLE INSTRUCTION,
MULTIPLE DATA STREAM -
SIMD
• Single machine instruction
• Controls simultaneous execution
• Number of processing elements
• Lockstep basis
• Each processing element has associated data memory
• Each instruction executed on different set of data by
different processors
• Vector and array processors
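A concrete illustration using x86 SSE intrinsics (an assumption for illustration; the slides name no particular instruction set): one machine instruction performs four additions in lockstep on four data elements.

/* Illustrative only: one SSE instruction (addps) adds four float
   elements at once -- single instruction, multiple data streams. */
#include <xmmintrin.h>
#include <stdio.h>

int main(void) {
    float a[4] = {1, 2, 3, 4};
    float b[4] = {10, 20, 30, 40};
    float c[4];

    __m128 va = _mm_loadu_ps(a);       /* load 4 floats */
    __m128 vb = _mm_loadu_ps(b);
    __m128 vc = _mm_add_ps(va, vb);    /* ONE instruction, 4 additions in lockstep */
    _mm_storeu_ps(c, vc);

    for (int i = 0; i < 4; i++)
        printf("%.0f ", c[i]);         /* prints: 11 22 33 44 */
    return 0;
}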
MULTIPLE INSTRUCTION,
SINGLE DATA STREAM -
MISD
• Sequence of data
• Transmitted to set of processors
• Each processor executes different instruction
sequence
• Never been commercially implemented
TAXONOMY OF PARALLEL
PROCESSOR
ARCHITECTURES
MIMD - OVERVIEW
• General purpose processors
• Each can process all instructions necessary
• Further classified by method of processor
communication
TIGHTLY COUPLED - SMP
• Processors share memory
• Communicate via that shared memory
• Symmetric Multiprocessor (SMP)
• Share single memory or pool
• Shared bus to access memory
• Memory access time to given area of memory is
approximately the same for each processor
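A minimal sketch of communication via shared memory, using POSIX threads to stand in for processors on an SMP (illustrative; the variable and function names are assumptions, not from the slides):

/* Two threads on an SMP communicate through an ordinary shared
   variable; no messages are exchanged. Illustrative sketch only. */
#include <pthread.h>
#include <stdio.h>

int shared_value = 0;                       /* lives in the shared memory pool */
pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

void *producer(void *arg) {
    pthread_mutex_lock(&lock);
    shared_value = 42;                      /* communicate by writing shared memory */
    pthread_mutex_unlock(&lock);
    return NULL;
}

int main(void) {
    pthread_t t;
    pthread_create(&t, NULL, producer, NULL);
    pthread_join(t, NULL);
    printf("read from shared memory: %d\n", shared_value);
    return 0;
}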
TIGHTLY COUPLED - NUMA
• Non-uniform memory access
• Access times to different regions of memory may
differ.
LOOSELY COUPLED -
CLUSTERS
• Collection of independent uniprocessors or SMPs
• Interconnected to form a cluster
• Communication via fixed path or network connections
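A minimal sketch of message-based communication between cluster nodes, using MPI (illustrative; assumes an MPI installation and exactly two processes):

/* Message passing between independent nodes of a cluster (MPI).
   Compile with mpicc, run with mpirun -np 2. Illustrative sketch. */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int rank, value;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        value = 99;
        /* no shared memory: send an explicit message over the network */
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        printf("node 1 received %d over the network\n", value);
    }
    MPI_Finalize();
    return 0;
}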
PARALLEL ORGANIZATIONS
[Diagrams in the original slides: SISD, SIMD, MIMD with shared memory, MIMD with distributed memory]
SYMMETRIC
MULTIPROCESSORS
• A stand-alone computer with the following characteristics:
• Two or more similar processors of comparable capacity
• Processors share same memory and I/O
• Processors are connected by a bus or other internal connection
• Memory access time is approximately the same for each processor
• All processors share access to I/O
• Either through same channels or different channels giving paths to same
devices
• All processors can perform the same functions (hence symmetric)
• System controlled by integrated operating system
• providing interaction between processors
• Interaction at job, task, file and data element levels
MULTIPROGRAMMING AND
MULTIPROCESSING
SMP ADVANTAGES
• Performance
• If some work can be done in parallel
• Availability
• Since all processors can perform the same functions, failure of a single
processor does not halt the system
• Incremental growth
• User can enhance performance by adding additional processors
• Scaling
• Vendors can offer range of products based on number of processors
TIGHTLY COUPLED
MULTIPROCESSOR
MULTITHREADING AND
CHIP MULTIPROCESSORS
• Instruction stream divided into smaller streams (threads)
• Executed in parallel
• Wide variety of multithreading designs
DEFINITIONS OF THREADS
AND PROCESSES
• A thread in a multithreaded processor may or may not be the
same as a software thread
• Process:
• An instance of program running on computer
• Resource ownership
• Virtual address space to hold process image
• Scheduling/execution
• Process switch
Cont…
• Thread: dispatchable unit of work within a process
• Includes processor context (which includes the program
counter and stack pointer) and data area for stack
• Thread executes sequentially
• Interruptible: processor can turn to another thread
• Thread switch
• Switching processor between threads within same process
• Typically less costly than process switch
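A minimal POSIX threads sketch of these definitions (illustrative): one process dispatches two threads that share its address space, each executing sequentially with its own stack and program counter.

/* Two threads inside one process: each has its own stack and program
   counter, but both share the process's address space. */
#include <pthread.h>
#include <stdio.h>

void *worker(void *arg) {
    /* Each thread executes sequentially within the shared process image. */
    printf("thread %ld running in the same address space\n", (long)arg);
    return NULL;
}

int main(void) {
    pthread_t t1, t2;
    pthread_create(&t1, NULL, worker, (void *)1L);   /* dispatchable units of work */
    pthread_create(&t2, NULL, worker, (void *)2L);
    pthread_join(t1, NULL);                          /* switching between these is */
    pthread_join(t2, NULL);                          /* cheaper than a process switch */
    return 0;
}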
IMPLICIT AND EXPLICIT
MULTITHREADING
• All commercial processors and most experimental ones use explicit
multithreading
• Concurrently execute instructions from different explicit threads
• Interleave instructions from different threads on shared pipelines, or
execute them in parallel on parallel pipelines
• Implicit multithreading is concurrent execution of multiple threads
extracted from single sequential program
• Implicit threads defined statically by compiler or dynamically by hardware
APPROACHES TO EXPLICIT
MULTITHREADING
• Interleaved
• Fine-grained
• Processor deals with two or more thread contexts at a time
• Switching thread at each clock cycle
• If a thread is blocked, it is skipped (see the toy model after this list)
• Blocked
• Coarse-grained
• Thread executed until event causes delay
• E.g. Cache miss
• Effective on an in-order processor
• Avoids pipeline stalls
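A toy software model of the interleaved (fine-grained) policy (illustrative simulation, not real hardware; all names are invented): the "processor" switches thread context every clock cycle and skips any thread that is blocked.

/* Toy model of fine-grained (interleaved) multithreading: switch
   thread every cycle; a blocked thread is simply skipped. */
#include <stdio.h>
#include <stdbool.h>

#define NTHREADS 3

struct context { int pc; bool blocked; };   /* hypothetical thread context */

int main(void) {
    /* thread 1 is blocked, e.g. on a cache miss */
    struct context ctx[NTHREADS] = {{0, false}, {0, true}, {0, false}};

    for (int cycle = 0; cycle < 6; cycle++) {
        int t = cycle % NTHREADS;           /* switch thread at each clock cycle */
        if (ctx[t].blocked) {
            printf("cycle %d: thread %d blocked, skipped\n", cycle, t);
            continue;
        }
        ctx[t].pc++;                        /* issue one instruction from thread t */
        printf("cycle %d: thread %d issues instruction %d\n", cycle, t, ctx[t].pc);
    }
    return 0;
}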
MULTIPROCESSOR SYSTEM
• A multiprocessor system is a single computer that includes multiple
processors (computer modules).
• Processors may communicate and cooperate at different levels in
solving a given problem.
• The communication may occur by sending messages from one
processor to the other or by sharing a common memory.
• A multiprocessor system is controlled by one operating system which
provides interaction between processors and their programs at the
process, data set and data element levels.
MULTICOMPUTERS
• There is a group of processors, each of which has a sufficient
amount of local memory.
• The communication between the processors is through messages.
• There is neither a common memory nor a common clock.
• This is also called distributed processing.
GRID COMPUTING
• Grid Computing enables geographically dispersed computers or
computing clusters to dynamically and virtually share applications,
data, and computational resources.
• It uses standard TCP/IP networks to provide transparent access to
technical computing services wherever capacity is available,
transforming technical computing into an information utility that is
available across a department or organization.
Challenges resulting from multi-core
• Relies on effective exploitation of multiple-thread parallelism
  • Need for a parallel computing model and a parallel programming model
• Aggravates the memory wall
  • Memory bandwidth
    ▪ Way to get data out of memory banks
    ▪ Way to get data into the multi-core processor array
  • Memory latency
  • Fragments the L3 cache
• Pins become a strangle point
  ▪ Rate of pin growth projected to slow and flatten
  ▪ Rate of bandwidth per pin (pair) projected to grow slowly
• Requires mechanisms for efficient inter-processor coordination (see the mutual-exclusion sketch below)
  • Synchronization
  • Mutual exclusion
  • Context switching
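A minimal sketch of one such coordination mechanism, mutual exclusion, using POSIX threads (illustrative; the thread count and names are assumptions, not from the slides):

/* Inter-processor coordination on a multi-core: mutual exclusion
   around a shared counter using a mutex. Illustrative sketch. */
#include <pthread.h>
#include <stdio.h>

long counter = 0;
pthread_mutex_t m = PTHREAD_MUTEX_INITIALIZER;

void *add(void *arg) {
    for (int i = 0; i < 100000; i++) {
        pthread_mutex_lock(&m);     /* only one core at a time in the critical section */
        counter++;
        pthread_mutex_unlock(&m);
    }
    return NULL;
}

int main(void) {
    pthread_t a, b;
    pthread_create(&a, NULL, add, NULL);
    pthread_create(&b, NULL, add, NULL);
    pthread_join(a, NULL);
    pthread_join(b, NULL);
    printf("counter = %ld (always 200000 with the mutex)\n", counter);
    return 0;
}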
Advantages of Multi-core
• Cache coherency circuitry can operate at a much higher clock rate
than is possible if the signals have to travel off-chip.
• Signals between different CPUs travel shorter distances, so they degrade less.
• These higher-quality signals allow more data to be sent in a given
time period, since individual signals can be shorter and do not need to
be repeated as often.
• A dual-core processor uses slightly less power than two coupled
single-core processors.
Performance
Introduction
Performance measurement is important:
• Helps us determine whether one processor or computer works faster than another
• Helps us know how much performance improvement has taken place after incorporating some performance-enhancement feature
• Helps us see through the marketing hype!
It provides answers to questions such as:
• Why is some hardware better than others for different programs?
• What factors affect system performance? Hardware, OS, or compiler?
• How does the machine's instruction set affect performance?
Defining Performance in terms of time
Time is the final measure of computer performance:
a computer exhibits higher performance if it executes a program faster.
• Response time (elapsed time, latency): the individual user's concern
  • How long does it take for my job to run?
  • How long does it take to execute my job (start to finish)?
  • How long must I wait for the database query?
• Throughput: the system manager's concern
  • How many jobs can the machine run at once?
  • What is the average execution rate?
  • How much work is getting done?
Execution Time
• Elapsed time
  • Counts everything (disk and memory accesses, waiting for I/O, running other programs, etc.) from start to finish
  • A useful number, but often not good for comparison purposes
  • Elapsed time = CPU time + wait time (I/O, other programs, etc.)
• CPU time
  • Doesn't count waiting for I/O or time spent running other programs
  • Can be divided into user CPU time and system CPU time (OS calls)
  • CPU time = user CPU time + system CPU time
  • Elapsed time = user CPU time + system CPU time + wait time
• Our focus: user CPU time
  • CPU execution time, or simply execution time: the time spent executing the lines of code that are in our program (see the timing sketch below)
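A minimal C sketch of the distinction (illustrative; clock() approximates the CPU time this process has used, while time() measures elapsed wall-clock time):

/* Distinguishing elapsed (wall-clock) time from CPU time.
   clock() counts processor time used by this program; time()
   counts everything, including waiting. Illustrative sketch. */
#include <stdio.h>
#include <time.h>

int main(void) {
    time_t  wall0 = time(NULL);
    clock_t cpu0  = clock();

    volatile double x = 0;
    for (long i = 0; i < 100000000L; i++)   /* CPU-bound work */
        x += i * 0.5;

    double cpu_s  = (double)(clock() - cpu0) / CLOCKS_PER_SEC;  /* ~CPU time */
    double wall_s = difftime(time(NULL), wall0);                /* elapsed time */
    printf("CPU time: %.2f s, elapsed time: %.0f s\n", cpu_s, wall_s);
    return 0;
}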
Measuring performance
For some program running on machine X:

  Performance_X = 1 / Execution time_X

"X is n times faster than Y" means:

  Performance_X / Performance_Y = Execution time_Y / Execution time_X = n
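A worked example with assumed numbers: if a program takes 10 s on X and 15 s on Y, then Performance_X / Performance_Y = 15 / 10 = 1.5, so X is 1.5 times faster than Y.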
The IRON law of processor performance

  Processor performance = Time / Program
    = (Instructions / Program) x (Cycles / Instruction) x (Time / Cycle)
    = (Code size) x (CPI) x (Cycle time)

  Code size  is determined by the architecture    (compiler designer)
  CPI        is determined by the implementation  (processor designer)
  Cycle time is determined by the realization     (chip designer)
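A worked example with assumed numbers: a program executing 2 x 10^9 instructions with an average CPI of 1.5 on a 1 GHz clock (cycle time 1 ns) takes

  Time = (2 x 10^9) x 1.5 x (1 x 10^-9 s) = 3 s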
MIPS and MFLOPS
Problems with MIPS
Problem
Find the number of instructions for each code sequence, the faster
code sequence, and the CPI for each code sequence.
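Since the slide's data table is not reproduced here, a worked example with assumed numbers: suppose instruction classes A, B, C have CPI 1, 2, 3; code sequence 1 executes 2 A, 1 B, and 2 C instructions, while sequence 2 executes 4 A, 1 B, and 1 C.

  Instruction count: sequence 1 = 5, sequence 2 = 6
  Cycle count: sequence 1 = 2(1) + 1(2) + 2(3) = 10; sequence 2 = 4(1) + 1(2) + 1(3) = 9
  Sequence 2 executes more instructions but is faster (9 cycles vs. 10)
  CPI: sequence 1 = 10/5 = 2.0; sequence 2 = 9/6 = 1.5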
Benchmark sample with problem