SlideShare a Scribd company logo
1 of 33
Computer
Architecture
CNE-301
Lecturer: Irfan Ali
Semiconductor Memory
Microprocessors
Later
Generations
LSI
Large
Scale
Integration
VLSI
Very Large
Scale
Integration
ULSI
Ultra Large
Scale
Integration
Semiconductor Memory
Since 1970 semiconductor memory has been through 13 generations
Each generation has provided four times the storage density of the previous generation, accompanied by declining cost per
bit and declining access time
In 1974 the price per bit of semiconductor memory dropped below the price per bit of core
memory
There has been a continuing and rapid decline in memory
cost accompanied by a corresponding increase in physical
memory density
Developments in memory and processor technologies
changed the nature of computers in less than a decade
In 1970 Fairchild produced the first relatively capacious semiconductor memory
Chip was about the size of a
single core
Could hold 256 bits of
memory
Non-destructive Much faster than core
Microprocessors
• The density of elements on processor chips continued to rise
• More and more elements were placed on each chip so that fewer and
fewer chips were needed to construct a single computer processor
• 1971 Intel developed 4004
• First chip to contain all of the components of a CPU on a single chip
• Birth of microprocessor
• 1972 Intel developed 8008
• First 8-bit microprocessor
• 1974 Intel developed 8080
• First general purpose microprocessor
• Faster, has a richer instruction set, has a large addressing capability
Evolution of Intel Microprocessors
a. 1970s Processors
b. 1980s Processors
Evolution of Intel Microprocessors
c. 1990s Processors
d. Recent Processors
Microprocessor Speed
Pipelining
Branch
prediction
Data flow
analysis
Speculative
execution
•Processor moves data or instructions into a
conceptual pipe with all stages of the pipe
processing simultaneously
•Processor looks ahead in the instruction code
fetched from memory and predicts which
branches, or groups of instructions, are likely to
be processed next
•Processor analyzes which instructions are
dependent on each other’s results, or data, to
create an optimized schedule of instructions
•Using branch prediction and data flow analysis,
some processors speculatively execute
instructions ahead of their actual appearance in
the program execution, holding the results in
temporary locations, keeping execution engines
as busy as possible
Techniques built into contemporary processors include:
Performance
Balance
Increase the number of
bits that are retrieved
at one time by making
DRAMs “wider” rather
than “deeper” and by
using wide bus data
paths
Change the DRAM
interface to make it
more efficient by
including a cache or
other buffering scheme
on the DRAM chip
Reduce the frequency
of memory access by
incorporating
increasingly complex
and efficient cache
structures between the
processor and main
memory
Increase the
interconnect bandwidth
between processors and
memory by using higher
speed buses and a
hierarchy of buses to
buffer and structure
data flow
 Adjust the organization and
architecture to compensate
for the mismatch among the
capabilities of the various
components
 Architectural examples
include:
Typical I/O Device Data Rates
Improvements in Chip Organization and
Architecture
• Increase hardware speed of processor
• Fundamentally due to shrinking logic gate size
• More gates, packed more tightly, increasing clock rate
• Propagation time for signals reduced
• Increase size and speed of caches
• Dedicating part of processor chip
• Cache access times drop significantly
• Change processor organization and architecture
• Increase effective speed of instruction execution
• Parallelism
Processor
Trends
The use of multiple processors
on the same chip provides the
potential to increase
performance without
increasing the clock rate
Strategy is to use two simpler
processors on the chip rather
than one more complex
processor
With two processors larger
caches are justified
As caches became larger it
made performance sense to
create two and then three
levels of cache on a chip
Multicore
Many Integrated Core (MIC)
Graphics Processing Unit (GPU)
MIC
• Leap in performance as well as
the challenges in developing
software to exploit such a large
number of cores
• The multicore and MIC strategy
involves a homogeneous
collection of general purpose
processors on a single chip
GPU
• Core designed to perform
parallel operations on graphics
data
• Traditionally found on a plug-in
graphics card, it is used to
encode and render 2D and 3D
graphics as well as process
video
• Used as vector processors for a
variety of applications that
require repetitive
computations
x86 Architecture
 Results of decades of design effort on
complex instruction set computers (CISCs)
 Excellent example of CISC design
 Incorporates the sophisticated design
principles once found only on mainframes
and supercomputers
 An alternative approach to processor
design is the reduced instruction set
computer (RISC)
 The ARM architecture is used in a wide
variety of embedded systems and is one of
the most powerful and best designed RISC
based systems on the market
 In terms of market share Intel is ranked as
the number one maker of microprocessors
for non-embedded systems
Overview
CISC
RISC
Intel
ARM
x86 Evolution
 8080
 First general purpose microprocessor
 8-bit machine with an 8-bit data path to
memory
 Used in the first personal computer (Altair)
 8086
 16-bit machine
 Used an instruction cache, or queue
 First appearance of the x86 architecture
 8088
 used in IBM’s first personal computer
 80286
 Enabled addressing a 16-MByte memory
instead of just 1 MByte
 80386
 Intel’s first 32-bit machine
 First Intel processor to support multitasking
 80486
 More sophisticated cache technology and
instruction pipelining
 Built-in math coprocessor
x86 Evolution - Pentium
Pentium
•Superscalar
•Multiple
instructions
executed in
parallel
Pentium Pro
•Increased
superscalar
organization
•Aggressive
register
renaming
•Branch
prediction
•Data flow
analysis
•Speculative
execution
Pentium II
•MMX
technology
•Designed
specifically to
process video,
audio, and
graphics data
Pentium III
•Additional
floating-point
instructions to
support 3D
graphics
software
Pentium 4
•Includes
additional
floating-point
and other
enhancements
for multimedia
x86 Evolution (continued)
• Core
• First Intel x86 microprocessor
with a dual core, referring to the
implementation of two
processors on a single chip
• Core 2
• Extends the architecture to 64
bits
• Recent Core offerings have up to
10 processors per chip
Instruction set
architecture is
backward compatible
with earlier versions
X86 architecture
continues to
dominate the
processor market
outside of
embedded
systems
General definition:
“A combination of computer
hardware and software, and perhaps
additional mechanical or other parts,
designed to perform a dedicated
function. In many cases, embedded
systems are part of a larger system or
product, as in the case of an antilock
braking system in a car.”
Embedded
Systems
Table 2.7
Examples of Embedded Systems and Their Markets
Embedded Systems
Small to large systems,
implying different cost
constraints and different
needs for optimization and
reuse
Relaxed to very strict
requirements and
combinations of different
quality requirements with
respect to safety, reliability,
real-time and flexibility
Short to long life times
Different environmental
conditions in terms of
radiation, vibrations, and
humidity
Different application
characteristics resulting in
static versus dynamic
loads, slow to fast speed,
compute versus interface
intensive tasks, and/or
combinations thereof
Different models of
computation ranging from
discrete event systems to
hybrid systems
Requirements and Constraints
Figure 2.12
Possible Organization of an Embedded System
Acorn RISC Machine (ARM)
• Family of RISC-based
microprocessors and
microcontrollers
• Designs microprocessor and
multicore architectures and
licenses them to manufacturers
• Chips are high-speed processors
that are known for their small
die size and low power
requirements
• Widely used in PDAs and other
handheld devices
• Chips are the processors in iPod
and iPhone devices
• Most widely used embedded
processor architecture
• Most widely used processor
architecture of any kind
A
R
M
E
v
o
l
u
t
i
o
n
DSP = digital signal processor SoC = system on a chip
ARM Design Categories
• ARM processors are designed to meet the needs of three
system categories:
 Application platforms
 Devices running open operating
systems including Linux, Palm OS,
Symbian OS, and Windows CE in
wireless, consumer entertainment
and digital imaging applications
 Embedded real-time systems
 Systems for storage, automotive
body and power-train, industrial,
and networking applications
 Secure applications
 Smart cards, SIM cards, and
payment terminals
System Clock
Benchmarks
For example, consider this high-level language statement:
A = B + C /* assume all quantities in main memory */
With a traditional instruction set architecture, referred to as a complex instruction
set computer (CISC), this instruction can be compiled into one processor
instruction:
add mem(B), mem(C), mem (A)
On a typical RISC machine, the compilation would look something
like this:
load mem(B), reg(1);
load mem(C), reg(2);
add reg(1), reg(2), reg(3);
store reg(3), mem (A)
Desirable Benchmark Characteristics
Written in a high-level language, making it portable
across different machines
Representative of a particular kind of programming
style, such as system programming, numerical
programming, or commercial programming
Can be measured easily
Has wide distribution
System Performance Evaluation
Corporation (SPEC)
• Benchmark suite
• A collection of programs, defined in a high-level language
• Attempts to provide a representative test of a computer in a particular
application or system programming area
• SPEC
• An industry consortium
• Defines and maintains the best known collection of benchmark suites
• Performance measurements are widely used for comparison and research
purposes
SPEC
CPU2006
• Best known SPEC benchmark
suite
• Industry standard suite for
processor intensive applications
• Appropriate for measuring
performance for applications that
spend most of their time doing
computation rather than I/O
• Consists of 17 floating point
programs written in C, C++, and
Fortran and 12 integer programs
written in C and C++
• Suite contains over 3 million lines
of code
• Fifth generation of processor
intensive suites from SPEC
Amdahl’s Law
• Gene Amdahl [AMDA67]
• Deals with the potential speedup
of a program using multiple
processors compared to a single
processor
• Illustrates the problems facing
industry in the development of
multi-core machines
• Software must be adapted to a
highly parallel execution
environment to exploit the power
of parallel processing
• Can be generalized to evaluate
and design technical
improvement in a computer
system
Amdahl’s Law
Little’s Law
• Fundamental and simple relation with broad applications
• Can be applied to almost any system that is statistically in steady
state, and in which there is no leakage
• Queuing system
• If server is idle an item is served immediately, otherwise an arriving item
joins a queue
• There can be a single queue for a single server or for multiple servers, or
multiples queues with one being for each of multiple servers
• Average number of items in a queuing system equals the average
rate at which items arrive multiplied by the time that an item
spends in the system
• Relationship requires very few assumptions
• Because of its simplicity and generality it is extremely useful
Summary
Chapter 2
• First generation computers
• Vacuum tubes
• Second generation computers
• Transistors
• Third generation computers
• Integrated circuits
• Performance designs
• Microprocessor speed
• Performance balance
• Chip organization and
architecture
Computer Evolution and
Performance
• Multi-core
• MICs
• GPGPUs
• Evolution of the Intel x86
• Embedded systems
• ARM evolution
• Performance assessment
• Clock speed and instructions per
second
• Benchmarks
• Amdahl’s Law
• Little’s Law

More Related Content

What's hot

Fundamentals of Computer Design including performance measurements & quantita...
Fundamentals of Computer Design including performance measurements & quantita...Fundamentals of Computer Design including performance measurements & quantita...
Fundamentals of Computer Design including performance measurements & quantita...Gaditek
 
1 Computer Architecture
1 Computer Architecture1 Computer Architecture
1 Computer Architecturefika sweety
 
Bus Standards and Networking
Bus Standards and NetworkingBus Standards and Networking
Bus Standards and NetworkingPrabu U
 
02 computer evolution and performance
02 computer evolution and performance02 computer evolution and performance
02 computer evolution and performanceSher Shah Merkhel
 
Computer architecture
Computer architectureComputer architecture
Computer architectureRishabha Garg
 
Microprocessor.ppt
Microprocessor.pptMicroprocessor.ppt
Microprocessor.pptsafia kalwar
 
Computer Architecture and organization
Computer Architecture and organizationComputer Architecture and organization
Computer Architecture and organizationBadrinath Kadam
 
Intel 8th generation and 7th gen microprocessor full details especially for t...
Intel 8th generation and 7th gen microprocessor full details especially for t...Intel 8th generation and 7th gen microprocessor full details especially for t...
Intel 8th generation and 7th gen microprocessor full details especially for t...Chessin Chacko
 
4th generation intel core processoer
4th generation intel core processoer4th generation intel core processoer
4th generation intel core processoerAanamika Nath
 
Intel core processors
Intel core processorsIntel core processors
Intel core processorsSidney Field
 
Enhanced Embedded Linux Board Support Package Field Upgrade – A Cost Effectiv...
Enhanced Embedded Linux Board Support Package Field Upgrade – A Cost Effectiv...Enhanced Embedded Linux Board Support Package Field Upgrade – A Cost Effectiv...
Enhanced Embedded Linux Board Support Package Field Upgrade – A Cost Effectiv...ijesajournal
 
Computer Architecture
Computer  ArchitectureComputer  Architecture
Computer ArchitectureAmit Kumar
 
Processor powerpoint
Processor powerpointProcessor powerpoint
Processor powerpointbrennan_jame
 

What's hot (20)

Fundamentals of Computer Design including performance measurements & quantita...
Fundamentals of Computer Design including performance measurements & quantita...Fundamentals of Computer Design including performance measurements & quantita...
Fundamentals of Computer Design including performance measurements & quantita...
 
1 Computer Architecture
1 Computer Architecture1 Computer Architecture
1 Computer Architecture
 
Bus Standards and Networking
Bus Standards and NetworkingBus Standards and Networking
Bus Standards and Networking
 
02 computer evolution and performance
02 computer evolution and performance02 computer evolution and performance
02 computer evolution and performance
 
computer architecture.
computer architecture.computer architecture.
computer architecture.
 
Computer architecture
Computer architectureComputer architecture
Computer architecture
 
CS6303 - Computer Architecture
CS6303 - Computer ArchitectureCS6303 - Computer Architecture
CS6303 - Computer Architecture
 
09. storage-part-1
09. storage-part-109. storage-part-1
09. storage-part-1
 
Microprocessor.ppt
Microprocessor.pptMicroprocessor.ppt
Microprocessor.ppt
 
Computer Architecture and organization
Computer Architecture and organizationComputer Architecture and organization
Computer Architecture and organization
 
Intel 8th generation and 7th gen microprocessor full details especially for t...
Intel 8th generation and 7th gen microprocessor full details especially for t...Intel 8th generation and 7th gen microprocessor full details especially for t...
Intel 8th generation and 7th gen microprocessor full details especially for t...
 
Computer architecture
Computer architectureComputer architecture
Computer architecture
 
4th generation intel core processoer
4th generation intel core processoer4th generation intel core processoer
4th generation intel core processoer
 
Intel Core i7 Processors
Intel Core i7 ProcessorsIntel Core i7 Processors
Intel Core i7 Processors
 
Intel core processors
Intel core processorsIntel core processors
Intel core processors
 
Enhanced Embedded Linux Board Support Package Field Upgrade – A Cost Effectiv...
Enhanced Embedded Linux Board Support Package Field Upgrade – A Cost Effectiv...Enhanced Embedded Linux Board Support Package Field Upgrade – A Cost Effectiv...
Enhanced Embedded Linux Board Support Package Field Upgrade – A Cost Effectiv...
 
Computer Architecture
Computer  ArchitectureComputer  Architecture
Computer Architecture
 
Ivy bridge
Ivy bridgeIvy bridge
Ivy bridge
 
Processor powerpoint
Processor powerpointProcessor powerpoint
Processor powerpoint
 
generations of computer
generations of computergenerations of computer
generations of computer
 

Similar to Ca lecture 03

Intel new processors
Intel new processorsIntel new processors
Intel new processorszaid_b
 
software engineering CSE675_01_Introduction.ppt
software engineering CSE675_01_Introduction.pptsoftware engineering CSE675_01_Introduction.ppt
software engineering CSE675_01_Introduction.pptSomnathMule5
 
CSE675_01_Introduction.ppt
CSE675_01_Introduction.pptCSE675_01_Introduction.ppt
CSE675_01_Introduction.pptAshokRachapalli1
 
CSE675_01_Introduction.ppt
CSE675_01_Introduction.pptCSE675_01_Introduction.ppt
CSE675_01_Introduction.pptAshokRachapalli1
 
Overview_Of_Intel_x86[1].pdf
Overview_Of_Intel_x86[1].pdfOverview_Of_Intel_x86[1].pdf
Overview_Of_Intel_x86[1].pdfMunazza63
 
IT Book of Knowledge
IT Book of KnowledgeIT Book of Knowledge
IT Book of KnowledgePhil Primeau
 
finaldraft-intelcorei5processorsarchitecture-130207093535-phpapp01.pdf
finaldraft-intelcorei5processorsarchitecture-130207093535-phpapp01.pdffinaldraft-intelcorei5processorsarchitecture-130207093535-phpapp01.pdf
finaldraft-intelcorei5processorsarchitecture-130207093535-phpapp01.pdfNazarAhmadAlkhidir
 
An introduction to digital signal processors 1
An introduction to digital signal processors 1An introduction to digital signal processors 1
An introduction to digital signal processors 1Hossam Hassan
 
Network Processor - 2021.pptx
Network Processor - 2021.pptxNetwork Processor - 2021.pptx
Network Processor - 2021.pptxssuserdfb2da
 
Slot29-CH18-MultiCoreComputers-18-slides (1).pptx
Slot29-CH18-MultiCoreComputers-18-slides (1).pptxSlot29-CH18-MultiCoreComputers-18-slides (1).pptx
Slot29-CH18-MultiCoreComputers-18-slides (1).pptxvun24122002
 
Computer Hardware & Software Lab Manual 3
Computer Hardware & Software Lab Manual 3Computer Hardware & Software Lab Manual 3
Computer Hardware & Software Lab Manual 3senayteklay
 
Bharath technical seminar.pptx
Bharath technical seminar.pptxBharath technical seminar.pptx
Bharath technical seminar.pptxMadhav Reddy
 
Area Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorArea Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorIOSR Journals
 
Risc and cisc eugene clewlow
Risc and cisc   eugene clewlowRisc and cisc   eugene clewlow
Risc and cisc eugene clewlowChaudhary Manzoor
 
Risc and cisc eugene clewlow
Risc and cisc   eugene clewlowRisc and cisc   eugene clewlow
Risc and cisc eugene clewlowkaran saini
 
18CS44-MODULE1-PPT.pptx
18CS44-MODULE1-PPT.pptx18CS44-MODULE1-PPT.pptx
18CS44-MODULE1-PPT.pptxKokilaK25
 
03 - Lecture Systme Unit Components.pptx
03 - Lecture Systme Unit Components.pptx03 - Lecture Systme Unit Components.pptx
03 - Lecture Systme Unit Components.pptxmomandayaz306
 

Similar to Ca lecture 03 (20)

Intel new processors
Intel new processorsIntel new processors
Intel new processors
 
software engineering CSE675_01_Introduction.ppt
software engineering CSE675_01_Introduction.pptsoftware engineering CSE675_01_Introduction.ppt
software engineering CSE675_01_Introduction.ppt
 
CSE675_01_Introduction.ppt
CSE675_01_Introduction.pptCSE675_01_Introduction.ppt
CSE675_01_Introduction.ppt
 
CSE675_01_Introduction.ppt
CSE675_01_Introduction.pptCSE675_01_Introduction.ppt
CSE675_01_Introduction.ppt
 
Overview_Of_Intel_x86[1].pdf
Overview_Of_Intel_x86[1].pdfOverview_Of_Intel_x86[1].pdf
Overview_Of_Intel_x86[1].pdf
 
IT Book of Knowledge
IT Book of KnowledgeIT Book of Knowledge
IT Book of Knowledge
 
finaldraft-intelcorei5processorsarchitecture-130207093535-phpapp01.pdf
finaldraft-intelcorei5processorsarchitecture-130207093535-phpapp01.pdffinaldraft-intelcorei5processorsarchitecture-130207093535-phpapp01.pdf
finaldraft-intelcorei5processorsarchitecture-130207093535-phpapp01.pdf
 
Module 1 unit 3
Module 1  unit 3Module 1  unit 3
Module 1 unit 3
 
Processors selection
Processors selectionProcessors selection
Processors selection
 
An introduction to digital signal processors 1
An introduction to digital signal processors 1An introduction to digital signal processors 1
An introduction to digital signal processors 1
 
Network Processor - 2021.pptx
Network Processor - 2021.pptxNetwork Processor - 2021.pptx
Network Processor - 2021.pptx
 
Slot29-CH18-MultiCoreComputers-18-slides (1).pptx
Slot29-CH18-MultiCoreComputers-18-slides (1).pptxSlot29-CH18-MultiCoreComputers-18-slides (1).pptx
Slot29-CH18-MultiCoreComputers-18-slides (1).pptx
 
Computer Hardware & Software Lab Manual 3
Computer Hardware & Software Lab Manual 3Computer Hardware & Software Lab Manual 3
Computer Hardware & Software Lab Manual 3
 
Bharath technical seminar.pptx
Bharath technical seminar.pptxBharath technical seminar.pptx
Bharath technical seminar.pptx
 
Area Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorArea Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips Processor
 
Risc and cisc eugene clewlow
Risc and cisc   eugene clewlowRisc and cisc   eugene clewlow
Risc and cisc eugene clewlow
 
Risc and cisc eugene clewlow
Risc and cisc   eugene clewlowRisc and cisc   eugene clewlow
Risc and cisc eugene clewlow
 
18CS44-MODULE1-PPT.pptx
18CS44-MODULE1-PPT.pptx18CS44-MODULE1-PPT.pptx
18CS44-MODULE1-PPT.pptx
 
03 - Lecture Systme Unit Components.pptx
03 - Lecture Systme Unit Components.pptx03 - Lecture Systme Unit Components.pptx
03 - Lecture Systme Unit Components.pptx
 
Lecture 10
Lecture 10Lecture 10
Lecture 10
 

More from Haris456

Hazards Computer Architecture
Hazards Computer ArchitectureHazards Computer Architecture
Hazards Computer ArchitectureHaris456
 
Pipelining of Processors Computer Architecture
Pipelining of  Processors Computer ArchitecturePipelining of  Processors Computer Architecture
Pipelining of Processors Computer ArchitectureHaris456
 
Computer Architecture Vector Computer
Computer Architecture Vector ComputerComputer Architecture Vector Computer
Computer Architecture Vector ComputerHaris456
 
Multithreading computer architecture
 Multithreading computer architecture  Multithreading computer architecture
Multithreading computer architecture Haris456
 
Graphics processing uni computer archiecture
Graphics processing uni computer archiectureGraphics processing uni computer archiecture
Graphics processing uni computer archiectureHaris456
 
Computer Memory Hierarchy Computer Architecture
Computer Memory Hierarchy Computer ArchitectureComputer Memory Hierarchy Computer Architecture
Computer Memory Hierarchy Computer ArchitectureHaris456
 
Computer Architecture Instruction-Level paraallel processors
Computer Architecture Instruction-Level paraallel processorsComputer Architecture Instruction-Level paraallel processors
Computer Architecture Instruction-Level paraallel processorsHaris456
 
Pipeline Computer Architecture
Pipeline Computer ArchitecturePipeline Computer Architecture
Pipeline Computer ArchitectureHaris456
 
Addressing mode Computer Architecture
Addressing mode  Computer ArchitectureAddressing mode  Computer Architecture
Addressing mode Computer ArchitectureHaris456
 
Instruction Set Architecture
Instruction  Set ArchitectureInstruction  Set Architecture
Instruction Set ArchitectureHaris456
 
Computer Architecture
Computer ArchitectureComputer Architecture
Computer ArchitectureHaris456
 

More from Haris456 (11)

Hazards Computer Architecture
Hazards Computer ArchitectureHazards Computer Architecture
Hazards Computer Architecture
 
Pipelining of Processors Computer Architecture
Pipelining of  Processors Computer ArchitecturePipelining of  Processors Computer Architecture
Pipelining of Processors Computer Architecture
 
Computer Architecture Vector Computer
Computer Architecture Vector ComputerComputer Architecture Vector Computer
Computer Architecture Vector Computer
 
Multithreading computer architecture
 Multithreading computer architecture  Multithreading computer architecture
Multithreading computer architecture
 
Graphics processing uni computer archiecture
Graphics processing uni computer archiectureGraphics processing uni computer archiecture
Graphics processing uni computer archiecture
 
Computer Memory Hierarchy Computer Architecture
Computer Memory Hierarchy Computer ArchitectureComputer Memory Hierarchy Computer Architecture
Computer Memory Hierarchy Computer Architecture
 
Computer Architecture Instruction-Level paraallel processors
Computer Architecture Instruction-Level paraallel processorsComputer Architecture Instruction-Level paraallel processors
Computer Architecture Instruction-Level paraallel processors
 
Pipeline Computer Architecture
Pipeline Computer ArchitecturePipeline Computer Architecture
Pipeline Computer Architecture
 
Addressing mode Computer Architecture
Addressing mode  Computer ArchitectureAddressing mode  Computer Architecture
Addressing mode Computer Architecture
 
Instruction Set Architecture
Instruction  Set ArchitectureInstruction  Set Architecture
Instruction Set Architecture
 
Computer Architecture
Computer ArchitectureComputer Architecture
Computer Architecture
 

Recently uploaded

Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 

Recently uploaded (20)

Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 

Ca lecture 03

  • 3. Semiconductor Memory Since 1970 semiconductor memory has been through 13 generations Each generation has provided four times the storage density of the previous generation, accompanied by declining cost per bit and declining access time In 1974 the price per bit of semiconductor memory dropped below the price per bit of core memory There has been a continuing and rapid decline in memory cost accompanied by a corresponding increase in physical memory density Developments in memory and processor technologies changed the nature of computers in less than a decade In 1970 Fairchild produced the first relatively capacious semiconductor memory Chip was about the size of a single core Could hold 256 bits of memory Non-destructive Much faster than core
  • 4. Microprocessors • The density of elements on processor chips continued to rise • More and more elements were placed on each chip so that fewer and fewer chips were needed to construct a single computer processor • 1971 Intel developed 4004 • First chip to contain all of the components of a CPU on a single chip • Birth of microprocessor • 1972 Intel developed 8008 • First 8-bit microprocessor • 1974 Intel developed 8080 • First general purpose microprocessor • Faster, has a richer instruction set, has a large addressing capability
  • 5. Evolution of Intel Microprocessors a. 1970s Processors b. 1980s Processors
  • 6. Evolution of Intel Microprocessors c. 1990s Processors d. Recent Processors
  • 7. Microprocessor Speed Pipelining Branch prediction Data flow analysis Speculative execution •Processor moves data or instructions into a conceptual pipe with all stages of the pipe processing simultaneously •Processor looks ahead in the instruction code fetched from memory and predicts which branches, or groups of instructions, are likely to be processed next •Processor analyzes which instructions are dependent on each other’s results, or data, to create an optimized schedule of instructions •Using branch prediction and data flow analysis, some processors speculatively execute instructions ahead of their actual appearance in the program execution, holding the results in temporary locations, keeping execution engines as busy as possible Techniques built into contemporary processors include:
  • 8. Performance Balance Increase the number of bits that are retrieved at one time by making DRAMs “wider” rather than “deeper” and by using wide bus data paths Change the DRAM interface to make it more efficient by including a cache or other buffering scheme on the DRAM chip Reduce the frequency of memory access by incorporating increasingly complex and efficient cache structures between the processor and main memory Increase the interconnect bandwidth between processors and memory by using higher speed buses and a hierarchy of buses to buffer and structure data flow  Adjust the organization and architecture to compensate for the mismatch among the capabilities of the various components  Architectural examples include:
  • 9. Typical I/O Device Data Rates
  • 10. Improvements in Chip Organization and Architecture • Increase hardware speed of processor • Fundamentally due to shrinking logic gate size • More gates, packed more tightly, increasing clock rate • Propagation time for signals reduced • Increase size and speed of caches • Dedicating part of processor chip • Cache access times drop significantly • Change processor organization and architecture • Increase effective speed of instruction execution • Parallelism
  • 12. The use of multiple processors on the same chip provides the potential to increase performance without increasing the clock rate Strategy is to use two simpler processors on the chip rather than one more complex processor With two processors larger caches are justified As caches became larger it made performance sense to create two and then three levels of cache on a chip Multicore
  • 13. Many Integrated Core (MIC) Graphics Processing Unit (GPU) MIC • Leap in performance as well as the challenges in developing software to exploit such a large number of cores • The multicore and MIC strategy involves a homogeneous collection of general purpose processors on a single chip GPU • Core designed to perform parallel operations on graphics data • Traditionally found on a plug-in graphics card, it is used to encode and render 2D and 3D graphics as well as process video • Used as vector processors for a variety of applications that require repetitive computations
  • 14. x86 Architecture  Results of decades of design effort on complex instruction set computers (CISCs)  Excellent example of CISC design  Incorporates the sophisticated design principles once found only on mainframes and supercomputers  An alternative approach to processor design is the reduced instruction set computer (RISC)  The ARM architecture is used in a wide variety of embedded systems and is one of the most powerful and best designed RISC based systems on the market  In terms of market share Intel is ranked as the number one maker of microprocessors for non-embedded systems Overview CISC RISC Intel ARM
  • 15. x86 Evolution  8080  First general purpose microprocessor  8-bit machine with an 8-bit data path to memory  Used in the first personal computer (Altair)  8086  16-bit machine  Used an instruction cache, or queue  First appearance of the x86 architecture  8088  used in IBM’s first personal computer  80286  Enabled addressing a 16-MByte memory instead of just 1 MByte  80386  Intel’s first 32-bit machine  First Intel processor to support multitasking  80486  More sophisticated cache technology and instruction pipelining  Built-in math coprocessor
  • 16. x86 Evolution - Pentium Pentium •Superscalar •Multiple instructions executed in parallel Pentium Pro •Increased superscalar organization •Aggressive register renaming •Branch prediction •Data flow analysis •Speculative execution Pentium II •MMX technology •Designed specifically to process video, audio, and graphics data Pentium III •Additional floating-point instructions to support 3D graphics software Pentium 4 •Includes additional floating-point and other enhancements for multimedia
  • 17. x86 Evolution (continued) • Core • First Intel x86 microprocessor with a dual core, referring to the implementation of two processors on a single chip • Core 2 • Extends the architecture to 64 bits • Recent Core offerings have up to 10 processors per chip Instruction set architecture is backward compatible with earlier versions X86 architecture continues to dominate the processor market outside of embedded systems
  • 18. General definition: “A combination of computer hardware and software, and perhaps additional mechanical or other parts, designed to perform a dedicated function. In many cases, embedded systems are part of a larger system or product, as in the case of an antilock braking system in a car.” Embedded Systems
  • 19. Table 2.7 Examples of Embedded Systems and Their Markets
  • 20. Embedded Systems Small to large systems, implying different cost constraints and different needs for optimization and reuse Relaxed to very strict requirements and combinations of different quality requirements with respect to safety, reliability, real-time and flexibility Short to long life times Different environmental conditions in terms of radiation, vibrations, and humidity Different application characteristics resulting in static versus dynamic loads, slow to fast speed, compute versus interface intensive tasks, and/or combinations thereof Different models of computation ranging from discrete event systems to hybrid systems Requirements and Constraints
  • 21. Figure 2.12 Possible Organization of an Embedded System
  • 22. Acorn RISC Machine (ARM) • Family of RISC-based microprocessors and microcontrollers • Designs microprocessor and multicore architectures and licenses them to manufacturers • Chips are high-speed processors that are known for their small die size and low power requirements • Widely used in PDAs and other handheld devices • Chips are the processors in iPod and iPhone devices • Most widely used embedded processor architecture • Most widely used processor architecture of any kind
  • 23. A R M E v o l u t i o n DSP = digital signal processor SoC = system on a chip
  • 24. ARM Design Categories • ARM processors are designed to meet the needs of three system categories:  Application platforms  Devices running open operating systems including Linux, Palm OS, Symbian OS, and Windows CE in wireless, consumer entertainment and digital imaging applications  Embedded real-time systems  Systems for storage, automotive body and power-train, industrial, and networking applications  Secure applications  Smart cards, SIM cards, and payment terminals
  • 26. Benchmarks For example, consider this high-level language statement: A = B + C /* assume all quantities in main memory */ With a traditional instruction set architecture, referred to as a complex instruction set computer (CISC), this instruction can be compiled into one processor instruction: add mem(B), mem(C), mem (A) On a typical RISC machine, the compilation would look something like this: load mem(B), reg(1); load mem(C), reg(2); add reg(1), reg(2), reg(3); store reg(3), mem (A)
  • 27. Desirable Benchmark Characteristics Written in a high-level language, making it portable across different machines Representative of a particular kind of programming style, such as system programming, numerical programming, or commercial programming Can be measured easily Has wide distribution
  • 28. System Performance Evaluation Corporation (SPEC) • Benchmark suite • A collection of programs, defined in a high-level language • Attempts to provide a representative test of a computer in a particular application or system programming area • SPEC • An industry consortium • Defines and maintains the best known collection of benchmark suites • Performance measurements are widely used for comparison and research purposes
  • 29. SPEC CPU2006 • Best known SPEC benchmark suite • Industry standard suite for processor intensive applications • Appropriate for measuring performance for applications that spend most of their time doing computation rather than I/O • Consists of 17 floating point programs written in C, C++, and Fortran and 12 integer programs written in C and C++ • Suite contains over 3 million lines of code • Fifth generation of processor intensive suites from SPEC
  • 30. Amdahl’s Law • Gene Amdahl [AMDA67] • Deals with the potential speedup of a program using multiple processors compared to a single processor • Illustrates the problems facing industry in the development of multi-core machines • Software must be adapted to a highly parallel execution environment to exploit the power of parallel processing • Can be generalized to evaluate and design technical improvement in a computer system
  • 32. Little’s Law • Fundamental and simple relation with broad applications • Can be applied to almost any system that is statistically in steady state, and in which there is no leakage • Queuing system • If server is idle an item is served immediately, otherwise an arriving item joins a queue • There can be a single queue for a single server or for multiple servers, or multiples queues with one being for each of multiple servers • Average number of items in a queuing system equals the average rate at which items arrive multiplied by the time that an item spends in the system • Relationship requires very few assumptions • Because of its simplicity and generality it is extremely useful
  • 33. Summary Chapter 2 • First generation computers • Vacuum tubes • Second generation computers • Transistors • Third generation computers • Integrated circuits • Performance designs • Microprocessor speed • Performance balance • Chip organization and architecture Computer Evolution and Performance • Multi-core • MICs • GPGPUs • Evolution of the Intel x86 • Embedded systems • ARM evolution • Performance assessment • Clock speed and instructions per second • Benchmarks • Amdahl’s Law • Little’s Law