An Architecture Perspective On Modern Microprocessors And GPU- AbhijeetNawal3/25/20111AN ARCHITECTURE PERSPECTIVE
Agenda	INTRODUCTIONINTEL’S NETBURST ARCHITECTUREINTEL’S CORE ARCHITECTUREINTEL’S NEHALEM ARCHITECTURESNEAK PEAK AT NVIDIA TEGRA GPUREFERENCES 3/25/20112AN ARCHITECTURE PERSPECTIVE
IntroductionSuper Scalar Homogeneous Processors From Intel.
Performance = Frequency x IPC
Power = Dynamic Capacitance x Volts x Volts x Frequency.
Dynamic Capacitance is the ratio of the electrostatic charge on a conductor to the potential difference between the conductors required to maintain that charge.
Higher the No Of Pipeline Stages more Instructions in Pipeline.
Higher No Of Pipeline Stages reduces IPC as n/{k+(n-1)} .
Low IPC is offset by increasing the clock rate and reducing stage time.
Each Instruction is CISC based so decodes into micro operations.3/25/20113AN ARCHITECTURE PERSPECTIVE
Introduction…Streaming SIMD Extensions:
SSE instructions are 128-bit integer arithmetic and 128-bit SIMD double precision  floating-point  operations.
They reduce  the  overall  number  of  instructions  required  to execute a particular program task.
They accelerate a broad range of applications, including video, speech and image, photo processing, encryption,  financial,  engineering and  scientific  applications.
Predecode phase:
Before Instruction pipleline fetch and decode phase.
Bundles instructions to be parallelly executed.
Instructions are appended with bits after fetching from memory as they enter the instruction cache.
This unit also has to thus take care of analyzing the structural, control and data hazards. 3/25/2011AN ARCHITECTURE PERSPECTIVE4
Intel Architectures: Netburst3/25/20115AN ARCHITECTURE PERSPECTIVE
NetBurst Architecture3/25/20116AN ARCHITECTURE PERSPECTIVE
Netburst Microarchitecture3/25/20117AN ARCHITECTURE PERSPECTIVE
Features of Netburst ArchitectureHyper Threading:
A processor appears as two logical processors.
Each logical processor has its own set of registers, APIC( Advanced programmable interrupt controller).
Increases resource utilization and improve performance.
Introduced SSE (Streaming SIMD Extensions)3.0
Added some DSP-oriented instructions .
And some process (thread) management instructions.3/25/20118AN ARCHITECTURE PERSPECTIVE
Features of Netburst…Hyper Pipelined Technology:
20 stage pipeline.
Branch Mispredictions can lead to very costly pipeline flushes.
Techniques to hide stall penalties are parallel   execution, buffering and speculation.
Three Major Components:
In-Order Issue Front End
Out-Of-Order Superscalar Execution Core
In-Order Retirement Unit 3/25/2011AN ARCHITECTURE PERSPECTIVE9
Features of Netburst…In-Order Issue Front End:
Two major parts:
Fetch/Decode Unit
Execution Trace Cache
Fetch/ Decode Unit:
Prefetches IA-32 instructions that are likely to be executed. Details in Prefetching.
Fetches instructions that have not already been prefetched.
Decodes instructions into µops and builds trace.3/25/2011AN ARCHITECTURE PERSPECTIVE10
Features of Netburst…Execution Trace Cache:
Middle-man between First Decode Stage and Execution Stage

Modern INTEL Microprocessors' Architecture and Sneak Peak at NVIDIA TEGRA GPU

Editor's Notes

  • #25 Intel faced problems of power dissipation in Netburst with the high clock speeds. Hence it abandoned it and forwarded to Core