GPU Computing

Graphics processing units - powerful, programmable, and highly parallel - are increasingly targeting general-purpose computing applications. GPU ComputingPresented By:Khan Muhammad Nafee Mostafa0507007, Dept of CSE, KUET

GPU ComputingJ. D. OwensM. HoustonD. LuebkeS. GreenJ. E. StoneJ. C. PhillipsProceedings of the IEEE | Vol 96, No. 5 | May 2008We would be concentrating on,What is GPU ComputingWhy GPU ComputingGPU Architecture and EvolutionGPU Computing ModelSoftware Environment Future

GPU for General Purpose ComputingWhat is GPU Computing ?

What is GPU Computing ?GPU computing is the use of a GPU to do general purpose scientific and engineering computingCPU and GPU together in a heterogeneous computing model.Sequential part of the application runs on the CPU and the computationally-intensive part runs on the GPU. From the user’s perspective, the application just runs faster because it is using the high-performance of the GPU to boost performance.

Over the past few years, the GPU has evolved from a fixed-function special-purpose processor into a full-fledged parallel programmable processor with additional fixed-function special-purpose functionalityWhy GPU Computing…

GPU for Non-Graphic AppsThe GPU is designed for a particular class of applications with the following characteristics,Computational requirements are largeParallelism is substantialThroughput is more important than latencya growing community has identified other applications with similar characteristics and successfully mapped these applications onto the GPU

GPU extends its hand towards CPU for performanceParallelism is the future of computingMany applications have to process huge set of data following same functionsSeveral stream processors can execute same set of instructions on different data sets and give a higher throughput If GPU take some share of computation load from CPU, many applications can be benefitted in speed-up

GPU is now turned into a programmable engineGPU Architecture and Evolution

GPU PipelineAvailable operations are configurable but not programmable

All GPU programs must be structured in this way: many parallel elements, each processed in parallel by a single programGPU Computing Model

Computing on the GPUProgramming a GPU for Graphicsprogrammer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.

Computing on the GPUProgramming a GPU for GraphicsProgramming a GPU for General-Purpose Programs (Old)programmer specifies geometric primitive covering computation domain of interest; rasterizer generates fragmentEach fragment is shaded by an SPMD general purpose FPFP computes the fragment by a combination of math operations and ‘gather’ accesses from global memory. resulting buffer can be used as an input on future passes. programmer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.

Computing on the GPUProgramming a GPU for General-Purpose Programs (New)programmer directly defines the computation domain of interest as a structured grid of threadsSPMD general-purpose program computes each threadeach thread is computed by a combination of math operations and both ‘gather’ (read) accesses from and ‘scatter’ (write) accesses to global memory; (same buffer can be used for both allowing more flexible algorithms)resulting buffer in global memory can then be used as an input in future computation

Software EnvironmentsBrookGPUMicrosoft’s AcceleratorVendor Specific GPGPU systemsAMD ATI’s CTM (Close to the Metal)NVIDIA’s CUDA (Compute Unified Device Architecture)

Scan performance on CPU, graphics-based GPU (using OpenGL), and direct-compute GPU (using CUDA). Results obtained on a GeForce 8800 GTX GPU and Intel Core2-Duo Extreme 2.93 GHz CPU. (Figure adapted from Harris et al.)Scan performance on CPU, OpenGL and CUDA

Concluding for bright Future…support for double-precision floating-pointhigher bandwidth path between CPU and GPU (like ATI’s HyperTransport)more tightly coupled CPU and GPU (AMD’s fusion or nVidianForce)NVIDIA Quadro for Multiple GPU CollaborationFinally, let us wait for new era when GPU Computing will rule

Thank YouI would also like to thank,

GPU Computing

More Related Content

What's hot

Viewers also liked

Similar to GPU Computing

More from Khan Mostafa

Recently uploaded

GPU Computing