This document discusses accelerated computing using GPUs and OpenCL. It begins by covering the evolution of x86 processors towards multi-core designs and the use of GPUs as accelerators. It then introduces accelerated processing units that combine CPU and GPU components. The document concludes by introducing OpenCL as an open standard for programming GPUs and heterogeneous systems that allows developers to write code that scales across CPUs and GPUs.
GPU compute has leveraged discrete GPUs for a fairly limited set of academic and supercomputing system workloads until recently. With the increase in performance of integrated GPU inside an Accelerated Processing Unit (APU), introduction of Heterogeneous System Architecture (HSA) devices, and proliferation of programming tools, we are seeing GPU compute make its way into mainstream applications. In this presentation we cover GPU compute and HSA, focusing on the application of GPU compute in the Medical and Print Imaging segments. Examples of performance data are reviewed and the case is made for how GPU compute can deliver tangible benefits.
GPU compute has leveraged discrete GPUs for a fairly limited set of academic and supercomputing system workloads until recently. With the increase in performance of integrated GPU inside an Accelerated Processing Unit (APU), introduction of Heterogeneous System Architecture (HSA) devices, and proliferation of programming tools, we are seeing GPU compute make its way into mainstream applications. In this presentation we cover GPU compute and HSA, focusing on the application of GPU compute in the Medical and Print Imaging segments. Examples of performance data are reviewed and the case is made for how GPU compute can deliver tangible benefits.
AMD is introducing “Seattle,” a 64-bit ARM-based server System-on a –Chip (SoC) built on the same technology that powers billions of today’s most popular mobile devices.
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...AMD Developer Central
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Harris Gasparakis, AMD, at the Embedded Vision Alliance Summit, May 2014.
Harris Gasparakis, Ph.D., is AMD’s OpenCV manager. In addition to enhancing OpenCV with OpenCL acceleration, he is engaged in AMD’s Computer Vision strategic planning, ISVs, and AMD Ventures engagements, including technical leadership and oversight in the AMD Gesture product line. He holds a Ph.D. in theoretical high energy physics from YITP at SUNYSB. He is credited with enabling real-time volumetric visualization and analysis in Radiology Information Systems (Terarecon), including the first commercially available virtual colonoscopy system (Vital Images). He was responsible for cutting edge medical technology (Biosense Webster, Stereotaxis, Boston Scientific), incorporating image and signal processing with AI and robotic control.
AMD Bridges the X86 and ARM Ecosystems for the Data Center AMD
Presentation by Lisa Su, senior vice president and general manager, Global Business Units, AMD regarding AMD’s announcement that it will design and build 64-bit ARM technology-based processors.
HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...AMD Developer Central
Presentation Hc-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael Wootton at the AMD Developer Summit (APU13) November 11-13, 2013.
Heterogeneous Systems Architecture: The Next Area of Computing Innovation AMD
Dr. Lisa Su, Senior Vice President and GM, Global Business Units, AMD keynote from ISSCC on Heterogeneous Systems Architecture: The Next Area of Computing Innovation - Case Study, The Holodeck.
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...AMD Developer Central
Presentation CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with Windows Server, by Derrick Isoka at the AMD Developer Summit (APU13) November 11-13, 2013
Heterogeneous Computing : The Future of SystemsAnand Haridass
Charts from NITK-IBM Computer Systems Research Group (NCSRG)
- Dennard Scaling,Moore's Law, OpenPOWER, Storage Class Memory, FPGA, GPU, CAPI, OpenCAPI, nVidia nvlink, Google Microsoft Heterogeneous system usage
Supermicro’s Universal GPU: Modular, Standards Based and Built for the FutureRebekah Rodriguez
The Universal GPU system architecture combines the latest technologies that support multiple GPU form factors, CPU choices, storage, and networking options.Together, these components are optimized to deliver high performance in a balanced architecture in a highly scalable system. Systems can be optimized for each customer’s specific Artificial Intelligence (AI), Machine Learning (ML), or High Performance Computing (HPC) applications. Organizations worldwide are demanding new options for their future computing environments, which have the thermal headroom for the next generation of CPUs and GPUs.
Join this webinar to learn how to leverage Supermicro's Universal GPU system to simplify customer deployments, deliver ultimate modularity and customization options for AI to Omniverse environments.
AMD is introducing “Seattle,” a 64-bit ARM-based server System-on a –Chip (SoC) built on the same technology that powers billions of today’s most popular mobile devices.
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...AMD Developer Central
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Harris Gasparakis, AMD, at the Embedded Vision Alliance Summit, May 2014.
Harris Gasparakis, Ph.D., is AMD’s OpenCV manager. In addition to enhancing OpenCV with OpenCL acceleration, he is engaged in AMD’s Computer Vision strategic planning, ISVs, and AMD Ventures engagements, including technical leadership and oversight in the AMD Gesture product line. He holds a Ph.D. in theoretical high energy physics from YITP at SUNYSB. He is credited with enabling real-time volumetric visualization and analysis in Radiology Information Systems (Terarecon), including the first commercially available virtual colonoscopy system (Vital Images). He was responsible for cutting edge medical technology (Biosense Webster, Stereotaxis, Boston Scientific), incorporating image and signal processing with AI and robotic control.
AMD Bridges the X86 and ARM Ecosystems for the Data Center AMD
Presentation by Lisa Su, senior vice president and general manager, Global Business Units, AMD regarding AMD’s announcement that it will design and build 64-bit ARM technology-based processors.
HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...AMD Developer Central
Presentation Hc-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael Wootton at the AMD Developer Summit (APU13) November 11-13, 2013.
Heterogeneous Systems Architecture: The Next Area of Computing Innovation AMD
Dr. Lisa Su, Senior Vice President and GM, Global Business Units, AMD keynote from ISSCC on Heterogeneous Systems Architecture: The Next Area of Computing Innovation - Case Study, The Holodeck.
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...AMD Developer Central
Presentation CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with Windows Server, by Derrick Isoka at the AMD Developer Summit (APU13) November 11-13, 2013
Heterogeneous Computing : The Future of SystemsAnand Haridass
Charts from NITK-IBM Computer Systems Research Group (NCSRG)
- Dennard Scaling,Moore's Law, OpenPOWER, Storage Class Memory, FPGA, GPU, CAPI, OpenCAPI, nVidia nvlink, Google Microsoft Heterogeneous system usage
Supermicro’s Universal GPU: Modular, Standards Based and Built for the FutureRebekah Rodriguez
The Universal GPU system architecture combines the latest technologies that support multiple GPU form factors, CPU choices, storage, and networking options.Together, these components are optimized to deliver high performance in a balanced architecture in a highly scalable system. Systems can be optimized for each customer’s specific Artificial Intelligence (AI), Machine Learning (ML), or High Performance Computing (HPC) applications. Organizations worldwide are demanding new options for their future computing environments, which have the thermal headroom for the next generation of CPUs and GPUs.
Join this webinar to learn how to leverage Supermicro's Universal GPU system to simplify customer deployments, deliver ultimate modularity and customization options for AI to Omniverse environments.
Modular by Design: Supermicro’s New Standards-Based Universal GPU ServerRebekah Rodriguez
In this webinar, members of the Server Solution Team as well as a member of Supermicro’s Product Office will discuss Supermicro’s Universal GPU Server, the server’s modular, standards-based design, the important role of OCP Accelerator Module (OAM) form factor, and Universal Baseboard (UBB) in the system, as well as touching on AMD's next generation HPC accelerator. In addition, we will get some insights into trends in the HPC and AI/Machine Learning space, including the different software platforms and best practices that are driving innovation in our industry and daily lives. In particular: • Tools to enable use of the high performance hardware for HPC and Deep Learning applications • Tools to enable use of multiple GPUs, including RDMA, to solve highly demanding HPC and deep learning models, such as BERT • Running applications in containers with AMD’s next generation GPU system
Modular by Design: Supermicro’s New Standards-Based Universal GPU ServerRebekah Rodriguez
In this webinar, members of the Server Solution Team as well as a member of Supermicro’s Product Office will discuss Supermicro’s Universal GPU Server, the server’s modular, standards-based design, the important role of OCP Accelerator Module (OAM) form factor, and Universal Baseboard (UBB) in the system, as well as touching on AMD's next generation HPC accelerator. In addition, we will get some insights into trends in the HPC and AI/Machine Learning space, including the different software platforms and best practices that are driving innovation in our industry and daily lives. In particular: • Tools to enable use of the high performance hardware for HPC and Deep Learning applications • Tools to enable use of multiple GPUs, including RDMA, to solve highly demanding HPC and deep learning models, such as BERT • Running applications in containers with AMD’s next generation GPU system
Backend.AI Technical Introduction (19.09 / 2019 Autumn)Lablup Inc.
This slide introduces technical specs and details about Backend.AI 19.09.
* On-premise clustering / container orchestration / scaling on cloud
* Container-level fractional GPU technology to use one GPU as many GPUs on many containers at the same time.
* NVidia GPU Cloud integrations
* Enterprise features
In this deck from the HPC User Forum in Tucson, Jeff Stuecheli from IBM presents: POWER9 for AI & HPC.
"Built from the ground-up for data intensive workloads, POWER9 is the only processor with state-of-the-art I/O subsystem technology, including next generation NVIDIA NVLink, PCIe Gen4, and OpenCAPI."
Watch the video: https://wp.me/p3RLHQ-isJ
Learn more: https://www.ibm.com/it-infrastructure/power/power9
and
http://hpcuserforum.com
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
A presentation for all the IT resellers and retailers in Nepal.
Introducing next generation technologies into the consumer market to collectively deliver a greater and richer computer experience.
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mãoEmbarcados
Objetivo do Webinar: Venha saber como a plataforma NVIDIA Jetson e suas ferramentas habilitam você a desenvolver e implantar robôs, drones, aplicativos de IVA e outras máquinas autônomas com tecnologia AI que pensam por conta própria.
Apoio: Arrow e NVIDIA.
Convidado: Marcel Saraiva
Gerente de Contas Enterprise da NVIDIA, executivo com 20 anos de expereincia no mercado de TI, teve na sua carreia passagens pela SGI (Silicon Graphics), Intel e Scansource. Engenheiro eletrico formado pela FEI, com pós-graduação em Marketing pela FAAP e MBA em Gestão Empresarial pela FGV.
Link para o Webinar: https://www.embarcados.com.br/webinars/nvidia-jetson-a-inteligencia-artificial-na-palma-de-sua-mao/
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
5. 4P/24-core system examplevery good scalability One memory controller for every processor Full-duplex Hyper Transport links (up to 5.2GHz) Bus Optimization: HT Assist (Cache Probe Filtering) Still the only available 4P system with Direct Connect Architecture MEMORY MEMORY MEMORY MEMORY
6. Direct Connect Architecture 1.0Balanced and Scalable Design to Support up to 6 Cores 2 MEMORY CHANNELS 2 MEMORY CHANNELS 8 DIMMs per CPU 8 DIMMs per CPU 2 MEMORY CHANNELS 2 MEMORY CHANNELS 8 DIMMs per CPU 8 DIMMs per CPU No front side bus HyperTransport™ technology Integrated memory controller NUMA memory architecture
15. 2011 GPU Architecture AMD Radeon™ HD 6900 Series Dual graphics engines New VLIW4 core architecture Up to 24 SIMD engines Up to 96 Texture Units Upgraded render back-ends Improved anti-aliasing performance Fast 256-bit GDDR5 memory interface Up to 5.5 Gbps New GPU compute features
17. Old and New in High Performance Computing Old: Power is free, Transistors are expensive New: Power expensive, Transistors free (Can put more transistors on chip than can afford to turn on) Old: Multiplies are slow, Memory access is fast New: Multiplies fast, Memory slow (up 200 clocks to DRAM memory, 4 clocks for FP multiply) Old: Increasing Instruction Level Parallelism via compilers innovation New: Explicit thread and data parallelism must be exploited
18. GPUs: more than just gaming 15 2700 Both use GPUs Oil exploration platform - 2010 Wii Sports - Golf
19.
20. Tasks like loading a texture or compiling a shader can execute in parallel with main rendering threadDirectX® 10 DirectX® 11 16
28. Great use for additional CPU coresGraphics Workloads Other Highly Parallel Workloads Serial/Task-Parallel Workloads Delivers optimal performance for a wide range of platform configurations
29. ATI Stream Technology is… Heterogeneous: Developers leverage AMD GPUs and x86 CPUs for optimal application performance and user experience High performance:Massively parallel, programmable GPU architecture delivers unprecedented performance and power efficiency Industry Standards:OpenCL™ and DirectCompute 11 enable cross-platform development Engineering Sciences Government Gaming Digital Content Creation Productivity
30.
31.
32. Video Transcoding SampleNo GPU Acceleration CPU Usage: 100% Frames Frames Using four CPU Cores GPU Usage: 1% 26
33. Video Transcoding SampleATI GPU Acceleration CPU Usage: 45% Control Control Frames Frames GPU Usage: 35% Using hundreds of Stream Processors 27
35. Today TeraFLOPS-class GPU Multi-core CPU ~800 million transistors Multi-tasking Up to 2 billion transistors Jogosemmultiplosmonitores Video e audio Full HD
36.
37. Power efficientCons: Software availability ? Single-thread We are here Performance Performance We are here We are here Time x Cores Time Time
38. A new Era on performance evolution Multi-Core Single-Core CPU Core efficiency Software Acceleration Low power consumption Multimedia Gaming GPU
78. Comparing OpenCL™ and DirectX® 11 DirectCompute How will developers choose between OpenCL™ and DirectX® 11 DirectCompute? Feature set is similar in both APIs DirectX® 11 DirectCompute Easiest path to add compute capabilities to existing DirectX applications Windows Vista® and Windows® 7 only OpenCL™ Ideal path for new applications porting to the GPU for the first time True multiplatform: Windows®, Linux®, MacOS Natural programming without dealing with a graphics API
79.
80. Subset of ISO C99 with language extensions - familiar to developers
Our new technology pillars that will help the channel differentiate
Explain how 3 monitors can be less expensive than single 30” monitor. E.g 3x22” ~ $500 solution, vs single 30” > $1000On the productivity, also explain ISVs continue to leverage multi-monitor. E.g. MS office 2010, on powerpoint you can open multiple files on multiple windows.
Original legal approval – Maranello Platform Launch, March 2010The first generation DCA introduced features now expected in the market[cover features at bottom quickly and go to next slide]
Original legal approval – Maranello Platform Launch, March 2010Today’s introduction brings DCA 2.0Four memory channels12 DIMMs per CPUSupports up to 12 cores today, will support next-gen core with up to 16 per CPULet’s take a closer look at the effect of memory on workloads [next slide]
done
Add more deep blue computers
Add “All models ATI Radeon™”Add “as of this date the HD5870 GPU has the highest GFLOPS/mm2 of all known products”
Explain how 3 monitors can be less expensive than single 30” monitor. E.g 3x22” ~ $500 solution, vs single 30” > $1000On the productivity, also explain ISVs continue to leverage multi-monitor. E.g. MS office 2010, on powerpoint you can open multiple files on multiple windows.
Work on the slide (larget text)
Using ATI Stream technology, enjoy better visual quality when you watch streaming video online (YouTube/Hulu) with new video enhancement features.*
Explain how 3 monitors can be less expensive than single 30” monitor. E.g 3x22” ~ $500 solution, vs single 30” > $1000On the productivity, also explain ISVs continue to leverage multi-monitor. E.g. MS office 2010, on powerpoint you can open multiple files on multiple windows.
Let’s look at today’s compute platforms:You have a Phenom II with 758 million transistors on 45nm process technology on the left On the right you see a 5870 DX11 GPU with 2.15 billion transistors on 40nm process technology. Today, with the emergence of visual computing, you see more work than ever before for the GPU. Especially with, arguably for consumers, the most important workload: video.The explosion of HD video and now HD gaming, means the GPU matters more than ever in the PC platform. More user-generated content puts more of the work onto the GPU such as video processing and rendering and 3D user interface.The era of visual computing is already becoming more about mobility and being able to do more of what I’ve just described on the go. However, users do not want more compute capabilities at the expense of battery life or smaller form factors.Favoring one component over the other or taking a niche approach to balanced visual computing platforms does not meet the needs of the mass market. Usage scenarios favor a combination of GPU/CPU balance and low power..
Now – Many of you are technologists, so you are probably glad to see me finally start talking about some technology – the workload changes are also dramatically impacting chip architectures.This chart does a good job of demonstrating the evolution of chip architectures:Starting on X axis on the left you go back in time to highly programmable, single core CPUs which aimed to increase throughput (Y axis) over time by first adding threads, then cores.GPUs on the other hand, started out way to the right in terms of throughput and have been becoming more and more programmable.We call this evolution the move from Homogenous Computing to Heterogeneous Computing , finally resulting up on the top right where the two arrows meet in what we call an APU. A combination of different types of cores, working closely together on different type workloads for optimum performance per watt per mm2This AMD’s architectural vision of the future and where we are heading with our first APU in 2011, the Llano processor – our first integrated CPU + GPU on a single piece of silicon.
WHERE WE ARE TODAYAttempt to provide an environment in which optimized hardware can provide higher absolute performance, better power efficiency, and lower cost. At the same time, the goal is to dramatically improve programmer productivity as the cost of software development is substantially the same as hardware developmentThis means support for heterogeneous multi-core hardware and a much more effective application programming environment are critical.This chart does a good job of summarizing the evolution of chip architectures:Starting on X axis on the left you go back in time to highly programmable, single core CPUs which aimed to increase throughput (Y axis) over time by first adding threads, then cores.GPUs on the other hand, started out way to the right in terms of throughput and have been becoming more and more programmable.We call this evolution the move from Homogenous Computing to Heterogeneous Computing , finally resulting up on the top right where the two arrows meet in what we call an APU. A combination of different types of cores, working closely together on different type workloads for optimum performance per watt per mm2
The need for this optimal energy-efficient balance of CPU and GPU represents the beginning of a new era of computing in 2011.The Fusion of CPU and GPU compute power is what the next chapter in visual computing requires – a powerful visual computing experience at home or on the go without compromise. Our AMD Fusion™ design is driven by mobility and is based on a low-power visual compute architecture that will enhance active and resting battery life while increasing both CPU and GPU performance. This is the culmination of the vision of ‘One AMD’ and only AMD can deliver the GPU and CPU combination that will be the future of computing
Review slide to determine message
The Industry has always tried to move away from proprietary technology and towards open standards when available.The proprietary Apple Display Connector never became popular since DVI was license-free and widely available.3dfx’s Glide API for 3D graphics failed to stick around in the market long after DirectX was available on a wide variety of hardware.nVIDIA’s Cg language was never widely used since OpenGL and DirectX provided a compelling open alternativeThe Unified Display Interface was a failed interface backed by Intel and nVIDIA, which was deprecated in favor of the license-free DisplayPort standard.RAMBUS has tried to bring many proprietary memory technologies to market, but have always been displaced by JEDEC open memory standards.CUDA is a proprietary GPGPU model into the market whose specification is controlled by only one company, we believe it will soon be replaced by OpenCL and the DirectX Compute Shader.