SlideShare a Scribd company logo
1 of 19
Download to read offline
NVIDIA Parallel Nsight™

Siggraph Asia 2010 | Seoul
Agenda

         Introduction to Parallel Nsight

         CUDA C/C++ Source Debugging

         Analysis/System Trace

         Q&A




NVIDIA Confidential
What is Parallel Nsight?

         Development environment for
         heterogeneous platforms (CPU and GPU)

         Fully integrated into Microsoft Visual
         Studio 2008 and 2010

         Dramatic productivity improvement in
         common development tasks



NVIDIA Confidential
Parallel Nsight for Compute




                      Parallel Compute Debugger                              System Analyzer (Pro only)
                      Examine compute kernels directly on GPU hardware       Capture and visualize CPU and GPU level events on a
                                                                             single correlated timeline
                      Debug CUDA C/C++ and DirectCompute applications        Inspect workload dependencies using the Timeline View
                                                                             Profile CUDA kernels using GPU performance counters
                      Visualize thousands of threads executing in parallel
                      using Visual Studio

NVIDIA Confidential
                      Use conditional breakpoints to correct errors in
                      massively parallel code
CUDA C/C++ Debugging

         Compile your code with Debug flag

         Use the familiar Visual Studio interface to debug your GPU code

         Dramatic productivity improvements
                      Explore memory during a live session vs. coding specific transfers
                      Immediately view live variables vs. printf/recompile loop
                      Set data breakpoints on memory area vs. trial and error
                      And much more!


NVIDIA Confidential
Setting Breakpoints




NVIDIA Confidential
Viewing Variable Values




NVIDIA Confidential
Viewing GPU Memory




NVIDIA Confidential
Switching Between Threads




NVIDIA Confidential
Conditional Breakpoints




NVIDIA Confidential
Data Breakpoints




NVIDIA Confidential
Trace

         Powerful performance tool

         Correlated timeline between CPU and GPU

         Bottleneck identification
                      Find CPU vs. GPU boundedness
                      Find memory transfer vs. kernel computation bounded
                      Get macro-level information on which CUDA kernels use the most time




NVIDIA Confidential
Trace across the CPU and GPU




NVIDIA Confidential
Trace – Overlap Memory Transactions




NVIDIA Confidential
Tesla Compute Cluster Support (TCC)

         TCC is a special driver mode for Windows 7, Vista, and HPC
         Server 2008.
                      Included in our most recent R260 driver release.


         Benefits
                      Lower overhead to kernel launches = higher performance
                      Running CUDA on a MS Remote desktop


         Parallel Nsight 1.5 now supports debugging on GPUs using a
         TCC driver.

NVIDIA Confidential
4 Flexible GPU Development Configurations
  Desktop                Single machine, Single NVIDIA GPU
                          Analyzer, Graphics Inspector

                         Single machine, Dual NVIDIA GPUs
                          Analyzer, Graphics Inspector, Compute Debugger


  Networked              Two machines connected over the network
                          Analyzer, Graphics Inspector, Compute Debugger, Graphics Debugger
                TCP/IP


 Workstation SLI         SLI Multi OS workstation with two Quadro GPUs
                          Analyzer, Graphics Inspector, Compute Debugger, Graphics Debugger


NVIDIA Confidential
Parallel Nsight 1.5 Feature Support
                                                                                             Professional ($349)
                                                                      Standard (no cost)   available for purchase in
                                                                                                  December

                      Compute Debugger                                                               
                      DirectX 10 & 11 Debugger & Graphics inspector                                  
                      GeForce Support: 9 series or higher                                            
                      Tesla Support: C1050/S1070 or higher                                           
                      Quadro Support: G9x or higher                                                  
                      Windows 7, Vista and HPC Server 2008                                           
                      Visual Studio 2008 SP1 and Visual Studio 2010                                  
                      Compute Analyzer                                                                
                      OpenGL and OpenCL Analyzer                                                      
                      DirectX 10 & 11 Analyzer                                                        
                      Tesla Compute Cluster (TCC) Debugging                                           

NVIDIA Confidential               http://www.nvidia.com/GetParallelNsight
Parallel Nsight Resources

         Parallel Nsight GPU Computing Forum

         The Parallel Nsight User Guide
                      Installed with the Host installer
                      Available on the Web


         Links to these from: http://developer.nvidia.com/ParallelNsight




NVIDIA Confidential
developer.nvidia.com/ParallelNsight



                         Free Download
                             Videos
                         Documentation


NVIDIA Confidential

More Related Content

What's hot

The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...AMD Developer Central
 
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...AMD Developer Central
 
PT-4142, Porting and Optimizing OpenMP applications to APU using CAPS tools, ...
PT-4142, Porting and Optimizing OpenMP applications to APU using CAPS tools, ...PT-4142, Porting and Optimizing OpenMP applications to APU using CAPS tools, ...
PT-4142, Porting and Optimizing OpenMP applications to APU using CAPS tools, ...AMD Developer Central
 
GPU Virtualization in Embedded Automotive Solutions
GPU Virtualization in Embedded Automotive SolutionsGPU Virtualization in Embedded Automotive Solutions
GPU Virtualization in Embedded Automotive SolutionsGlobalLogic Ukraine
 
GS-4150, Bullet 3 OpenCL Rigid Body Simulation, by Erwin Coumans
GS-4150, Bullet 3 OpenCL Rigid Body Simulation, by Erwin CoumansGS-4150, Bullet 3 OpenCL Rigid Body Simulation, by Erwin Coumans
GS-4150, Bullet 3 OpenCL Rigid Body Simulation, by Erwin CoumansAMD Developer Central
 
N A G P A R I S280101
N A G P A R I S280101N A G P A R I S280101
N A G P A R I S280101John Holden
 
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey PavlenkoMM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey PavlenkoAMD Developer Central
 
NVIDIA OpenGL and Vulkan Support for 2017
NVIDIA OpenGL and Vulkan Support for 2017NVIDIA OpenGL and Vulkan Support for 2017
NVIDIA OpenGL and Vulkan Support for 2017Mark Kilgard
 
[02][cuda c 프로그래밍 소개] gateau intro to_cuda_c
[02][cuda c 프로그래밍 소개] gateau intro to_cuda_c[02][cuda c 프로그래밍 소개] gateau intro to_cuda_c
[02][cuda c 프로그래밍 소개] gateau intro to_cuda_claparuma
 
PG-4039, RapidFire API, by Dmitry Kozlov
PG-4039, RapidFire API, by Dmitry KozlovPG-4039, RapidFire API, by Dmitry Kozlov
PG-4039, RapidFire API, by Dmitry KozlovAMD Developer Central
 
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...AMD Developer Central
 
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon SelleyPT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon SelleyAMD Developer Central
 
NVIDIA vGPU Talk – Sizing and Common Mistakes
NVIDIA vGPU Talk – Sizing and Common MistakesNVIDIA vGPU Talk – Sizing and Common Mistakes
NVIDIA vGPU Talk – Sizing and Common MistakesLee Bushen
 
NVIDIA vGPU - Introduction to NVIDIA Virtual GPU
NVIDIA vGPU - Introduction to NVIDIA Virtual GPUNVIDIA vGPU - Introduction to NVIDIA Virtual GPU
NVIDIA vGPU - Introduction to NVIDIA Virtual GPULee Bushen
 
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...AMD Developer Central
 
DSDP Device Debugging Project Update - Eclipse Con 2009
DSDP Device Debugging Project Update - Eclipse Con 2009DSDP Device Debugging Project Update - Eclipse Con 2009
DSDP Device Debugging Project Update - Eclipse Con 2009PawelP
 
NVIDIA OpenGL 4.6 in 2017
NVIDIA OpenGL 4.6 in 2017NVIDIA OpenGL 4.6 in 2017
NVIDIA OpenGL 4.6 in 2017Mark Kilgard
 
Qt 6 Chat - Are You Ready?
Qt 6 Chat - Are You Ready?Qt 6 Chat - Are You Ready?
Qt 6 Chat - Are You Ready?ICS
 
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian BallantyneWT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian BallantyneAMD Developer Central
 
PT-4058, Measuring and Optimizing Performance of Cluster and Private Cloud Ap...
PT-4058, Measuring and Optimizing Performance of Cluster and Private Cloud Ap...PT-4058, Measuring and Optimizing Performance of Cluster and Private Cloud Ap...
PT-4058, Measuring and Optimizing Performance of Cluster and Private Cloud Ap...AMD Developer Central
 

What's hot (20)

The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
 
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
 
PT-4142, Porting and Optimizing OpenMP applications to APU using CAPS tools, ...
PT-4142, Porting and Optimizing OpenMP applications to APU using CAPS tools, ...PT-4142, Porting and Optimizing OpenMP applications to APU using CAPS tools, ...
PT-4142, Porting and Optimizing OpenMP applications to APU using CAPS tools, ...
 
GPU Virtualization in Embedded Automotive Solutions
GPU Virtualization in Embedded Automotive SolutionsGPU Virtualization in Embedded Automotive Solutions
GPU Virtualization in Embedded Automotive Solutions
 
GS-4150, Bullet 3 OpenCL Rigid Body Simulation, by Erwin Coumans
GS-4150, Bullet 3 OpenCL Rigid Body Simulation, by Erwin CoumansGS-4150, Bullet 3 OpenCL Rigid Body Simulation, by Erwin Coumans
GS-4150, Bullet 3 OpenCL Rigid Body Simulation, by Erwin Coumans
 
N A G P A R I S280101
N A G P A R I S280101N A G P A R I S280101
N A G P A R I S280101
 
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey PavlenkoMM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
 
NVIDIA OpenGL and Vulkan Support for 2017
NVIDIA OpenGL and Vulkan Support for 2017NVIDIA OpenGL and Vulkan Support for 2017
NVIDIA OpenGL and Vulkan Support for 2017
 
[02][cuda c 프로그래밍 소개] gateau intro to_cuda_c
[02][cuda c 프로그래밍 소개] gateau intro to_cuda_c[02][cuda c 프로그래밍 소개] gateau intro to_cuda_c
[02][cuda c 프로그래밍 소개] gateau intro to_cuda_c
 
PG-4039, RapidFire API, by Dmitry Kozlov
PG-4039, RapidFire API, by Dmitry KozlovPG-4039, RapidFire API, by Dmitry Kozlov
PG-4039, RapidFire API, by Dmitry Kozlov
 
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
 
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon SelleyPT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
 
NVIDIA vGPU Talk – Sizing and Common Mistakes
NVIDIA vGPU Talk – Sizing and Common MistakesNVIDIA vGPU Talk – Sizing and Common Mistakes
NVIDIA vGPU Talk – Sizing and Common Mistakes
 
NVIDIA vGPU - Introduction to NVIDIA Virtual GPU
NVIDIA vGPU - Introduction to NVIDIA Virtual GPUNVIDIA vGPU - Introduction to NVIDIA Virtual GPU
NVIDIA vGPU - Introduction to NVIDIA Virtual GPU
 
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
 
DSDP Device Debugging Project Update - Eclipse Con 2009
DSDP Device Debugging Project Update - Eclipse Con 2009DSDP Device Debugging Project Update - Eclipse Con 2009
DSDP Device Debugging Project Update - Eclipse Con 2009
 
NVIDIA OpenGL 4.6 in 2017
NVIDIA OpenGL 4.6 in 2017NVIDIA OpenGL 4.6 in 2017
NVIDIA OpenGL 4.6 in 2017
 
Qt 6 Chat - Are You Ready?
Qt 6 Chat - Are You Ready?Qt 6 Chat - Are You Ready?
Qt 6 Chat - Are You Ready?
 
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian BallantyneWT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
 
PT-4058, Measuring and Optimizing Performance of Cluster and Private Cloud Ap...
PT-4058, Measuring and Optimizing Performance of Cluster and Private Cloud Ap...PT-4058, Measuring and Optimizing Performance of Cluster and Private Cloud Ap...
PT-4058, Measuring and Optimizing Performance of Cluster and Private Cloud Ap...
 

Similar to NVIDIA Parallel Nsight Development Environment

AMD Catalyst Software
AMD Catalyst Software  AMD Catalyst Software
AMD Catalyst Software AMD
 
GPU Accelerated Virtual Desktop Infrastructure (VDI) on OpenStack
GPU Accelerated Virtual Desktop Infrastructure (VDI) on OpenStackGPU Accelerated Virtual Desktop Infrastructure (VDI) on OpenStack
GPU Accelerated Virtual Desktop Infrastructure (VDI) on OpenStackBrian Schott
 
VMworld 2015: Deliver High Performance Desktops with VMware Horizon and NVIDI...
VMworld 2015: Deliver High Performance Desktops with VMware Horizon and NVIDI...VMworld 2015: Deliver High Performance Desktops with VMware Horizon and NVIDI...
VMworld 2015: Deliver High Performance Desktops with VMware Horizon and NVIDI...VMworld
 
車載組み込み用ディープラーニング・エンジン NVIDIA DRIVE PX
車載組み込み用ディープラーニング・エンジン NVIDIA DRIVE PX車載組み込み用ディープラーニング・エンジン NVIDIA DRIVE PX
車載組み込み用ディープラーニング・エンジン NVIDIA DRIVE PXNVIDIA Japan
 
DCSF 19 Accelerating Docker Containers with NVIDIA GPUs
DCSF 19 Accelerating Docker Containers with NVIDIA GPUsDCSF 19 Accelerating Docker Containers with NVIDIA GPUs
DCSF 19 Accelerating Docker Containers with NVIDIA GPUsDocker, Inc.
 
VMworld 2013: On the Way to GPU Virtualization – 3D Acceleration in Virtual M...
VMworld 2013: On the Way to GPU Virtualization – 3D Acceleration in Virtual M...VMworld 2013: On the Way to GPU Virtualization – 3D Acceleration in Virtual M...
VMworld 2013: On the Way to GPU Virtualization – 3D Acceleration in Virtual M...VMworld
 
[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools
[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools
[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages toolslaparuma
 
CUDA by Example : Getting Started : Notes
CUDA by Example : Getting Started : NotesCUDA by Example : Getting Started : Notes
CUDA by Example : Getting Started : NotesSubhajit Sahu
 
10. GPU - Video Card (Display, Graphics, VGA)
10. GPU - Video Card (Display, Graphics, VGA)10. GPU - Video Card (Display, Graphics, VGA)
10. GPU - Video Card (Display, Graphics, VGA)Akhila Dakshina
 
VMworld 2013: Graphics and Users in VDI
VMworld 2013: Graphics and Users in VDI VMworld 2013: Graphics and Users in VDI
VMworld 2013: Graphics and Users in VDI VMworld
 
NVIDIA GTC 2019: Red Hat and the NVIDIA DGX: Tried, Tested, Trusted
NVIDIA GTC 2019:  Red Hat and the NVIDIA DGX: Tried, Tested, TrustedNVIDIA GTC 2019:  Red Hat and the NVIDIA DGX: Tried, Tested, Trusted
NVIDIA GTC 2019: Red Hat and the NVIDIA DGX: Tried, Tested, TrustedJeremy Eder
 
08 - it3D Summit 2016 - Grid - T. Riley- NVIDIA
08 - it3D Summit 2016 - Grid - T. Riley- NVIDIA08 - it3D Summit 2016 - Grid - T. Riley- NVIDIA
08 - it3D Summit 2016 - Grid - T. Riley- NVIDIAVirginia Grubert
 
Best practices for optimizing Red Hat platforms for large scale datacenter de...
Best practices for optimizing Red Hat platforms for large scale datacenter de...Best practices for optimizing Red Hat platforms for large scale datacenter de...
Best practices for optimizing Red Hat platforms for large scale datacenter de...Jeremy Eder
 
“A New, Open-standards-based, Open-source Programming Model for All Accelerat...
“A New, Open-standards-based, Open-source Programming Model for All Accelerat...“A New, Open-standards-based, Open-source Programming Model for All Accelerat...
“A New, Open-standards-based, Open-source Programming Model for All Accelerat...Edge AI and Vision Alliance
 
PGI Compilers & Tools Update- March 2018
PGI Compilers & Tools Update- March 2018PGI Compilers & Tools Update- March 2018
PGI Compilers & Tools Update- March 2018NVIDIA
 
Ati Catalyst Preview
Ati Catalyst PreviewAti Catalyst Preview
Ati Catalyst Previewguestb277b2b
 

Similar to NVIDIA Parallel Nsight Development Environment (20)

AMD Catalyst Software
AMD Catalyst Software  AMD Catalyst Software
AMD Catalyst Software
 
GPU Accelerated Virtual Desktop Infrastructure (VDI) on OpenStack
GPU Accelerated Virtual Desktop Infrastructure (VDI) on OpenStackGPU Accelerated Virtual Desktop Infrastructure (VDI) on OpenStack
GPU Accelerated Virtual Desktop Infrastructure (VDI) on OpenStack
 
VMworld 2015: Deliver High Performance Desktops with VMware Horizon and NVIDI...
VMworld 2015: Deliver High Performance Desktops with VMware Horizon and NVIDI...VMworld 2015: Deliver High Performance Desktops with VMware Horizon and NVIDI...
VMworld 2015: Deliver High Performance Desktops with VMware Horizon and NVIDI...
 
車載組み込み用ディープラーニング・エンジン NVIDIA DRIVE PX
車載組み込み用ディープラーニング・エンジン NVIDIA DRIVE PX車載組み込み用ディープラーニング・エンジン NVIDIA DRIVE PX
車載組み込み用ディープラーニング・エンジン NVIDIA DRIVE PX
 
DCSF 19 Accelerating Docker Containers with NVIDIA GPUs
DCSF 19 Accelerating Docker Containers with NVIDIA GPUsDCSF 19 Accelerating Docker Containers with NVIDIA GPUs
DCSF 19 Accelerating Docker Containers with NVIDIA GPUs
 
VMworld 2013: On the Way to GPU Virtualization – 3D Acceleration in Virtual M...
VMworld 2013: On the Way to GPU Virtualization – 3D Acceleration in Virtual M...VMworld 2013: On the Way to GPU Virtualization – 3D Acceleration in Virtual M...
VMworld 2013: On the Way to GPU Virtualization – 3D Acceleration in Virtual M...
 
[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools
[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools
[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools
 
GPU Programming with Java
GPU Programming with JavaGPU Programming with Java
GPU Programming with Java
 
CUDA by Example : Getting Started : Notes
CUDA by Example : Getting Started : NotesCUDA by Example : Getting Started : Notes
CUDA by Example : Getting Started : Notes
 
10. GPU - Video Card (Display, Graphics, VGA)
10. GPU - Video Card (Display, Graphics, VGA)10. GPU - Video Card (Display, Graphics, VGA)
10. GPU - Video Card (Display, Graphics, VGA)
 
VMworld 2013: Graphics and Users in VDI
VMworld 2013: Graphics and Users in VDI VMworld 2013: Graphics and Users in VDI
VMworld 2013: Graphics and Users in VDI
 
Introduction to GPU Programming
Introduction to GPU ProgrammingIntroduction to GPU Programming
Introduction to GPU Programming
 
NVIDIA GTC 2019: Red Hat and the NVIDIA DGX: Tried, Tested, Trusted
NVIDIA GTC 2019:  Red Hat and the NVIDIA DGX: Tried, Tested, TrustedNVIDIA GTC 2019:  Red Hat and the NVIDIA DGX: Tried, Tested, Trusted
NVIDIA GTC 2019: Red Hat and the NVIDIA DGX: Tried, Tested, Trusted
 
08 - it3D Summit 2016 - Grid - T. Riley- NVIDIA
08 - it3D Summit 2016 - Grid - T. Riley- NVIDIA08 - it3D Summit 2016 - Grid - T. Riley- NVIDIA
08 - it3D Summit 2016 - Grid - T. Riley- NVIDIA
 
Cuda
CudaCuda
Cuda
 
Best practices for optimizing Red Hat platforms for large scale datacenter de...
Best practices for optimizing Red Hat platforms for large scale datacenter de...Best practices for optimizing Red Hat platforms for large scale datacenter de...
Best practices for optimizing Red Hat platforms for large scale datacenter de...
 
“A New, Open-standards-based, Open-source Programming Model for All Accelerat...
“A New, Open-standards-based, Open-source Programming Model for All Accelerat...“A New, Open-standards-based, Open-source Programming Model for All Accelerat...
“A New, Open-standards-based, Open-source Programming Model for All Accelerat...
 
Nvidia at SEMICon, Munich
Nvidia at SEMICon, MunichNvidia at SEMICon, Munich
Nvidia at SEMICon, Munich
 
PGI Compilers & Tools Update- March 2018
PGI Compilers & Tools Update- March 2018PGI Compilers & Tools Update- March 2018
PGI Compilers & Tools Update- March 2018
 
Ati Catalyst Preview
Ati Catalyst PreviewAti Catalyst Preview
Ati Catalyst Preview
 

NVIDIA Parallel Nsight Development Environment

  • 2. Agenda Introduction to Parallel Nsight CUDA C/C++ Source Debugging Analysis/System Trace Q&A NVIDIA Confidential
  • 3. What is Parallel Nsight? Development environment for heterogeneous platforms (CPU and GPU) Fully integrated into Microsoft Visual Studio 2008 and 2010 Dramatic productivity improvement in common development tasks NVIDIA Confidential
  • 4. Parallel Nsight for Compute Parallel Compute Debugger System Analyzer (Pro only) Examine compute kernels directly on GPU hardware Capture and visualize CPU and GPU level events on a single correlated timeline Debug CUDA C/C++ and DirectCompute applications Inspect workload dependencies using the Timeline View Profile CUDA kernels using GPU performance counters Visualize thousands of threads executing in parallel using Visual Studio NVIDIA Confidential Use conditional breakpoints to correct errors in massively parallel code
  • 5. CUDA C/C++ Debugging Compile your code with Debug flag Use the familiar Visual Studio interface to debug your GPU code Dramatic productivity improvements Explore memory during a live session vs. coding specific transfers Immediately view live variables vs. printf/recompile loop Set data breakpoints on memory area vs. trial and error And much more! NVIDIA Confidential
  • 12. Trace Powerful performance tool Correlated timeline between CPU and GPU Bottleneck identification Find CPU vs. GPU boundedness Find memory transfer vs. kernel computation bounded Get macro-level information on which CUDA kernels use the most time NVIDIA Confidential
  • 13. Trace across the CPU and GPU NVIDIA Confidential
  • 14. Trace – Overlap Memory Transactions NVIDIA Confidential
  • 15. Tesla Compute Cluster Support (TCC) TCC is a special driver mode for Windows 7, Vista, and HPC Server 2008. Included in our most recent R260 driver release. Benefits Lower overhead to kernel launches = higher performance Running CUDA on a MS Remote desktop Parallel Nsight 1.5 now supports debugging on GPUs using a TCC driver. NVIDIA Confidential
  • 16. 4 Flexible GPU Development Configurations Desktop Single machine, Single NVIDIA GPU Analyzer, Graphics Inspector Single machine, Dual NVIDIA GPUs Analyzer, Graphics Inspector, Compute Debugger Networked Two machines connected over the network Analyzer, Graphics Inspector, Compute Debugger, Graphics Debugger TCP/IP Workstation SLI SLI Multi OS workstation with two Quadro GPUs Analyzer, Graphics Inspector, Compute Debugger, Graphics Debugger NVIDIA Confidential
  • 17. Parallel Nsight 1.5 Feature Support Professional ($349) Standard (no cost) available for purchase in December Compute Debugger   DirectX 10 & 11 Debugger & Graphics inspector   GeForce Support: 9 series or higher   Tesla Support: C1050/S1070 or higher   Quadro Support: G9x or higher   Windows 7, Vista and HPC Server 2008   Visual Studio 2008 SP1 and Visual Studio 2010   Compute Analyzer  OpenGL and OpenCL Analyzer  DirectX 10 & 11 Analyzer  Tesla Compute Cluster (TCC) Debugging  NVIDIA Confidential http://www.nvidia.com/GetParallelNsight
  • 18. Parallel Nsight Resources Parallel Nsight GPU Computing Forum The Parallel Nsight User Guide Installed with the Host installer Available on the Web Links to these from: http://developer.nvidia.com/ParallelNsight NVIDIA Confidential
  • 19. developer.nvidia.com/ParallelNsight Free Download Videos Documentation NVIDIA Confidential