Our results running a vGPU-accelerated deep learning image-classification workload in this environment

We ran a deep learning image-classification workload on a Dell PowerEdge R750 server equipped with an NVIDIA A100 Tensor Core GPU, operating in a VMware vSphere with Tanzu Kubernetes container environment alongside two other servers. Using GPU virtualization to let 10 containers share the single GPU, our test solution achieved a maximum of 29,896 samples per second. With a single container using all of the GPU's resources and a larger batch size, the solution achieved a maximum of 34,352 samples per second. These results show that the PowerEdge R750 with an NVIDIA A100 Tensor Core GPU in a VMware Kubernetes environment can use GPU virtualization to flexibly apportion GPU compute capability across multiple machine learning workloads in Kubernetes clusters.
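As a rough arithmetic check on the two figures above, the sketch below derives the average per-container throughput in the shared configuration and the ratio of shared aggregate throughput to dedicated throughput. Both derived metrics are our own illustration, not values reported in the testing:

```python
# Reported results: 10 containers sharing one A100 via GPU virtualization
# vs. one container using the full GPU with a larger batch size.
shared_total = 29896   # samples/s, aggregate across 10 shared containers
single_total = 34352   # samples/s, single container with the whole GPU

per_container = shared_total / 10           # average share per container
ratio = shared_total / single_total         # shared aggregate vs. dedicated

print(f"Per-container throughput: {per_container:.1f} samples/s")
print(f"Shared vs. dedicated aggregate throughput: {ratio:.1%}")
```

By this back-of-the-envelope comparison, sharing the GPU across 10 containers retained roughly 87% of the single-container aggregate throughput, which is the trade-off behind the flexibility claim in the paragraph above.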