SlideShare a Scribd company logo
1 of 7
Download to read offline
*HPE ProLiant DL380 Gen11 server featuring Intel Xeon Gold 6430 processors vs.
HPE ProLiant DL380 Gen10 server featuring Intel Xeon Gold 6130 processors
Improve AI inference performance with HPE ProLiant
DL380 Gen11 servers, powered by 4th
Generation
Intel Xeon Gold processors
In ResNet-50 image-recognition testing, these servers handled
significantly more samples per second than previous-generation HPE
ProLiant servers while achieving lower latency
Companies across a wide range of industries are embracing AI inference applications. From healthcare to retail
to manufacturing, these applications are solving business problems and providing functionality that would
have been hard for most of us to imagine a few decades ago. Because machine learning is computationally
demanding, decision makers might assume they must engage costly cloud solutions for these workloads.
Or, if they want to run the applications in their own data centers, they might think it’s necessary to invest
in expensive servers backed by graphics processing units (GPUs). In fact, many organizations can meet the
requirements of their AI applications on site with powerful HPE ProLiant DL380 Gen11 servers featuring 4th
Generation Intel®
Xeon®
Gold processors. These processors include accelerators such as Intel®
Advanced
Matrix Extensions (AMX) to improve AI performance.
In the PT data center, we conducted a series of ResNet-50 tests to quantify the AI inference performance
potential of these servers. First, we tested the Floating Point 32-bit (FP32) inference performance of the
HPE ProLiant DL380 Gen11 server and compared it to that of its previous-generation counterpart with 3rd
Generation Xeon processors. Next, we investigated performance of the new server with lower precision levels,
which leverages the Intel AMX accelerator1
in the 4th
Generation Intel Gold processor to increase throughput.
With an FP32 precision level, the HPE ProLiant DL380 Gen11 server processed 2.86 times as many images
per second as the older server did. The HPE ProLiant DL380 Gen11 server also delivered strong performance
at two additional precision levels that benefit from the new Intel AMX accelerators found in 4th
Generation
Intel Scalable processor. Based on our overall findings, the HPE ProLiant DL380 Gen11 server is an excellent
candidate for running a wide range of image-recognition workloads for many organizations.
Process
2.86x
as many images per
second at FP32
precision levels*
Enjoy
strong
performance
at Int8 and bfloat16
precision levels
Reduce latency by
30.1%
at FP32
precision levels*
Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024
A Principled Technologies report: Hands-on testing. Real-world results.
Our test approach
We used the following two servers in testing:
HPE ProLiant DL380 Gen10 server
• Two Intel Xeon Gold 6130 processors
• 256 GB of RAM
• 1TB of SSD storage
• One dual-port 10GbE NIC
HPE ProLiant DL380 Gen11 server
• Two Intel Xeon Gold 6430 processors
• 256 GB of RAM
• 1TB of SSD storage
• One dual-port 10GbE NIC
We set up a bare-metal Linux®
environment on each server. We captured ResNet-50 v1.5 performance (inference
throughput) using the Intel Reference AI model, the Floating Point 32-bit level of inference precision, and
a batch size of 116. The benchmark measured performance in terms of the number of images per second
each server could process and latency. We conducted each test three times and report the median score.
For complete details on our server configurations and step-by-step test methodologies, please see the
science behind the report.
In addition to these comparisons, we tested two levels of inference precision, integer 8 (Int8) with a batch size
of 116 and Brain Floating Point 16-bit (bfloat16) with a batch size of 80, on only the HPE ProLiant DL380 Gen11
server. These lower precision levels are appropriate for use cases with higher throughput demands and lower
accuracy requirements than FP32. The processor in the older HPE ProLiant DL380 Gen10 server does not have
the Intel®
Advanced Matrix Extensions (AMX) extensions that accelerate these precision levels.
About the HPE ProLiant DL380 Gen11 server
According to HPE, the dual-socket 2U ProLiant DL380 Gen11 “delivers exceptional compute
performance, expandability, and scalability for diverse workloads and environments at 1P economics.”2
The server is powered by powered by 4th
and 5th
Generation Intel®
Xeon®
Scalable Processors with up to
64 cores, has increased memory bandwidth, and offers high-speed PCIe Gen5 I/O. HPE says the server
is particularly well-suited to “data-intensive workloads like software-defined storage, video transcoding,
and virtualized apps that require large storage capacity, and high I/O and memory bandwidth.”3
Learn more at https://buy.hpe.com/us/en/compute/rack-servers/proliant-dl300-servers/proliant-
dl380-server/hpe-proliant-dl380-gen11/p/1014696069.
Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 2
What we learned
Figure 1 shows the number of images per second each server processed. The HPE ProLiant DL380 Gen11 server
with Intel Xeon Gold 6430 processors handled 2.86 times as many images per second as the older server, which
means it could do more work with a given number of servers or could perform a fixed amount of inference work
using fewer servers, which has the potential to lead to savings.
About 4th
Generation Intel Xeon Gold 6430 processors
The HPE ProLiant DL380 Gen11 server we tested features Intel Xeon Gold 6430 processors, part of the
4th
Generation Intel Xeon Scalable processor family. According to Intel, its strategy for these processors
aligns processor “cores with built-in accelerators optimized for specific workloads and delivers increased
performance at higher efficiency for optimal total cost of ownership.”4
The processors deliver “a range of features for managing power and performance, making the best
use of resources to achieve key sustainability goals. In addition, the Xeon CPU Max and the Max Series
GPU add high-bandwidth memory and maximum compute density to solve the world’s most challenging
problems faster.”5
Learn more at https://www.intel.com/content/www/us/en/newsroom/resources/press-kit-4th
-gen-
intel-xeon-scalable-processors.html.
Images per second
Higher is better
HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors
877.471
306.158
HPE ProLiant DL380 Gen10 server with Intel Xeon Gold 6130 processors
2.86x
Figure 1: The number of images per second each server processed on the ResNet-50 v1.5 model with FP32 precision. Higher is better.
Source: Principled Technologies.
Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 3
Figure 2 shows the latency each server achieved during testing. The HPE ProLiant DL380 Gen11 server with Intel
Xeon Gold 6430 processors achieved 30.1 percent lower latency than the older server, which means it executed
inference work more quickly.
Latency
Lower is better
HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors
2.113
3.026
HPE ProLiant DL380 Gen10 server with Intel Xeon Gold 6130 processors
Figure 2: The latency in seconds each server delivered on the ResNet-50 v1.5 model with FP32 precision. Lower is better.
Source: Principled Technologies.
About ResNet-50
ResNet-50 is a model that organizations use for image classification, or the process of correctly
identifying objects in images. Strong image classification performance may be vital for use cases in
safety and security, retail, healthcare, manufacturing, and other markets. In our testing, we used the
ResNet-50 v1.5 inference implementation from the Intel®
AI Reference Models repository, which uses the
ResNet-50 model and measures the number of (inference) samples per second a solution processes.
Learn more at https://github.com/IntelAI/models/tree/master/quickstart/image_recognition/
tensorflow/resnet50v1_5/inference/cpu.
About the Intel®
AI Reference Models repository
According to Intel, its AI Reference Models repository contains “links to pre-trained models, sample
scripts, best practices, and step-by-step tutorials for many popular open-source machine learning
models optimized by Intel to run on Intel®
Xeon®
Scalable processors and Intel®
Data Center GPUs.”6
Learn more at https://github.com/IntelAI/models.
Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 4
What this could mean for your company
The potential business applications of image recognition AI are limitless, but to put our test results into context,
let’s look at a few industries.
Manufacturing
Image recognition use cases in manufacturing range from quality inspection to equipment maintenance to
worker safety. Computer vision technologies can inspect for usage of personal protective equipment (PPE), such
as masks and helmets, and can monitor how well workers follow safety protocols in factories or on construction
sites.7
By choosing the HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors over the
previous-generation server to run these applications, companies can save by performing a given amount of work
with fewer servers.
Agriculture
Image recognition is solving many business problems in agriculture, with use cases including animal monitoring,
estimating crop yield by counting fruits and vegetables, and automatic harvesting, which detects when crops
are ready for picking robots to harvest them. Farmers can even use computer vision systems to demonstrate
compliance with animal welfare regulations.8
For all these applications, running them on the HPE ProLiant DL380
Gen11 server with Intel Xeon Gold 6430 processors vs. its previous-generation counterpart could deliver reliable
information faster.
Healthcare
In healthcare, AI is not only helping to detect disease, but also assists with logistical tasks such as inspecting
hospitals for hygiene. Another application uses facial recognition to identify patients, which provides benefits
such as “helping prevent medical identity fraud, streamlining the registration process, and preventing
unauthorized access to sensitive information.”9
Hospitals and medical practices that select HPE ProLiant
DL380 Gen11 server with Intel Xeon Gold 6430 processors rather than the older server we tested can get
answers earlier.
Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 5
A look at lower inference precision levels on the
HPE ProLiant DL380 Gen11 server
As we noted earlier, we tested two lower levels of inference precision, Int8 and bfloat16, on the HPE ProLiant
DL380 Gen11 server. Brain Float 16-bit is a condensed form of the single-precision floating-point format FP32
uses, where 8 exponent bits augment 8-bit precision, vs. 24-bit significance with 8 exponent bits. While this
allows bfloat16 a wider dynamic range of numbers than would be available with 8-bit precision alone, it does
so at the cost of accuracy. Int8 uses 8-bit integers instead of floating-point numbers, and integer instead of
floating-point math, which significantly lowers the compute, memory, and storage overhead for machine
learning applications. These lower precision levels grant higher throughput and lower latency, allowing for faster
responses to queries, and potentially serving more concurrent users in a datacenter environment.
Figure 3 compares the performance in terms of images per second on the HPE ProLiant DL380 Gen11 server
with the three precision levels. As it shows, the server processed from 3 to 5 times as many images per second
using bfloat16 and Int8 precision levels as it did using FP32.
We attribute this strong performance in large part to the 4th
Generation Intel Xeon Scalable processors in this
server, which come with several built-in accelerators to help enhance workload performance. These include Intel®
AMX, which “improves the performance of deep-learning training and inference on the CPU and is ideal for
workloads like natural-language processing, recommendation systems, and image recognition.”10
Images per second the HPE ProLiant DL380 Gen11 server achieved
at different precision levels
Higher is better
FP32
877.47
2,819.42 3.2x
bfloat16
5,027.80
Int8
5.7x
Figure 3: The number of images per second the HPE ProLiant DL380 Gen11 server processed on the ResNet-50 v1.5 model with FP32,
bfloat16, and Int8 precision. Higher is better. Source: Principled Technologies.
Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 6
Conclusion
Companies using AI inference to solve business problems have a range of choices for running these
computationally demanding applications. We explored the potential of one solution, the HPE ProLiant DL380
Gen11 server featuring 4th
Generation Intel Xeon Gold processors. We compared this server to its previous-
generation counterpart on ResNet-50 tests using FP32 precision and found it delivered 2.86 times the inference
performance while reducing latency by 30.1 percent. We also tested the HPE ProLiant DL380 Gen11 server at
lower precision levels, which place greater demand on CPU resources, and found its performance to be strong
with both Int8 and bfloat16 precision levels. Compared to potentially pricey pay-as-you-go cloud solutions
and high-end GPU-based server solutions, the HPE ProLiant DL380 Gen11 we tested can be a smart option for
businesses harnessing the power of AI imaging applications.
1. Intel, “Advanced Matrix Extensions Overview,” accessed November 28, 2023, https://www.intel.com/content/www/us/
en/products/docs/accelerator-engines/advanced-matrix-extensions/overview.html.
2. HPE, “HPE ProLiant DL380 Gen11 server,” accessed November 30, 2023, https://buy.hpe.com/us/en/compute/
rack-servers/proliant-dl300-servers/proliant-dl380-server/hpe-proliant-dl380-gen11/p/1014696069.
3. HPE, “HPE ProLiant DL380 Gen11 server,” accessed November 30, 2023, https://buy.hpe.com/us/en/compute/
rack-servers/proliant-dl300-servers/proliant-dl380-server/hpe-proliant-dl380-gen11/p/1014696069.
4. Intel, “4th
Gen Intel Xeon Scalable Processors,” accessed November 28, 2023, https://www.intel.com/content/www/us/
en/newsroom/resources/press-kit-4th
-gen-intel-xeon-scalable-processors.html.
5. Intel, “4th
Gen Intel Xeon Scalable Processors.”
6. GitHub, Intel AI Models, accessed November 28, 2023, https://github.com/IntelAI/models.
7. Viso.ai, “The 100 Most Popular Computer Vision Applications in 2024,” accessed November 28, 2023,
https://viso.ai/applications/computer-vision-applications/.
8. Viso.ai, “Top Applications of Computer Vision in Agriculture,” accessed November 28, 2023,
https://viso.ai/applications/computer-vision-in-agriculture/.
9. Viso.ai, “Top 19 Applications Of Deep Learning and Computer Vision In Healthcare,” accessed November 28, 2023,
https://viso.ai/applications/computer-vision-in-healthcare/.
10. Intel, “Advanced Matrix Extensions Overview,” accessed November 28, 2023, https://www.intel.com/content/www/us/
en/products/docs/accelerator-engines/advanced-matrix-extensions/overview.html.
Principled Technologies is a registered trademark of Principled Technologies, Inc.
All other product names are the trademarks of their respective owners.
For additional information, review the science behind this report.
Principled
Technologies®
Facts matter.®
Principled
Technologies®
Facts matter.®
This project was commissioned by HPE.
Read the science behind this report at https://facts.pt/kh4Gyf2
Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 7

More Related Content

Similar to Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Gold processors

Reducing tco white paper rev5
Reducing tco white paper rev5Reducing tco white paper rev5
Reducing tco white paper rev5Manoj Punamia
 
Intel xeon-scalable-processors-overview
Intel xeon-scalable-processors-overviewIntel xeon-scalable-processors-overview
Intel xeon-scalable-processors-overviewDESMOND YUEN
 
Intel® Xeon® processor E7-8800/4800 v3 Application Showcase
Intel® Xeon® processor E7-8800/4800 v3 Application ShowcaseIntel® Xeon® processor E7-8800/4800 v3 Application Showcase
Intel® Xeon® processor E7-8800/4800 v3 Application ShowcaseIntel IT Center
 
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase – Big D...
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase – Big D...Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase – Big D...
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase – Big D...Intel IT Center
 
Introducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
Introducing the Intel® Xeon Phi Coprocessor Architecture for DiscoveryIntroducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
Introducing the Intel® Xeon Phi Coprocessor Architecture for DiscoveryIntel IT Center
 
Lynn Comp - Big Data & Cloud Summit 2013
Lynn Comp - Big Data & Cloud Summit 2013Lynn Comp - Big Data & Cloud Summit 2013
Lynn Comp - Big Data & Cloud Summit 2013IntelAPAC
 
Accelerating Insights in the Technical Computing Transformation
Accelerating Insights in the Technical Computing TransformationAccelerating Insights in the Technical Computing Transformation
Accelerating Insights in the Technical Computing TransformationIntel IT Center
 
Intel Knights Landing Slides
Intel Knights Landing SlidesIntel Knights Landing Slides
Intel Knights Landing SlidesRonen Mendezitsky
 
Boost your work with hardware from Intel
Boost your work with hardware from IntelBoost your work with hardware from Intel
Boost your work with hardware from IntelPrincipled Technologies
 
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...
	 Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...	 Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...Intel IT Center
 
Intel® Xeon® Scalable Processors Enabled Applications Marketing Guide
Intel® Xeon® Scalable Processors Enabled Applications Marketing GuideIntel® Xeon® Scalable Processors Enabled Applications Marketing Guide
Intel® Xeon® Scalable Processors Enabled Applications Marketing GuideIntel IT Center
 
Intel® Xeon® Processor E5-2600 v4 Product Family EAMG
Intel® Xeon® Processor E5-2600 v4 Product Family EAMGIntel® Xeon® Processor E5-2600 v4 Product Family EAMG
Intel® Xeon® Processor E5-2600 v4 Product Family EAMGIntel IT Center
 
Check items off your to-do list faster with Dell OptiPlex small and micro for...
Check items off your to-do list faster with Dell OptiPlex small and micro for...Check items off your to-do list faster with Dell OptiPlex small and micro for...
Check items off your to-do list faster with Dell OptiPlex small and micro for...Principled Technologies
 
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...tdc-globalcode
 
Make the most of your time as well as your space with Dell OptiPlex small and...
Make the most of your time as well as your space with Dell OptiPlex small and...Make the most of your time as well as your space with Dell OptiPlex small and...
Make the most of your time as well as your space with Dell OptiPlex small and...Principled Technologies
 
Fujitsu World Tour 2017 - Digital Business with SAP & Fujitsu
Fujitsu World Tour 2017 - Digital Business with SAP & FujitsuFujitsu World Tour 2017 - Digital Business with SAP & Fujitsu
Fujitsu World Tour 2017 - Digital Business with SAP & FujitsuFujitsu India
 
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...Intel® Software
 

Similar to Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Gold processors (20)

Reducing tco white paper rev5
Reducing tco white paper rev5Reducing tco white paper rev5
Reducing tco white paper rev5
 
Intel xeon-scalable-processors-overview
Intel xeon-scalable-processors-overviewIntel xeon-scalable-processors-overview
Intel xeon-scalable-processors-overview
 
Intel® Xeon® processor E7-8800/4800 v3 Application Showcase
Intel® Xeon® processor E7-8800/4800 v3 Application ShowcaseIntel® Xeon® processor E7-8800/4800 v3 Application Showcase
Intel® Xeon® processor E7-8800/4800 v3 Application Showcase
 
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase – Big D...
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase – Big D...Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase – Big D...
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase – Big D...
 
Introducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
Introducing the Intel® Xeon Phi Coprocessor Architecture for DiscoveryIntroducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
Introducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
 
Intel 6th Gen vPro
Intel 6th Gen vProIntel 6th Gen vPro
Intel 6th Gen vPro
 
Lynn Comp - Big Data & Cloud Summit 2013
Lynn Comp - Big Data & Cloud Summit 2013Lynn Comp - Big Data & Cloud Summit 2013
Lynn Comp - Big Data & Cloud Summit 2013
 
Accelerating Insights in the Technical Computing Transformation
Accelerating Insights in the Technical Computing TransformationAccelerating Insights in the Technical Computing Transformation
Accelerating Insights in the Technical Computing Transformation
 
Intel Knights Landing Slides
Intel Knights Landing SlidesIntel Knights Landing Slides
Intel Knights Landing Slides
 
Boost your work with hardware from Intel
Boost your work with hardware from IntelBoost your work with hardware from Intel
Boost your work with hardware from Intel
 
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...
	 Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...	 Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...
Intel® Xeon® Processor E5-2600 v3 Product Family Application Showcase - Fin...
 
Intel® Xeon® Scalable Processors Enabled Applications Marketing Guide
Intel® Xeon® Scalable Processors Enabled Applications Marketing GuideIntel® Xeon® Scalable Processors Enabled Applications Marketing Guide
Intel® Xeon® Scalable Processors Enabled Applications Marketing Guide
 
Intel® Xeon® Processor E5-2600 v4 Product Family EAMG
Intel® Xeon® Processor E5-2600 v4 Product Family EAMGIntel® Xeon® Processor E5-2600 v4 Product Family EAMG
Intel® Xeon® Processor E5-2600 v4 Product Family EAMG
 
AIDC India - AI on IA
AIDC India  - AI on IAAIDC India  - AI on IA
AIDC India - AI on IA
 
Check items off your to-do list faster with Dell OptiPlex small and micro for...
Check items off your to-do list faster with Dell OptiPlex small and micro for...Check items off your to-do list faster with Dell OptiPlex small and micro for...
Check items off your to-do list faster with Dell OptiPlex small and micro for...
 
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
 
Accelerate your growth potential
Accelerate your growth potentialAccelerate your growth potential
Accelerate your growth potential
 
Make the most of your time as well as your space with Dell OptiPlex small and...
Make the most of your time as well as your space with Dell OptiPlex small and...Make the most of your time as well as your space with Dell OptiPlex small and...
Make the most of your time as well as your space with Dell OptiPlex small and...
 
Fujitsu World Tour 2017 - Digital Business with SAP & Fujitsu
Fujitsu World Tour 2017 - Digital Business with SAP & FujitsuFujitsu World Tour 2017 - Digital Business with SAP & Fujitsu
Fujitsu World Tour 2017 - Digital Business with SAP & Fujitsu
 
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...
Tuning For Deep Learning Inference with Intel® Processor Graphics | SIGGRAPH ...
 

More from Principled Technologies

Investing in GenAI: Cost‑benefit analysis of Dell on‑premises deployments vs....
Investing in GenAI: Cost‑benefit analysis of Dell on‑premises deployments vs....Investing in GenAI: Cost‑benefit analysis of Dell on‑premises deployments vs....
Investing in GenAI: Cost‑benefit analysis of Dell on‑premises deployments vs....Principled Technologies
 
Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...
Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...
Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...Principled Technologies
 
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Principled Technologies
 
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Principled Technologies
 
Realize 2.1X the performance with 20% less power with AMD EPYC processor-back...
Realize 2.1X the performance with 20% less power with AMD EPYC processor-back...Realize 2.1X the performance with 20% less power with AMD EPYC processor-back...
Realize 2.1X the performance with 20% less power with AMD EPYC processor-back...Principled Technologies
 
Improve performance and gain room to grow by easily migrating to a modern Ope...
Improve performance and gain room to grow by easily migrating to a modern Ope...Improve performance and gain room to grow by easily migrating to a modern Ope...
Improve performance and gain room to grow by easily migrating to a modern Ope...Principled Technologies
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Principled Technologies
 
A comparison of security features in Dell, HP, and Lenovo PC systems
A comparison of security features in Dell, HP, and Lenovo PC systemsA comparison of security features in Dell, HP, and Lenovo PC systems
A comparison of security features in Dell, HP, and Lenovo PC systemsPrincipled Technologies
 
Increase security, sustainability, and efficiency with robust Dell server man...
Increase security, sustainability, and efficiency with robust Dell server man...Increase security, sustainability, and efficiency with robust Dell server man...
Increase security, sustainability, and efficiency with robust Dell server man...Principled Technologies
 
Increase security, sustainability, and efficiency with robust Dell server man...
Increase security, sustainability, and efficiency with robust Dell server man...Increase security, sustainability, and efficiency with robust Dell server man...
Increase security, sustainability, and efficiency with robust Dell server man...Principled Technologies
 
Scale up your storage with higher-performing Dell APEX Block Storage for AWS ...
Scale up your storage with higher-performing Dell APEX Block Storage for AWS ...Scale up your storage with higher-performing Dell APEX Block Storage for AWS ...
Scale up your storage with higher-performing Dell APEX Block Storage for AWS ...Principled Technologies
 
Scale up your storage with higher-performing Dell APEX Block Storage for AWS
Scale up your storage with higher-performing Dell APEX Block Storage for AWSScale up your storage with higher-performing Dell APEX Block Storage for AWS
Scale up your storage with higher-performing Dell APEX Block Storage for AWSPrincipled Technologies
 
Get in and stay in the productivity zone with the HP Z2 G9 Tower Workstation
Get in and stay in the productivity zone with the HP Z2 G9 Tower WorkstationGet in and stay in the productivity zone with the HP Z2 G9 Tower Workstation
Get in and stay in the productivity zone with the HP Z2 G9 Tower WorkstationPrincipled Technologies
 
Open up new possibilities with higher transactional database performance from...
Open up new possibilities with higher transactional database performance from...Open up new possibilities with higher transactional database performance from...
Open up new possibilities with higher transactional database performance from...Principled Technologies
 
Improving database performance and value with an easy migration to Azure Data...
Improving database performance and value with an easy migration to Azure Data...Improving database performance and value with an easy migration to Azure Data...
Improving database performance and value with an easy migration to Azure Data...Principled Technologies
 
Realize better value and performance migrating from Azure Database for Postgr...
Realize better value and performance migrating from Azure Database for Postgr...Realize better value and performance migrating from Azure Database for Postgr...
Realize better value and performance migrating from Azure Database for Postgr...Principled Technologies
 
Realize better value and performance migrating from Azure Database for Postgr...
Realize better value and performance migrating from Azure Database for Postgr...Realize better value and performance migrating from Azure Database for Postgr...
Realize better value and performance migrating from Azure Database for Postgr...Principled Technologies
 
Set up students and teachers to excel now and in the future with Intel proces...
Set up students and teachers to excel now and in the future with Intel proces...Set up students and teachers to excel now and in the future with Intel proces...
Set up students and teachers to excel now and in the future with Intel proces...Principled Technologies
 

More from Principled Technologies (20)

Investing in GenAI: Cost‑benefit analysis of Dell on‑premises deployments vs....
Investing in GenAI: Cost‑benefit analysis of Dell on‑premises deployments vs....Investing in GenAI: Cost‑benefit analysis of Dell on‑premises deployments vs....
Investing in GenAI: Cost‑benefit analysis of Dell on‑premises deployments vs....
 
Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...
Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...
Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...
 
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
 
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
 
Realize 2.1X the performance with 20% less power with AMD EPYC processor-back...
Realize 2.1X the performance with 20% less power with AMD EPYC processor-back...Realize 2.1X the performance with 20% less power with AMD EPYC processor-back...
Realize 2.1X the performance with 20% less power with AMD EPYC processor-back...
 
Improve performance and gain room to grow by easily migrating to a modern Ope...
Improve performance and gain room to grow by easily migrating to a modern Ope...Improve performance and gain room to grow by easily migrating to a modern Ope...
Improve performance and gain room to grow by easily migrating to a modern Ope...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
Upgrade your cloud infrastructure with Dell PowerEdge R760 servers and VMware...
 
A comparison of security features in Dell, HP, and Lenovo PC systems
A comparison of security features in Dell, HP, and Lenovo PC systemsA comparison of security features in Dell, HP, and Lenovo PC systems
A comparison of security features in Dell, HP, and Lenovo PC systems
 
Increase security, sustainability, and efficiency with robust Dell server man...
Increase security, sustainability, and efficiency with robust Dell server man...Increase security, sustainability, and efficiency with robust Dell server man...
Increase security, sustainability, and efficiency with robust Dell server man...
 
Increase security, sustainability, and efficiency with robust Dell server man...
Increase security, sustainability, and efficiency with robust Dell server man...Increase security, sustainability, and efficiency with robust Dell server man...
Increase security, sustainability, and efficiency with robust Dell server man...
 
Scale up your storage with higher-performing Dell APEX Block Storage for AWS ...
Scale up your storage with higher-performing Dell APEX Block Storage for AWS ...Scale up your storage with higher-performing Dell APEX Block Storage for AWS ...
Scale up your storage with higher-performing Dell APEX Block Storage for AWS ...
 
Scale up your storage with higher-performing Dell APEX Block Storage for AWS
Scale up your storage with higher-performing Dell APEX Block Storage for AWSScale up your storage with higher-performing Dell APEX Block Storage for AWS
Scale up your storage with higher-performing Dell APEX Block Storage for AWS
 
Get in and stay in the productivity zone with the HP Z2 G9 Tower Workstation
Get in and stay in the productivity zone with the HP Z2 G9 Tower WorkstationGet in and stay in the productivity zone with the HP Z2 G9 Tower Workstation
Get in and stay in the productivity zone with the HP Z2 G9 Tower Workstation
 
Open up new possibilities with higher transactional database performance from...
Open up new possibilities with higher transactional database performance from...Open up new possibilities with higher transactional database performance from...
Open up new possibilities with higher transactional database performance from...
 
Improving database performance and value with an easy migration to Azure Data...
Improving database performance and value with an easy migration to Azure Data...Improving database performance and value with an easy migration to Azure Data...
Improving database performance and value with an easy migration to Azure Data...
 
Realize better value and performance migrating from Azure Database for Postgr...
Realize better value and performance migrating from Azure Database for Postgr...Realize better value and performance migrating from Azure Database for Postgr...
Realize better value and performance migrating from Azure Database for Postgr...
 
Realize better value and performance migrating from Azure Database for Postgr...
Realize better value and performance migrating from Azure Database for Postgr...Realize better value and performance migrating from Azure Database for Postgr...
Realize better value and performance migrating from Azure Database for Postgr...
 
Set up students and teachers to excel now and in the future with Intel proces...
Set up students and teachers to excel now and in the future with Intel proces...Set up students and teachers to excel now and in the future with Intel proces...
Set up students and teachers to excel now and in the future with Intel proces...
 

Recently uploaded

Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 

Recently uploaded (20)

Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Gold processors

  • 1. *HPE ProLiant DL380 Gen11 server featuring Intel Xeon Gold 6430 processors vs. HPE ProLiant DL380 Gen10 server featuring Intel Xeon Gold 6130 processors Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Gold processors In ResNet-50 image-recognition testing, these servers handled significantly more samples per second than previous-generation HPE ProLiant servers while achieving lower latency Companies across a wide range of industries are embracing AI inference applications. From healthcare to retail to manufacturing, these applications are solving business problems and providing functionality that would have been hard for most of us to imagine a few decades ago. Because machine learning is computationally demanding, decision makers might assume they must engage costly cloud solutions for these workloads. Or, if they want to run the applications in their own data centers, they might think it’s necessary to invest in expensive servers backed by graphics processing units (GPUs). In fact, many organizations can meet the requirements of their AI applications on site with powerful HPE ProLiant DL380 Gen11 servers featuring 4th Generation Intel® Xeon® Gold processors. These processors include accelerators such as Intel® Advanced Matrix Extensions (AMX) to improve AI performance. In the PT data center, we conducted a series of ResNet-50 tests to quantify the AI inference performance potential of these servers. First, we tested the Floating Point 32-bit (FP32) inference performance of the HPE ProLiant DL380 Gen11 server and compared it to that of its previous-generation counterpart with 3rd Generation Xeon processors. Next, we investigated performance of the new server with lower precision levels, which leverages the Intel AMX accelerator1 in the 4th Generation Intel Gold processor to increase throughput. With an FP32 precision level, the HPE ProLiant DL380 Gen11 server processed 2.86 times as many images per second as the older server did. The HPE ProLiant DL380 Gen11 server also delivered strong performance at two additional precision levels that benefit from the new Intel AMX accelerators found in 4th Generation Intel Scalable processor. Based on our overall findings, the HPE ProLiant DL380 Gen11 server is an excellent candidate for running a wide range of image-recognition workloads for many organizations. Process 2.86x as many images per second at FP32 precision levels* Enjoy strong performance at Int8 and bfloat16 precision levels Reduce latency by 30.1% at FP32 precision levels* Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 A Principled Technologies report: Hands-on testing. Real-world results.
  • 2. Our test approach We used the following two servers in testing: HPE ProLiant DL380 Gen10 server • Two Intel Xeon Gold 6130 processors • 256 GB of RAM • 1TB of SSD storage • One dual-port 10GbE NIC HPE ProLiant DL380 Gen11 server • Two Intel Xeon Gold 6430 processors • 256 GB of RAM • 1TB of SSD storage • One dual-port 10GbE NIC We set up a bare-metal Linux® environment on each server. We captured ResNet-50 v1.5 performance (inference throughput) using the Intel Reference AI model, the Floating Point 32-bit level of inference precision, and a batch size of 116. The benchmark measured performance in terms of the number of images per second each server could process and latency. We conducted each test three times and report the median score. For complete details on our server configurations and step-by-step test methodologies, please see the science behind the report. In addition to these comparisons, we tested two levels of inference precision, integer 8 (Int8) with a batch size of 116 and Brain Floating Point 16-bit (bfloat16) with a batch size of 80, on only the HPE ProLiant DL380 Gen11 server. These lower precision levels are appropriate for use cases with higher throughput demands and lower accuracy requirements than FP32. The processor in the older HPE ProLiant DL380 Gen10 server does not have the Intel® Advanced Matrix Extensions (AMX) extensions that accelerate these precision levels. About the HPE ProLiant DL380 Gen11 server According to HPE, the dual-socket 2U ProLiant DL380 Gen11 “delivers exceptional compute performance, expandability, and scalability for diverse workloads and environments at 1P economics.”2 The server is powered by powered by 4th and 5th Generation Intel® Xeon® Scalable Processors with up to 64 cores, has increased memory bandwidth, and offers high-speed PCIe Gen5 I/O. HPE says the server is particularly well-suited to “data-intensive workloads like software-defined storage, video transcoding, and virtualized apps that require large storage capacity, and high I/O and memory bandwidth.”3 Learn more at https://buy.hpe.com/us/en/compute/rack-servers/proliant-dl300-servers/proliant- dl380-server/hpe-proliant-dl380-gen11/p/1014696069. Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 2
  • 3. What we learned Figure 1 shows the number of images per second each server processed. The HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors handled 2.86 times as many images per second as the older server, which means it could do more work with a given number of servers or could perform a fixed amount of inference work using fewer servers, which has the potential to lead to savings. About 4th Generation Intel Xeon Gold 6430 processors The HPE ProLiant DL380 Gen11 server we tested features Intel Xeon Gold 6430 processors, part of the 4th Generation Intel Xeon Scalable processor family. According to Intel, its strategy for these processors aligns processor “cores with built-in accelerators optimized for specific workloads and delivers increased performance at higher efficiency for optimal total cost of ownership.”4 The processors deliver “a range of features for managing power and performance, making the best use of resources to achieve key sustainability goals. In addition, the Xeon CPU Max and the Max Series GPU add high-bandwidth memory and maximum compute density to solve the world’s most challenging problems faster.”5 Learn more at https://www.intel.com/content/www/us/en/newsroom/resources/press-kit-4th -gen- intel-xeon-scalable-processors.html. Images per second Higher is better HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors 877.471 306.158 HPE ProLiant DL380 Gen10 server with Intel Xeon Gold 6130 processors 2.86x Figure 1: The number of images per second each server processed on the ResNet-50 v1.5 model with FP32 precision. Higher is better. Source: Principled Technologies. Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 3
  • 4. Figure 2 shows the latency each server achieved during testing. The HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors achieved 30.1 percent lower latency than the older server, which means it executed inference work more quickly. Latency Lower is better HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors 2.113 3.026 HPE ProLiant DL380 Gen10 server with Intel Xeon Gold 6130 processors Figure 2: The latency in seconds each server delivered on the ResNet-50 v1.5 model with FP32 precision. Lower is better. Source: Principled Technologies. About ResNet-50 ResNet-50 is a model that organizations use for image classification, or the process of correctly identifying objects in images. Strong image classification performance may be vital for use cases in safety and security, retail, healthcare, manufacturing, and other markets. In our testing, we used the ResNet-50 v1.5 inference implementation from the Intel® AI Reference Models repository, which uses the ResNet-50 model and measures the number of (inference) samples per second a solution processes. Learn more at https://github.com/IntelAI/models/tree/master/quickstart/image_recognition/ tensorflow/resnet50v1_5/inference/cpu. About the Intel® AI Reference Models repository According to Intel, its AI Reference Models repository contains “links to pre-trained models, sample scripts, best practices, and step-by-step tutorials for many popular open-source machine learning models optimized by Intel to run on Intel® Xeon® Scalable processors and Intel® Data Center GPUs.”6 Learn more at https://github.com/IntelAI/models. Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 4
  • 5. What this could mean for your company The potential business applications of image recognition AI are limitless, but to put our test results into context, let’s look at a few industries. Manufacturing Image recognition use cases in manufacturing range from quality inspection to equipment maintenance to worker safety. Computer vision technologies can inspect for usage of personal protective equipment (PPE), such as masks and helmets, and can monitor how well workers follow safety protocols in factories or on construction sites.7 By choosing the HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors over the previous-generation server to run these applications, companies can save by performing a given amount of work with fewer servers. Agriculture Image recognition is solving many business problems in agriculture, with use cases including animal monitoring, estimating crop yield by counting fruits and vegetables, and automatic harvesting, which detects when crops are ready for picking robots to harvest them. Farmers can even use computer vision systems to demonstrate compliance with animal welfare regulations.8 For all these applications, running them on the HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors vs. its previous-generation counterpart could deliver reliable information faster. Healthcare In healthcare, AI is not only helping to detect disease, but also assists with logistical tasks such as inspecting hospitals for hygiene. Another application uses facial recognition to identify patients, which provides benefits such as “helping prevent medical identity fraud, streamlining the registration process, and preventing unauthorized access to sensitive information.”9 Hospitals and medical practices that select HPE ProLiant DL380 Gen11 server with Intel Xeon Gold 6430 processors rather than the older server we tested can get answers earlier. Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 5
  • 6. A look at lower inference precision levels on the HPE ProLiant DL380 Gen11 server As we noted earlier, we tested two lower levels of inference precision, Int8 and bfloat16, on the HPE ProLiant DL380 Gen11 server. Brain Float 16-bit is a condensed form of the single-precision floating-point format FP32 uses, where 8 exponent bits augment 8-bit precision, vs. 24-bit significance with 8 exponent bits. While this allows bfloat16 a wider dynamic range of numbers than would be available with 8-bit precision alone, it does so at the cost of accuracy. Int8 uses 8-bit integers instead of floating-point numbers, and integer instead of floating-point math, which significantly lowers the compute, memory, and storage overhead for machine learning applications. These lower precision levels grant higher throughput and lower latency, allowing for faster responses to queries, and potentially serving more concurrent users in a datacenter environment. Figure 3 compares the performance in terms of images per second on the HPE ProLiant DL380 Gen11 server with the three precision levels. As it shows, the server processed from 3 to 5 times as many images per second using bfloat16 and Int8 precision levels as it did using FP32. We attribute this strong performance in large part to the 4th Generation Intel Xeon Scalable processors in this server, which come with several built-in accelerators to help enhance workload performance. These include Intel® AMX, which “improves the performance of deep-learning training and inference on the CPU and is ideal for workloads like natural-language processing, recommendation systems, and image recognition.”10 Images per second the HPE ProLiant DL380 Gen11 server achieved at different precision levels Higher is better FP32 877.47 2,819.42 3.2x bfloat16 5,027.80 Int8 5.7x Figure 3: The number of images per second the HPE ProLiant DL380 Gen11 server processed on the ResNet-50 v1.5 model with FP32, bfloat16, and Int8 precision. Higher is better. Source: Principled Technologies. Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 6
  • 7. Conclusion Companies using AI inference to solve business problems have a range of choices for running these computationally demanding applications. We explored the potential of one solution, the HPE ProLiant DL380 Gen11 server featuring 4th Generation Intel Xeon Gold processors. We compared this server to its previous- generation counterpart on ResNet-50 tests using FP32 precision and found it delivered 2.86 times the inference performance while reducing latency by 30.1 percent. We also tested the HPE ProLiant DL380 Gen11 server at lower precision levels, which place greater demand on CPU resources, and found its performance to be strong with both Int8 and bfloat16 precision levels. Compared to potentially pricey pay-as-you-go cloud solutions and high-end GPU-based server solutions, the HPE ProLiant DL380 Gen11 we tested can be a smart option for businesses harnessing the power of AI imaging applications. 1. Intel, “Advanced Matrix Extensions Overview,” accessed November 28, 2023, https://www.intel.com/content/www/us/ en/products/docs/accelerator-engines/advanced-matrix-extensions/overview.html. 2. HPE, “HPE ProLiant DL380 Gen11 server,” accessed November 30, 2023, https://buy.hpe.com/us/en/compute/ rack-servers/proliant-dl300-servers/proliant-dl380-server/hpe-proliant-dl380-gen11/p/1014696069. 3. HPE, “HPE ProLiant DL380 Gen11 server,” accessed November 30, 2023, https://buy.hpe.com/us/en/compute/ rack-servers/proliant-dl300-servers/proliant-dl380-server/hpe-proliant-dl380-gen11/p/1014696069. 4. Intel, “4th Gen Intel Xeon Scalable Processors,” accessed November 28, 2023, https://www.intel.com/content/www/us/ en/newsroom/resources/press-kit-4th -gen-intel-xeon-scalable-processors.html. 5. Intel, “4th Gen Intel Xeon Scalable Processors.” 6. GitHub, Intel AI Models, accessed November 28, 2023, https://github.com/IntelAI/models. 7. Viso.ai, “The 100 Most Popular Computer Vision Applications in 2024,” accessed November 28, 2023, https://viso.ai/applications/computer-vision-applications/. 8. Viso.ai, “Top Applications of Computer Vision in Agriculture,” accessed November 28, 2023, https://viso.ai/applications/computer-vision-in-agriculture/. 9. Viso.ai, “Top 19 Applications Of Deep Learning and Computer Vision In Healthcare,” accessed November 28, 2023, https://viso.ai/applications/computer-vision-in-healthcare/. 10. Intel, “Advanced Matrix Extensions Overview,” accessed November 28, 2023, https://www.intel.com/content/www/us/ en/products/docs/accelerator-engines/advanced-matrix-extensions/overview.html. Principled Technologies is a registered trademark of Principled Technologies, Inc. All other product names are the trademarks of their respective owners. For additional information, review the science behind this report. Principled Technologies® Facts matter.® Principled Technologies® Facts matter.® This project was commissioned by HPE. Read the science behind this report at https://facts.pt/kh4Gyf2 Improve AI inference performance with HPE ProLiant DL380 Gen11 servers, powered by 4th Generation Intel Xeon Scalable processors January 2024 | 7