I/O for Virtualized Multi-Core Servers
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
855
On Slideshare
855
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
18
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. I/O for Virtualized Multi-Core Servers Iddo Kadim, Intel Corporation Director, Virtualization Technology Server Platforms Group Intel Corporation 1
  • 2. Agenda The value of multi-core computing The virtualized server Optimizing I/O in the virtualized server 2 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 3. Insatiable Demand for Servers Weather Prediction Oil Exploration Social Networks Genomics Research Financial Analysis Medical Imaging Digital Content UNITS Design Simulation Cloud Computing Data Center Refresh 1980 1985 1990 1995 2000 2005 2010 3 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 4. IT Constraints on Growth Server Sprawl >30m physical servers currently installed: growing 4X next 10 years Power & Cooling For every $1 spent on server HW, 50 cents spent on power/cooling Operating Cost $8 in maintenance for every $1 spent on new infrastructure Space Crunch Datacenter construction costs >$1,000 /sq ft Source: IDC, "Virtualization Across the Enterprise: Progressing From Cost Reduction to Business Growth," 4 doc # VFE2008_04, May 2008. Source: IDC. * Other names and brands may be claimed as the property of others. * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved Copyright © 2008, Intel Corporation. All rights reserved
  • 5. Impact of Multi-Core 14 12 Relative Performance 10 8 6 4 2 Dual Quad Core Core 65nm 45nm 0 2002 2003 2004 2005 2006 2007 2008 Intel Single Core Intel Multi Core Quad-Core Benefits a Broad Range of Workloads For notes and disclaimers, see legal information slide at end of this presentation. 5 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 6. Server Compression 2004 2008 • 5.1M bops • 5.1M bops • 6 racks • 1 rack • 126 servers • 17 servers • 240 sq ft • 40 sq ft • 48 kW • 6 kW The Bottom Line Benefits for IT (<2 year ROI4) Annual Estimated Floor Space Energy Costs Energy Savings 83% 87% $53K Reduction1 Reduction2 Savings3 For notes and disclaimers, see legal information slide at end of this presentation. Source: Intel January 16, 2008. Performance comparison using SPECjbb2005 bops (business operations per second) between (2004) 2 socket single core Intel Xeon processor (3.6GHz) and (2008) 2 socket Quad-Core Intel Xeon processor E5450 (3.0GHz, 80W, 1333MHz) measured 8/22/07. 1 Floor space based foot on sq ft reduction. 2 Energy costs based on total solution rack power. 3 Energy Savings based on an electric rate of $0.10/kWh assuming 33% average server load. 4 ROI is calculated based on cost of new servers ($6,264 per server pricing based on HP DL 380G5 (32GB RAM) as of Nov 17, 2007. source www.hp.com) divided by energy savings per year. Actual performance results and savings may vary depending on configuration. See backup for details. 6 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 7. Evolution of Virtualization “Virtualization 3.0” Adaptive Continuity High Availability/ Load Balancing Disaster Recovery Workload 1 Workload 2 Virtualization 2.0 App OS OS App VMM OS App VMM Ap Ap Ap Ap HW HW Flexible Workload Management p p p p OS OS OS OS Workload 1 Workload 2 VMM VMM HW HW App OS App OS App OS VMM VMM HW HW Server Test & Consolidation Development Virtualization 1.0 App OS App OS App OS Test Production Managing IT and Capital Expenses VMM VMM VMM HW HW HW App App A p A p A p OS OS p p p VMM OS OS OS VMM HW HW 7 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 8. Importance of Native Performance Intel® Xeon® 5400 series: virtualization performance is Up to 21% gain in performance proportional to native performance & Up to 38% gain in Energy Efficiency 1.38 1.17 1.18 1.20 1.21 1.13 Overhead % 1.00 ‘Native’ ‘Virtual’ SPECW SPECfp*_rate2006 SPECint*_rate2006 VMware* ESX SPECjbb*2005 SPECjbb*2005 - Vmmark* - Baseline Intel® PPW Xeon® 5300 eb*2005 Series Native Performance Drives Virtualized Performance Data Source: Published, measured, submitted or approved results as of April 7, 2008. See backup for details; Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more 8 information on performance tests and on the performance of Intel products, visit http://www.intel.com/performance/resources/limits.htm property of others. * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 9. Virtualization Hardware Assists virtualization performance is Example: proportional to native performance Intel® VT FlexPriority – Virtual Iron 4.0.2 on vConsolidate1 1.35 Overhead % ‘Native’ 1.00 ‘Virtual’ Intel® Xeon® X7350 Intel® VT Intel® VT FlexPriority Off FlexPriority On Hardware Assists improve virtualization performance & efficiency 1. 32-bit OS include Microsoft Windows* XP, Windows 2000 Server up to Windows Server 2003 SP1 2. All measurements conducted internally on platforms running 4 x Quad-Core Intel® Xeon® Processor X7350, 32GB memory, vConsolidate Beta 2, Virtual Iron 4.0.2 software. 1 CSU configuration, Sept 2007. Boot time improvement measured on Windows XP. vConsolidate measurement conducted by configuring the system with Windows 2000 SP4 Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software 9 design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, visit http://www.intel.com/performance/resources/limits.htm * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 10. Intel® VT for Directed I/O (VT-d) CPU CPU System Bus North Bridge DRAM VT-d Integrated PCIe* Root Ports Devices PCI Express South PCI, LPC, Bridge Legacy devices, … • VT-d provides support for DMA and interrupt remapping • Supports I/O device assignment to VMs, security, etc. 10 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 11. Intel® VT for Directed I/O (Intel® VT-d) VM VM OS OS Native Guest Application Application OS driver Buffer Buffer Driver A Driver A No HW VMM VMM changes Virtual Device Emulation Eliminates Buffer Driver A Chipset intermediate I/O Device Hardware Intel VT-d Remap paths and VMM Device A Device A overheads ||||||||||||||||||| ||||||||||||||||||| Emulation Based Virtual I/O Direct Assigned I/O with Intel® VT-d Better reliability through device VT-d IO Devices Phys Mem isolation Protected Memory regions Chipset 11 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 12. Evolution of I/O Virtualization Hypervisor SW – Emulation VM1 VM2 VMn High performance toll .… Common denominator I/O HW Assists – queuing Near native I/O performance vswitch VMM Common denominator I/O Direct assignment of shared NIC virtual resources Dedicated, Isolated I/O Native performance Scalability The Future: VM mobility with direct assigned virtual resources 12 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 13. Storage, Network and Virtualization SAN Attach Rates VM1 VM2 VM3 VMn 60% 54% (1) iSCSI VIRTUAL 50% SWITCH VMM 40% 1GbE FC 30% FC Edge 18% (2) Fibre EDGE SWITCH Channel 20% iSCSI FC Director CORE 10% Fibre SWITCH Channel 0% General Virtualized Servers Servers LAN SAN Simplified, Dynamic, and Power Efficient 1 Source: IDC Multiclient Study, “Server Virtualization on the Move 2007: Foundation for the Dynamic Data Center,” July 2007 2 Source: Intel estimates 13 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 14. 14 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 15. Legal Disclaimer • INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL® PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. INTEL PRODUCTS ARE NOT INTENDED FOR USE IN MEDICAL, LIFE SAVING, OR LIFE SUSTAINING APPLICATIONS. • Intel may make changes to specifications and product descriptions at any time, without notice. • All products, dates, and figures specified are preliminary based on current expectations, and are subject to change without notice. • Intel, processors, chipsets, and desktop boards may contain design defects or errors known as errata, which may cause the product to deviate from published specifications. Current characterized errata are available on request. • Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. • Intel, Intel Inside, Core, Xeon, Intel Virtualization Technology, FlexPriortiy and the Intel logo are trademarks of Intel Corporation in the United States and other countries. • *Other names and brands may be claimed as the property of others. • Copyright © 2008 Intel Corporation. All rights reserved. 15 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 16. Backup 16 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 17. Legal information slide “Impact of Multi-Core ” Cores SPECint_ SPECfp_ SAP- Code Brand Xeon / L2 L3 rate*_ rate*_ TPC-C* SD* Linpack Year Name Name Launch Freq CPU L1 Cache Cache Cache FSB Chipset Base2000 Base2000 tpmC Users GFlops 2001 Tualatin Intel Pentium III 1.40Ghz 512M L2 WW32 1.26 1 16K(I)+16K(D) 512KB 0 133 SW HE-SL 12.2 6.5 17079 135 - 2002 Prestonia Intel Xeon 2.20 GHz 512M L2 April 2.20 1 12K(I)+8K(D) 512KB 0 400 E7500 15.7 13.1 33768 215 5.46 2002 Prestonia Intel Xeon 2.80 GHz 512M L2 Sept 2.80 1 12K(I)+8K(D) 512KB 0 400 E7500 18.9 14.4 39007 245 6.72 2003 Prestonia Intel Xeon 3.06 GHz 512M L2 March 3.06 1 12K(I)+8K(D) 512KB 0 533 E7501 21.5 16.7 44943 290 7.83 2003 Prestonia Intel Xeon 3.06 GHz 1M L3 July 3.06 1 12K(I)+8K(D) 512KB 1MB 533 E7501 26.0 20.4 52468 329 8.16 2003 Prestonia Intel Xeon 3.20 GHz 1M L3 Oct 3.20 1 12K(I)+8K(D) 512KB 1MB 533 E7501 26.9 20.7 54097 340 8.40 2004 Prestonia Intel Xeon 3.20 GHz 2M L3 March 3.20 1 12K(I)+8K(D) 512KB 2MB 533 E7501 30.7 24.0 60364 408 9.79 2004 Nocona Intel Xeon 3.60 GHz 1M L2 June 3.60 1 16K(I)+16K(D) 1MB 0 800 E7520 32.1 25.2 68010 470 11.60 2005 Irwindale 64-bit Intel Xeon 3.60 GHz 2M L2 Feb 3.60 1 12K(I)+16K(D) 2MB 0 800 E7520 39.9 31.6 74298 597 13.00 2005 Paxville DP Dual-Core Intel Xeon 2.80 2x2M L2 Oct 2.80 2 12K(I)+16K(D) 2x2MB 0 800 E7520 59.5 39.4 87791 788 18.10 2006 Dempsey Dual-Core Intel Xeon 5080 May 3.73 2 12K(I)+16K(D) 2x2MB 0 1066 5000p 81.2 61.6 125954 1047 22.10 2006 Woodcrest Dual-Core Intel Xeon 5160 June 3.00 2 32K(I)+32K(D) 4MB 0 1333 5000p 123.0 82.1 169360 1285 38.30 2006 Clovertown Quad-Core Intel Xeon X5355 Nov 2.66 4 32K(I)+32K(D) 2x4MB 0 1333 5000p 200.0 103.0 240737 1870 64.00 2007 Clovertown Quad-Core Intel Xeon X5365 July 3.00 4 32K(I)+32K(D) 2x4MB 0 1333 5000p 214 105 251300 2080 70.00 2007 Harpertown Quad-Core Intel Xeon X5460 Nov 3.16 4 32K(I)+32K(D) 2x6MB 0 1333 5000p 242 131 273666 2449 82 Intel Pentium® III Microarchitecture Intel NetBurst® Microarchitecture Intel Core® Microarchitecture Results are Published/Measured/Submitted/Approved as of Nov 12, 2007. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, visit http://www.intel.com/performance/resources/limits.htm 17 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 18. Legal information slide “Server Compression ” Source: Intel January 16, 2008. Performance comparison using SPECjbb2005 bops (business operations per second) between (2004) 2 socket single core Intel Xeon processor (3.6GHz) and (2008) 2 socket Quad-Core Intel Xeon processor E5450 (3.0GHz, 80W, 1333MHz) measured 8/22/07. 1 Floor space based foot on sq ft reduction. 2 Energy costs based on total solution rack power. 3 Energy Savings based on an electric rate of $0.10/kWh assuming 33% average server load. 4 ROI is calculated based on cost of new servers ($6,264 per server pricing based on HP DL 380G5 (32GB RAM) as of Nov 17, 2007. source www.hp.com) divided by energy savings per year. Actual performance results and savings may vary depending on configuration. See next slide for detailed calculations Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, visit http://www.intel.com/performance/resources/limits.htm or call (U.S.) 1-800-628-8686 or 1-916-356-3104. 18 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 19. Legal information slide Calculations for Refresh Benefits System Configurations: 2 Socket Intel® Xeon® Processor 3.6GHz (Nacona) 8 x 1GB DDR2-400 DIMMs System Idle Power: 225W System Workload Power: 382W SPECjbb2005 bops performance: 40487 2 Socket Intel® Xeon® Processor E5450 (3.0GHz 80W HTN) 8 x 2GB 667MHz FBDIMMs System Idle Power: 237W System Workload Power: 353W SPECjbb2005 bops performance: 297,984 Calculations: Floor space: 240sq. Foot to 60 sq. foot (83% reduction) Annual energy costs: 48kW to 6kW (87% reduction) Yearly Energy Savings: $53k ($61.4k - $8.29k) = $53.11k Qty 126 2S Intel® Xeon® Processor 3.6GHz (Nacona) servers, at idle power (225W * 16 hours/day), at full load (382W * 8 hours/day), for one year at 0.10 kWh = $30.69k * 2 (HVAC factor) = $61.4k electricity cost per year. Compared to Qty 17 2S Intel® Xeon® Processor E5450 (HTN 3.0GHz) servers, at idle power (237W * 16 hours/day), at full load (353W * 8 hours/day), for one year at 0.10 kWh = $4.16k * w (HVAC factor) = $8.29k electricity cost per year. ROI: Using HP DL 380G5 with 32GB RAM is $6,264 as of Nov 17, 2007 on hp.com Cost of 17 servers is $106.5k divided by $53k/year energy savings, this is 2 year ROI Software: BEA JRockit* (build P27.4.0-3-86647-1.6.0_02-20070801-1931-windows-x86_64, compiled mode Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, visit http://www.intel.com/performance/resources/limits.htm or call (U.S.) 1-800-628-8686 or 1-916-356-3104. 19 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved
  • 20. Quad-Core Intel® Xeon® Processor 7300 Series based Servers Performance Publications and Intel® ISV Application Enabling Results – Config Details • vConsolidate on Virtual iron: Comparison to Dual-Core Intel® Xeon® Processor 7100 Series (167% claim) based on vConsolidate benchmark result published on Virtual Iron 4.0 software: – Dual-Core Intel® Xeon® processor 7140M based platform details: Intel server platform with four Quad-Core Intel® Xeon® processors 7140M 3.4GHz, 800 MHz system bus, 16MB L3 Cache, Intel E8500 chipset, 32GB Memory DDr2 PC2-5300P, 14x Seagate Cheetah* 36GB 15krpm Fibre Channel HDD. Maximum performance achieved using 2 CSUs. – Quad-Core Intel® Xeon® processor X7350 based platform details: Intel server platform with four Quad-Core Intel® Xeon® processors X7350 2.93GHz, 1066 MHz system bus, 8MB L2 Cache, Clarksboro chipset, 32GB Memory FBD PC2-5300F, 14x Seagate Cheetah 36GB 15krpm Fibre Channel HDD. Maximum performance achieved using 4 CSUs. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, visit http://www.intel.com/performance/resources/limits.htm 20 or call (U.S.) 1-800-628-8686 or 1-916-356-3104. * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. All rights reserved 20