SlideShare a Scribd company logo
1 of 45
Download to read offline
Tracking Down Storage
Performance Issues:
A Customer’s Perspective
Keith Aasen, NetApp
Scott Elliott, Christie Digital
INF-STO1430
#vmworldinf
Agenda
1. Introduction and background
2. Storage problems and their effect on virtual infrastructure
3. Root cause analyses and resolution
4. Results and next steps
Who is Christie?About Christie
• A global visual technologies company
• Visual Solutions include:
• Media Walls
• A global visual technologies company
• Visual Solutions include:
• Digital Cinema Projectors
Who is Christie?About Christie
• A global visual technologies company
• Visual Solutions include:
• 3D Virtual Reality
Who is Christie?About Christie
• A global visual technologies company
• Visual Solutions include:
• Simulation Projection Systems
Who is Christie?About Christie
Christie’s virtualization journey
Kitchener, Ontario
Cypress, California
Christie’s virtualization journey
Kitchener vCenter
14 Hosts:
IBM x3650: Dual Proc, 4-Core, 50 GB RAM
IBM x3850: Quad Proc, 8-Core, 256 GB RAM
NetApp v3140; 60 TB
250+ VMs (and growing)
The problem arises
• Disk latencies:
20 ms
40 ms
Good
Bad
Ugly
The problem arises
• Disk latencies:
20 ms
40 ms
Good
Bad
Ugly
Implementation
Sustained
Latency
Spikes
Increase in business
demand
Sustained
Latency
SpikesDeployed SCOM
Plug-in
Continued growth;
High I/O introduced
Sustained: 30 ms
Spikes: 100 ms
No application impact
Sustained: +40 ms
Spikes: 6 seconds
Significant Application Impact
List of issues
1000 ms = 1 second!
List of issues
1. Most datastores had a consistent 40ms (or higher) of disk
latency with spikes lasting multiple seconds
2. ESXi hosts lose connectivity at seemingly random times
• Most happen between midnight and 5:00 a.m.
3. Applications complained of disk time-outs
• Where applicable, would automatically fail over to DR site
The hunt begins
• Where to start?
• Oceans of data across multiple systems
• Need to correlate information and filter out distractions
• Specialized knowledge to interpret the data
Timing is everything
• Coincidentally, PoC of NetApp OnCommand Balance
• Additional diagnostic analysis and correlated data
• Supplemented SCOM and PerfStats
• Large amount of misaligned VMs
• Most severe latencies happened between midnight and 5:00 a.m.
Intelligence Instead of DataPerformance Capacity Analytics
OnCommand Balance
Misaligned VMs on a LUN
VMDK
NTFS
block
NTFS
block
NTFS
block
NTFS
block
MBR or starting offset
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
The VMDK is aligned to the VMFS file system.
VMFS block VMFS block VMFS block
The VMFS file system is aligned to the WAFL file system
so that the VMFS blocks align to the WAFL blocks
This offset causes
the NTFS blocks
to be misaligned
with the WAFL
blocks
Properly aligned VMs on a LUN
VMDK
NTFS
block
MBR or starting offset
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
Properly aligned VM IO
In a properly aligned VM configuration, each Guest OS
Block (NTFS/EXT3) is mapped to one block on the storage
array.
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
Properly aligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
When a write occurs from the guest OS the write is cached
and then acknowledged back to the guest.
Properly aligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
When a write occurs from the guest OS the write is cached
and then acknowledged back to the guest.
Guest write
Properly aligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
Cached
in
NVRAM
NTFS
block
Guest write
When a write occurs from the guest OS, the write is cached
and then acknowledged back to the guest.
Properly aligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
Cached
in
NVRAM
NTFS
block
ACK
Guest write
When a write occurs from the guest OS the write is cached
and then acknowledged back to the guest.
Properly aligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
Because of NetApp WAFL and NVRAM technology NetApp
controllers can write to disk very quickly therefore NVRAM rarely
fills up.
Written to disk later
Invalidated
Misaligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
FL
ck
FL
ck
FL
ck
FL
ck
In a misaligned IO configuration, each windows block is stored
on multiple blocks on the storage array
Misaligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
FL
ck
FL
ck
FL
ck
FL
ck
Guest write
In a misaligned IO configuration each windows block is stored
on multiple blocks on the storage array
Misaligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
FL
ck
FL
ck
FL
ck
FL
ck
This guest write is 2 partial writes (1/2 of 2 blocks).The storage
controller caches the writes and acknowledges the guest as
before
Cached
in
NVRAM
FL
ck
WA
blo
ACK
Misaligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
FL
ck
FL
ck
FL
ck
FL
ck
Cached
in
NVRAM
FL
ck
WA
blo
This write is called a partial write since it only writes parts of 2
blocks. We need to preserve the other half of the block!
Still need!
Misaligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
FL
ck
FL
ck
FL
ck
FL
ck
Cached
in
NVRAM
FL
ck
WA
blo
Still need!
To do this we must first read the old blocks in. Normally this is
done during a CP or when an entire write stripe is ready as the
rest of the block may come in.
Reads
WAFL
block
WAFL
block
Misaligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
FL
ck
FL
ck
FL
ck
FL
ck
Cached
in
NVRAM
FL
ck
WA
blo
Still need!
Now we can build the new Blocks and write back out.
Misaligned VM IO
VMDK
NTFS
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
VMFS block VMFS block VMFS block
NTFS
block
NTFS
block
NTFS
block
NTFS
block
FL
ck
FL
ck
FL
ck
FL
ck
Invalidated
Now we can build the new blocks and write back out.
Net effect
• This process causes consistency points to take longer in duration
• Increases CPU load on the controller
• No effect on performance to the VM, if the controller can
“keep up”
• If load increases, then a dramatic spike in latency can occur
• Ultimately determines how many VMs can be hosted on a
storage system
How to correct misalignment
• Adjust the MBR or boot sector with MBRalign or VMware
converter
• Permanent solution
• Requires Downtime for the VM
• Create an “Optimized Datastore”
• No downtime required for the VM
• Limited Vendors offer this
• Must be sure not to mix misaligned VMs and aligned VMs
Servers with Misaligned Partitions Report
Virtual Storage Console 4.0
Misaligned VMs on optimized LUN
VMDK
NTFS
block
NTFS
block
NTFS
block
NTFS
block
MBR or starting offset
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
WAFL
block
The VMDK is aligned to the VMFS file system.
VMFS block VMFS block VMFS block
The VMFS file system is “improperly” aligned to the
storage file system so that the NTFS blocks align to the
storage blocks.
This offset causes
the NTFS blocks
to be aligned with
the storage blocks
Getting closer
• Remaining latency spike late at night with no corresponding IO.
• Time coincided with aggregate-level snapshot.
• Aggregate snapshot is on by default on every system. Usually
there is no noticeable activity.
• Will trigger a disk cleanup process, if significant space is
released.
• The cleanup process was colliding with the SQL DB copy causing
the latency spike. (has since had it’s priority adjusted)
• Still had lingering – and seemingly random – spikes
• Use Veeam’s Management Pack for VMware
• Agentless vSphere monitoring and management
• Systems Center Operations Manager Plug-In
• Used report “Virtual Machines: Disk Performance History”
The cumulative effect of client software
Correlating VM disk activity
AV client maintenance synchronized across VMs
The system now…
Milliseconds
What did we learn?
• An underused storage subsystem can mask environment
misconfigurations.
• Storage performance issues are rarely due to a single cause.
• In this case, there were three causes:
1. VM alignment
2. Storage resource contention from background process, and
3. Suboptimal antimalware configuration.
Other lessons learned
1. Invest in monitoring tools to detect problems.
2. Fix misconfigurations before they become a problem.
3. Engage your vendor to assist with the troubleshooting process.
Questions?
Thank you and have a great VMworld 2012!
FILL OUT
A SURVEY
EVERY COMPLETE SURVEY
IS ENTERED INTO
DRAWING FOR A
$25 VMWARE COMPANY
STORE GIFT CERTIFICATE
Tracking Down Storage
Performance Issues:
A Customer’s Perspective
Keith Aasen, NetApp
Scott Elliott, Christie Digital
INF-STO1430
#vmworldinf

More Related Content

What's hot

V mware v-sphere-replication-overview
V mware v-sphere-replication-overviewV mware v-sphere-replication-overview
V mware v-sphere-replication-overviewFirman Indrianto
 
Xen and Client Virtualization: the case of XenClient XT
Xen and Client Virtualization: the case of XenClient XTXen and Client Virtualization: the case of XenClient XT
Xen and Client Virtualization: the case of XenClient XTThe Linux Foundation
 
S4 xen hypervisor_20080622
S4 xen hypervisor_20080622S4 xen hypervisor_20080622
S4 xen hypervisor_20080622Todd Deshane
 
Comparação entre XenServer 6.2 e VMware VSphere 5.1 - Comparison of Citrix Xe...
Comparação entre XenServer 6.2 e VMware VSphere 5.1 - Comparison of Citrix Xe...Comparação entre XenServer 6.2 e VMware VSphere 5.1 - Comparison of Citrix Xe...
Comparação entre XenServer 6.2 e VMware VSphere 5.1 - Comparison of Citrix Xe...Lorscheider Santiago
 
Presentation v mware v-sphere advanced troubleshooting by eric sloof
Presentation   v mware v-sphere advanced troubleshooting by eric sloofPresentation   v mware v-sphere advanced troubleshooting by eric sloof
Presentation v mware v-sphere advanced troubleshooting by eric sloofsolarisyourep
 
XPDS16: libvirt and Tools: What's New and What's Next - James Fehlig, SUSE
XPDS16: libvirt and Tools: What's New and What's Next - James Fehlig, SUSEXPDS16: libvirt and Tools: What's New and What's Next - James Fehlig, SUSE
XPDS16: libvirt and Tools: What's New and What's Next - James Fehlig, SUSEThe Linux Foundation
 
open source virtualization
open source virtualizationopen source virtualization
open source virtualizationKris Buytaert
 
LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...
LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...
LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...The Linux Foundation
 
ProfessionalVMware BrownBag VCP5 Section3: Storage
ProfessionalVMware BrownBag VCP5 Section3: StorageProfessionalVMware BrownBag VCP5 Section3: Storage
ProfessionalVMware BrownBag VCP5 Section3: StorageProfessionalVMware
 
OSS Presentation NexentaStor™
OSS Presentation NexentaStor™OSS Presentation NexentaStor™
OSS Presentation NexentaStor™OpenStorageSummit
 
XPDS16: Hypervisor-based Security: Vicarious Learning via Introspektioneerin...
XPDS16:  Hypervisor-based Security: Vicarious Learning via Introspektioneerin...XPDS16:  Hypervisor-based Security: Vicarious Learning via Introspektioneerin...
XPDS16: Hypervisor-based Security: Vicarious Learning via Introspektioneerin...The Linux Foundation
 
Citrix certified administrator certification
Citrix certified administrator certificationCitrix certified administrator certification
Citrix certified administrator certificationCisco-642-902
 
Vmbkp: VMware vSphere Incremental Backup Tool
Vmbkp: VMware vSphere Incremental Backup ToolVmbkp: VMware vSphere Incremental Backup Tool
Vmbkp: VMware vSphere Incremental Backup ToolTakashi Hoshino
 
Open Source Citrix Windows PV Drivers - Paul Durrant, Citrix
Open Source Citrix Windows PV Drivers - Paul Durrant, CitrixOpen Source Citrix Windows PV Drivers - Paul Durrant, Citrix
Open Source Citrix Windows PV Drivers - Paul Durrant, CitrixThe Linux Foundation
 
VMware Performance Troubleshooting
VMware Performance TroubleshootingVMware Performance Troubleshooting
VMware Performance Troubleshootingglbsolutions
 
Multiple Device Emulators for HVM Guests, Paul Durrant, Citrix
Multiple Device Emulators for HVM Guests, Paul Durrant, CitrixMultiple Device Emulators for HVM Guests, Paul Durrant, Citrix
Multiple Device Emulators for HVM Guests, Paul Durrant, CitrixThe Linux Foundation
 
XPDS16: Xen Scalability Analysis - Weidong Han, Zhichao Huang & Wei Yang, Huawei
XPDS16: Xen Scalability Analysis - Weidong Han, Zhichao Huang & Wei Yang, HuaweiXPDS16: Xen Scalability Analysis - Weidong Han, Zhichao Huang & Wei Yang, Huawei
XPDS16: Xen Scalability Analysis - Weidong Han, Zhichao Huang & Wei Yang, HuaweiThe Linux Foundation
 

What's hot (20)

V mware v-sphere-replication-overview
V mware v-sphere-replication-overviewV mware v-sphere-replication-overview
V mware v-sphere-replication-overview
 
Xen and Client Virtualization: the case of XenClient XT
Xen and Client Virtualization: the case of XenClient XTXen and Client Virtualization: the case of XenClient XT
Xen and Client Virtualization: the case of XenClient XT
 
S4 xen hypervisor_20080622
S4 xen hypervisor_20080622S4 xen hypervisor_20080622
S4 xen hypervisor_20080622
 
Comparação entre XenServer 6.2 e VMware VSphere 5.1 - Comparison of Citrix Xe...
Comparação entre XenServer 6.2 e VMware VSphere 5.1 - Comparison of Citrix Xe...Comparação entre XenServer 6.2 e VMware VSphere 5.1 - Comparison of Citrix Xe...
Comparação entre XenServer 6.2 e VMware VSphere 5.1 - Comparison of Citrix Xe...
 
Presentation v mware v-sphere advanced troubleshooting by eric sloof
Presentation   v mware v-sphere advanced troubleshooting by eric sloofPresentation   v mware v-sphere advanced troubleshooting by eric sloof
Presentation v mware v-sphere advanced troubleshooting by eric sloof
 
XPDS16: libvirt and Tools: What's New and What's Next - James Fehlig, SUSE
XPDS16: libvirt and Tools: What's New and What's Next - James Fehlig, SUSEXPDS16: libvirt and Tools: What's New and What's Next - James Fehlig, SUSE
XPDS16: libvirt and Tools: What's New and What's Next - James Fehlig, SUSE
 
open source virtualization
open source virtualizationopen source virtualization
open source virtualization
 
LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...
LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...
LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...
 
ProfessionalVMware BrownBag VCP5 Section3: Storage
ProfessionalVMware BrownBag VCP5 Section3: StorageProfessionalVMware BrownBag VCP5 Section3: Storage
ProfessionalVMware BrownBag VCP5 Section3: Storage
 
OSS Presentation NexentaStor™
OSS Presentation NexentaStor™OSS Presentation NexentaStor™
OSS Presentation NexentaStor™
 
XPDS16: Hypervisor-based Security: Vicarious Learning via Introspektioneerin...
XPDS16:  Hypervisor-based Security: Vicarious Learning via Introspektioneerin...XPDS16:  Hypervisor-based Security: Vicarious Learning via Introspektioneerin...
XPDS16: Hypervisor-based Security: Vicarious Learning via Introspektioneerin...
 
XPDS16: Xen Development Update
XPDS16: Xen Development UpdateXPDS16: Xen Development Update
XPDS16: Xen Development Update
 
Citrix certified administrator certification
Citrix certified administrator certificationCitrix certified administrator certification
Citrix certified administrator certification
 
Vmbkp: VMware vSphere Incremental Backup Tool
Vmbkp: VMware vSphere Incremental Backup ToolVmbkp: VMware vSphere Incremental Backup Tool
Vmbkp: VMware vSphere Incremental Backup Tool
 
µ-Xen
µ-Xenµ-Xen
µ-Xen
 
Open Source Citrix Windows PV Drivers - Paul Durrant, Citrix
Open Source Citrix Windows PV Drivers - Paul Durrant, CitrixOpen Source Citrix Windows PV Drivers - Paul Durrant, Citrix
Open Source Citrix Windows PV Drivers - Paul Durrant, Citrix
 
VMware Performance Troubleshooting
VMware Performance TroubleshootingVMware Performance Troubleshooting
VMware Performance Troubleshooting
 
Multiple Device Emulators for HVM Guests, Paul Durrant, Citrix
Multiple Device Emulators for HVM Guests, Paul Durrant, CitrixMultiple Device Emulators for HVM Guests, Paul Durrant, Citrix
Multiple Device Emulators for HVM Guests, Paul Durrant, Citrix
 
A Robust and Flexible Operating System Compatibility Architecture
A Robust and Flexible Operating System Compatibility ArchitectureA Robust and Flexible Operating System Compatibility Architecture
A Robust and Flexible Operating System Compatibility Architecture
 
XPDS16: Xen Scalability Analysis - Weidong Han, Zhichao Huang & Wei Yang, Huawei
XPDS16: Xen Scalability Analysis - Weidong Han, Zhichao Huang & Wei Yang, HuaweiXPDS16: Xen Scalability Analysis - Weidong Han, Zhichao Huang & Wei Yang, Huawei
XPDS16: Xen Scalability Analysis - Weidong Han, Zhichao Huang & Wei Yang, Huawei
 

Similar to Presentation tracking down storage performance issues a customer’s perspective

VMware vSphere Version Comparison 4.0 to 6.5
VMware  vSphere Version Comparison 4.0 to 6.5VMware  vSphere Version Comparison 4.0 to 6.5
VMware vSphere Version Comparison 4.0 to 6.5Sabir Hussain
 
Rearchitecting Storage for Server Virtualization
Rearchitecting Storage for Server VirtualizationRearchitecting Storage for Server Virtualization
Rearchitecting Storage for Server VirtualizationStephen Foskett
 
vmfs intro
vmfs introvmfs intro
vmfs introbergwolf
 
Turning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformTurning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformOpenStack_Online
 
VMworld 2017 - Top 10 things to know about vSAN
VMworld 2017 - Top 10 things to know about vSANVMworld 2017 - Top 10 things to know about vSAN
VMworld 2017 - Top 10 things to know about vSANDuncan Epping
 
vSphere vStorage: Troubleshooting Performance
vSphere vStorage: Troubleshooting PerformancevSphere vStorage: Troubleshooting Performance
vSphere vStorage: Troubleshooting PerformanceProfessionalVMware
 
Turning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformTurning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformwim_provoost
 
Presentazione HPE @ VMUGIT UserCon 2015
Presentazione HPE @ VMUGIT UserCon 2015Presentazione HPE @ VMUGIT UserCon 2015
Presentazione HPE @ VMUGIT UserCon 2015VMUG IT
 
The Unofficial VCAP / VCP VMware Study Guide
The Unofficial VCAP / VCP VMware Study GuideThe Unofficial VCAP / VCP VMware Study Guide
The Unofficial VCAP / VCP VMware Study GuideVeeam Software
 
V sphere 5.1-storage-features-&-futures
V sphere 5.1-storage-features-&-futuresV sphere 5.1-storage-features-&-futures
V sphere 5.1-storage-features-&-futuressubtitle
 
VMworld 2017 Core Storage
VMworld 2017 Core StorageVMworld 2017 Core Storage
VMworld 2017 Core StorageCormac Hogan
 
Storage for Virtual Environments 2011 R2
Storage for Virtual Environments 2011 R2Storage for Virtual Environments 2011 R2
Storage for Virtual Environments 2011 R2Stephen Foskett
 
VMworld 2013: Extreme Performance Series: Storage in a Flash
VMworld 2013: Extreme Performance Series: Storage in a Flash VMworld 2013: Extreme Performance Series: Storage in a Flash
VMworld 2013: Extreme Performance Series: Storage in a Flash VMworld
 
Road show 2015 triangle meetup
Road show 2015 triangle meetupRoad show 2015 triangle meetup
Road show 2015 triangle meetupwim_provoost
 
Denver VMUG nov 2011
Denver VMUG nov 2011Denver VMUG nov 2011
Denver VMUG nov 2011Dan Brinkmann
 
Learning from ZFS to Scale Storage on and under Containers
Learning from ZFS to Scale Storage on and under ContainersLearning from ZFS to Scale Storage on and under Containers
Learning from ZFS to Scale Storage on and under Containersinside-BigData.com
 
Vm13 vnx mixed workloads
Vm13 vnx mixed workloadsVm13 vnx mixed workloads
Vm13 vnx mixed workloadspittmantony
 
Vizioncore vRanger 4.5 Technical Overview
Vizioncore vRanger 4.5 Technical OverviewVizioncore vRanger 4.5 Technical Overview
Vizioncore vRanger 4.5 Technical OverviewRogerB747
 

Similar to Presentation tracking down storage performance issues a customer’s perspective (20)

VMware vSphere Version Comparison 4.0 to 6.5
VMware  vSphere Version Comparison 4.0 to 6.5VMware  vSphere Version Comparison 4.0 to 6.5
VMware vSphere Version Comparison 4.0 to 6.5
 
Rearchitecting Storage for Server Virtualization
Rearchitecting Storage for Server VirtualizationRearchitecting Storage for Server Virtualization
Rearchitecting Storage for Server Virtualization
 
vmfs intro
vmfs introvmfs intro
vmfs intro
 
Turning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformTurning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platform
 
VMworld 2017 - Top 10 things to know about vSAN
VMworld 2017 - Top 10 things to know about vSANVMworld 2017 - Top 10 things to know about vSAN
VMworld 2017 - Top 10 things to know about vSAN
 
Hyper v r2 deep dive
Hyper v r2 deep diveHyper v r2 deep dive
Hyper v r2 deep dive
 
vSphere vStorage: Troubleshooting Performance
vSphere vStorage: Troubleshooting PerformancevSphere vStorage: Troubleshooting Performance
vSphere vStorage: Troubleshooting Performance
 
Turning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformTurning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platform
 
Presentazione HPE @ VMUGIT UserCon 2015
Presentazione HPE @ VMUGIT UserCon 2015Presentazione HPE @ VMUGIT UserCon 2015
Presentazione HPE @ VMUGIT UserCon 2015
 
The Unofficial VCAP / VCP VMware Study Guide
The Unofficial VCAP / VCP VMware Study GuideThe Unofficial VCAP / VCP VMware Study Guide
The Unofficial VCAP / VCP VMware Study Guide
 
V sphere 5.1-storage-features-&-futures
V sphere 5.1-storage-features-&-futuresV sphere 5.1-storage-features-&-futures
V sphere 5.1-storage-features-&-futures
 
VMworld 2017 Core Storage
VMworld 2017 Core StorageVMworld 2017 Core Storage
VMworld 2017 Core Storage
 
Storage for Virtual Environments 2011 R2
Storage for Virtual Environments 2011 R2Storage for Virtual Environments 2011 R2
Storage for Virtual Environments 2011 R2
 
VMworld 2013: Extreme Performance Series: Storage in a Flash
VMworld 2013: Extreme Performance Series: Storage in a Flash VMworld 2013: Extreme Performance Series: Storage in a Flash
VMworld 2013: Extreme Performance Series: Storage in a Flash
 
Road show 2015 triangle meetup
Road show 2015 triangle meetupRoad show 2015 triangle meetup
Road show 2015 triangle meetup
 
Denver VMUG nov 2011
Denver VMUG nov 2011Denver VMUG nov 2011
Denver VMUG nov 2011
 
Learning from ZFS to Scale Storage on and under Containers
Learning from ZFS to Scale Storage on and under ContainersLearning from ZFS to Scale Storage on and under Containers
Learning from ZFS to Scale Storage on and under Containers
 
TDS-16489U-R2 0215 EN
TDS-16489U-R2 0215 ENTDS-16489U-R2 0215 EN
TDS-16489U-R2 0215 EN
 
Vm13 vnx mixed workloads
Vm13 vnx mixed workloadsVm13 vnx mixed workloads
Vm13 vnx mixed workloads
 
Vizioncore vRanger 4.5 Technical Overview
Vizioncore vRanger 4.5 Technical OverviewVizioncore vRanger 4.5 Technical Overview
Vizioncore vRanger 4.5 Technical Overview
 

More from solarisyourep

Presentation a new era in it
Presentation   a new era in itPresentation   a new era in it
Presentation a new era in itsolarisyourep
 
Presentation a vision for user centric computing
Presentation   a vision for user centric computingPresentation   a vision for user centric computing
Presentation a vision for user centric computingsolarisyourep
 
Presentation advanced management – the road ahead
Presentation   advanced management – the road aheadPresentation   advanced management – the road ahead
Presentation advanced management – the road aheadsolarisyourep
 
Presentation architecting a cloud infrastructure
Presentation   architecting a cloud infrastructurePresentation   architecting a cloud infrastructure
Presentation architecting a cloud infrastructuresolarisyourep
 
Presentation architecting virtualized infrastructure for big data
Presentation   architecting virtualized infrastructure for big dataPresentation   architecting virtualized infrastructure for big data
Presentation architecting virtualized infrastructure for big datasolarisyourep
 
Presentation avoiding the 19 biggest ha & drs configuration mistakes
Presentation   avoiding the 19 biggest ha & drs configuration mistakesPresentation   avoiding the 19 biggest ha & drs configuration mistakes
Presentation avoiding the 19 biggest ha & drs configuration mistakessolarisyourep
 
Presentation blade center foundation for cloud
Presentation   blade center foundation for cloudPresentation   blade center foundation for cloud
Presentation blade center foundation for cloudsolarisyourep
 
Presentation building and running your private cloud
Presentation   building and running your private cloudPresentation   building and running your private cloud
Presentation building and running your private cloudsolarisyourep
 
Presentation building your cloud with v mware
Presentation   building your cloud with v mwarePresentation   building your cloud with v mware
Presentation building your cloud with v mwaresolarisyourep
 
Presentation business critical applications in a virtual env
Presentation   business critical applications in a virtual envPresentation   business critical applications in a virtual env
Presentation business critical applications in a virtual envsolarisyourep
 
Presentation cim1309 v cat 3.0 operating a v-mware cloud
Presentation   cim1309 v cat 3.0 operating a v-mware cloudPresentation   cim1309 v cat 3.0 operating a v-mware cloud
Presentation cim1309 v cat 3.0 operating a v-mware cloudsolarisyourep
 
Presentation cisco intelligent automation complementing and extending v mwa...
Presentation   cisco intelligent automation complementing and extending v mwa...Presentation   cisco intelligent automation complementing and extending v mwa...
Presentation cisco intelligent automation complementing and extending v mwa...solarisyourep
 
Presentation cisco vxi–optimized infrastructure for scaling v mware view wi...
Presentation   cisco vxi–optimized infrastructure for scaling v mware view wi...Presentation   cisco vxi–optimized infrastructure for scaling v mware view wi...
Presentation cisco vxi–optimized infrastructure for scaling v mware view wi...solarisyourep
 
Presentation cloud infrastructure and management – from v sphere to vcloud ...
Presentation   cloud infrastructure and management – from v sphere to vcloud ...Presentation   cloud infrastructure and management – from v sphere to vcloud ...
Presentation cloud infrastructure and management – from v sphere to vcloud ...solarisyourep
 
Presentation cloud infrastructure launch – what’s new
Presentation   cloud infrastructure launch – what’s newPresentation   cloud infrastructure launch – what’s new
Presentation cloud infrastructure launch – what’s newsolarisyourep
 
Presentation cloud meets big
Presentation   cloud meets bigPresentation   cloud meets big
Presentation cloud meets bigsolarisyourep
 
Presentation consuming a cloud
Presentation   consuming a cloudPresentation   consuming a cloud
Presentation consuming a cloudsolarisyourep
 
Presentation desktops for the cloud the view rollout
Presentation   desktops for the cloud the view rolloutPresentation   desktops for the cloud the view rollout
Presentation desktops for the cloud the view rolloutsolarisyourep
 
Presentation disaster recovery in virtualization and cloud
Presentation   disaster recovery in virtualization and cloudPresentation   disaster recovery in virtualization and cloud
Presentation disaster recovery in virtualization and cloudsolarisyourep
 
Presentation drs advanced concepts, best practices and future directions
Presentation   drs advanced concepts, best practices and future directionsPresentation   drs advanced concepts, best practices and future directions
Presentation drs advanced concepts, best practices and future directionssolarisyourep
 

More from solarisyourep (20)

Presentation a new era in it
Presentation   a new era in itPresentation   a new era in it
Presentation a new era in it
 
Presentation a vision for user centric computing
Presentation   a vision for user centric computingPresentation   a vision for user centric computing
Presentation a vision for user centric computing
 
Presentation advanced management – the road ahead
Presentation   advanced management – the road aheadPresentation   advanced management – the road ahead
Presentation advanced management – the road ahead
 
Presentation architecting a cloud infrastructure
Presentation   architecting a cloud infrastructurePresentation   architecting a cloud infrastructure
Presentation architecting a cloud infrastructure
 
Presentation architecting virtualized infrastructure for big data
Presentation   architecting virtualized infrastructure for big dataPresentation   architecting virtualized infrastructure for big data
Presentation architecting virtualized infrastructure for big data
 
Presentation avoiding the 19 biggest ha & drs configuration mistakes
Presentation   avoiding the 19 biggest ha & drs configuration mistakesPresentation   avoiding the 19 biggest ha & drs configuration mistakes
Presentation avoiding the 19 biggest ha & drs configuration mistakes
 
Presentation blade center foundation for cloud
Presentation   blade center foundation for cloudPresentation   blade center foundation for cloud
Presentation blade center foundation for cloud
 
Presentation building and running your private cloud
Presentation   building and running your private cloudPresentation   building and running your private cloud
Presentation building and running your private cloud
 
Presentation building your cloud with v mware
Presentation   building your cloud with v mwarePresentation   building your cloud with v mware
Presentation building your cloud with v mware
 
Presentation business critical applications in a virtual env
Presentation   business critical applications in a virtual envPresentation   business critical applications in a virtual env
Presentation business critical applications in a virtual env
 
Presentation cim1309 v cat 3.0 operating a v-mware cloud
Presentation   cim1309 v cat 3.0 operating a v-mware cloudPresentation   cim1309 v cat 3.0 operating a v-mware cloud
Presentation cim1309 v cat 3.0 operating a v-mware cloud
 
Presentation cisco intelligent automation complementing and extending v mwa...
Presentation   cisco intelligent automation complementing and extending v mwa...Presentation   cisco intelligent automation complementing and extending v mwa...
Presentation cisco intelligent automation complementing and extending v mwa...
 
Presentation cisco vxi–optimized infrastructure for scaling v mware view wi...
Presentation   cisco vxi–optimized infrastructure for scaling v mware view wi...Presentation   cisco vxi–optimized infrastructure for scaling v mware view wi...
Presentation cisco vxi–optimized infrastructure for scaling v mware view wi...
 
Presentation cloud infrastructure and management – from v sphere to vcloud ...
Presentation   cloud infrastructure and management – from v sphere to vcloud ...Presentation   cloud infrastructure and management – from v sphere to vcloud ...
Presentation cloud infrastructure and management – from v sphere to vcloud ...
 
Presentation cloud infrastructure launch – what’s new
Presentation   cloud infrastructure launch – what’s newPresentation   cloud infrastructure launch – what’s new
Presentation cloud infrastructure launch – what’s new
 
Presentation cloud meets big
Presentation   cloud meets bigPresentation   cloud meets big
Presentation cloud meets big
 
Presentation consuming a cloud
Presentation   consuming a cloudPresentation   consuming a cloud
Presentation consuming a cloud
 
Presentation desktops for the cloud the view rollout
Presentation   desktops for the cloud the view rolloutPresentation   desktops for the cloud the view rollout
Presentation desktops for the cloud the view rollout
 
Presentation disaster recovery in virtualization and cloud
Presentation   disaster recovery in virtualization and cloudPresentation   disaster recovery in virtualization and cloud
Presentation disaster recovery in virtualization and cloud
 
Presentation drs advanced concepts, best practices and future directions
Presentation   drs advanced concepts, best practices and future directionsPresentation   drs advanced concepts, best practices and future directions
Presentation drs advanced concepts, best practices and future directions
 

Presentation tracking down storage performance issues a customer’s perspective

  • 1. Tracking Down Storage Performance Issues: A Customer’s Perspective Keith Aasen, NetApp Scott Elliott, Christie Digital INF-STO1430 #vmworldinf
  • 2.
  • 3. Agenda 1. Introduction and background 2. Storage problems and their effect on virtual infrastructure 3. Root cause analyses and resolution 4. Results and next steps
  • 4. Who is Christie?About Christie • A global visual technologies company • Visual Solutions include: • Media Walls
  • 5. • A global visual technologies company • Visual Solutions include: • Digital Cinema Projectors Who is Christie?About Christie
  • 6. • A global visual technologies company • Visual Solutions include: • 3D Virtual Reality Who is Christie?About Christie
  • 7. • A global visual technologies company • Visual Solutions include: • Simulation Projection Systems Who is Christie?About Christie
  • 9. Christie’s virtualization journey Kitchener vCenter 14 Hosts: IBM x3650: Dual Proc, 4-Core, 50 GB RAM IBM x3850: Quad Proc, 8-Core, 256 GB RAM NetApp v3140; 60 TB 250+ VMs (and growing)
  • 10. The problem arises • Disk latencies: 20 ms 40 ms Good Bad Ugly
  • 11. The problem arises • Disk latencies: 20 ms 40 ms Good Bad Ugly Implementation Sustained Latency Spikes Increase in business demand Sustained Latency SpikesDeployed SCOM Plug-in Continued growth; High I/O introduced Sustained: 30 ms Spikes: 100 ms No application impact Sustained: +40 ms Spikes: 6 seconds Significant Application Impact
  • 12. List of issues 1000 ms = 1 second!
  • 13. List of issues 1. Most datastores had a consistent 40ms (or higher) of disk latency with spikes lasting multiple seconds 2. ESXi hosts lose connectivity at seemingly random times • Most happen between midnight and 5:00 a.m. 3. Applications complained of disk time-outs • Where applicable, would automatically fail over to DR site
  • 14. The hunt begins • Where to start? • Oceans of data across multiple systems • Need to correlate information and filter out distractions • Specialized knowledge to interpret the data
  • 15. Timing is everything • Coincidentally, PoC of NetApp OnCommand Balance • Additional diagnostic analysis and correlated data • Supplemented SCOM and PerfStats • Large amount of misaligned VMs • Most severe latencies happened between midnight and 5:00 a.m. Intelligence Instead of DataPerformance Capacity Analytics OnCommand Balance
  • 16. Misaligned VMs on a LUN VMDK NTFS block NTFS block NTFS block NTFS block MBR or starting offset WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block The VMDK is aligned to the VMFS file system. VMFS block VMFS block VMFS block The VMFS file system is aligned to the WAFL file system so that the VMFS blocks align to the WAFL blocks This offset causes the NTFS blocks to be misaligned with the WAFL blocks
  • 17. Properly aligned VMs on a LUN VMDK NTFS block MBR or starting offset WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block
  • 18. Properly aligned VM IO In a properly aligned VM configuration, each Guest OS Block (NTFS/EXT3) is mapped to one block on the storage array. VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block
  • 19. Properly aligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block When a write occurs from the guest OS the write is cached and then acknowledged back to the guest.
  • 20. Properly aligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block When a write occurs from the guest OS the write is cached and then acknowledged back to the guest. Guest write
  • 21. Properly aligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block Cached in NVRAM NTFS block Guest write When a write occurs from the guest OS, the write is cached and then acknowledged back to the guest.
  • 22. Properly aligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block Cached in NVRAM NTFS block ACK Guest write When a write occurs from the guest OS the write is cached and then acknowledged back to the guest.
  • 23. Properly aligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block NTFS block Because of NetApp WAFL and NVRAM technology NetApp controllers can write to disk very quickly therefore NVRAM rarely fills up. Written to disk later Invalidated
  • 24. Misaligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block FL ck FL ck FL ck FL ck In a misaligned IO configuration, each windows block is stored on multiple blocks on the storage array
  • 25. Misaligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block FL ck FL ck FL ck FL ck Guest write In a misaligned IO configuration each windows block is stored on multiple blocks on the storage array
  • 26. Misaligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block FL ck FL ck FL ck FL ck This guest write is 2 partial writes (1/2 of 2 blocks).The storage controller caches the writes and acknowledges the guest as before Cached in NVRAM FL ck WA blo ACK
  • 27. Misaligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block FL ck FL ck FL ck FL ck Cached in NVRAM FL ck WA blo This write is called a partial write since it only writes parts of 2 blocks. We need to preserve the other half of the block! Still need!
  • 28. Misaligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block FL ck FL ck FL ck FL ck Cached in NVRAM FL ck WA blo Still need! To do this we must first read the old blocks in. Normally this is done during a CP or when an entire write stripe is ready as the rest of the block may come in. Reads
  • 29. WAFL block WAFL block Misaligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block FL ck FL ck FL ck FL ck Cached in NVRAM FL ck WA blo Still need! Now we can build the new Blocks and write back out.
  • 30. Misaligned VM IO VMDK NTFS block WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block VMFS block VMFS block VMFS block NTFS block NTFS block NTFS block NTFS block FL ck FL ck FL ck FL ck Invalidated Now we can build the new blocks and write back out.
  • 31. Net effect • This process causes consistency points to take longer in duration • Increases CPU load on the controller • No effect on performance to the VM, if the controller can “keep up” • If load increases, then a dramatic spike in latency can occur • Ultimately determines how many VMs can be hosted on a storage system
  • 32. How to correct misalignment • Adjust the MBR or boot sector with MBRalign or VMware converter • Permanent solution • Requires Downtime for the VM • Create an “Optimized Datastore” • No downtime required for the VM • Limited Vendors offer this • Must be sure not to mix misaligned VMs and aligned VMs
  • 33. Servers with Misaligned Partitions Report
  • 35. Misaligned VMs on optimized LUN VMDK NTFS block NTFS block NTFS block NTFS block MBR or starting offset WAFL block WAFL block WAFL block WAFL block WAFL block WAFL block The VMDK is aligned to the VMFS file system. VMFS block VMFS block VMFS block The VMFS file system is “improperly” aligned to the storage file system so that the NTFS blocks align to the storage blocks. This offset causes the NTFS blocks to be aligned with the storage blocks
  • 36. Getting closer • Remaining latency spike late at night with no corresponding IO. • Time coincided with aggregate-level snapshot. • Aggregate snapshot is on by default on every system. Usually there is no noticeable activity. • Will trigger a disk cleanup process, if significant space is released. • The cleanup process was colliding with the SQL DB copy causing the latency spike. (has since had it’s priority adjusted)
  • 37. • Still had lingering – and seemingly random – spikes • Use Veeam’s Management Pack for VMware • Agentless vSphere monitoring and management • Systems Center Operations Manager Plug-In • Used report “Virtual Machines: Disk Performance History” The cumulative effect of client software
  • 38. Correlating VM disk activity AV client maintenance synchronized across VMs
  • 40. What did we learn? • An underused storage subsystem can mask environment misconfigurations. • Storage performance issues are rarely due to a single cause. • In this case, there were three causes: 1. VM alignment 2. Storage resource contention from background process, and 3. Suboptimal antimalware configuration.
  • 41. Other lessons learned 1. Invest in monitoring tools to detect problems. 2. Fix misconfigurations before they become a problem. 3. Engage your vendor to assist with the troubleshooting process.
  • 43. Thank you and have a great VMworld 2012!
  • 44. FILL OUT A SURVEY EVERY COMPLETE SURVEY IS ENTERED INTO DRAWING FOR A $25 VMWARE COMPANY STORE GIFT CERTIFICATE
  • 45. Tracking Down Storage Performance Issues: A Customer’s Perspective Keith Aasen, NetApp Scott Elliott, Christie Digital INF-STO1430 #vmworldinf