Right data right place righttime: automated datatieringMike KlemmDistinguished Engineer
Right data right place righttime: automated datatieringLazarus VekiaridesExec. Director, EqualLogicSoftware Engineering
Fluid Data tenets: engineered to move yourbusiness forward Better       Best      Innovative   Maximum      Dynamic    Nat...
Data growth and its impacts         WW Enterprise Storage Capacity Shipped                      (Petabytes)2 48% CAGR grow...
Storage tiering provides two benefits                            Per-GB costs for high-capacity storage can beReduces cost...
Not all tiering is created equal                                               Metadata:• Complete virtualization of stora...
Dell storage tiering approaches EqualLogic Load Balancing             Capabilities                        Use cases       ...
EqualLogic automated tieringNetworkI/O balancing and MPIOCapacityManages disk utilizationAutomatic performanceOptimizes hi...
Automatic load balancingSpreads data across allavailable resources                                     EqualLogic SAN• Dyn...
Hybrid arrays: auto tiering within the arrayPrioritize placement forapplication optimizationand maximum value• First place...
Virtualized, automated SAN-wide tiering Solution • Tier 1: 1 PS6010XVS • Tier 2: 5 PS6510XV • Tier 3: 3 PS6510X • 250 VMwa...
Compellent automated tieringMetadataTracks page usage within volumesMigratesBetween tiers, RAIDs and within drivesWritesDi...
Data progressionDaily data movement• Default start time is 7 p.m.• Data is moved per page• Historical Replay pages  eligib...
Fast Track• Places the busiest pages on      Full disk drive        With fast track  the outer portion of the drive  – the...
Efficient operations at cloud scale Solution • Compellent SAN:      – Tier 1: 9 SSDs      – Tier 2: 15K RPM Fibre Channel ...
How Fluid Cache tier works  Low-latency  interconnect  Storage  network • Extension of fluid data tiering               Fl...
Accelerating efficienciesHigh Availability                                     Unprecedented Performance• Fault tolerant b...
Let Dell help you fulfill your whole IT visionwith our extensive enterprise portfolio    Storage             Servers      ...
Dell Fluid Data architectureThe right data in the right place at the right time for the right cost                        ...
What next?                     Visit solutions expo              Schedule Solutions Center visit                      Sche...
Please help Dell meet your needsby filling out the Session Evaluation Surveys    On paper    Content                      ...
Right data right place right time: automated data tiering - Mike Klemm, Distinguished Engineer and Lazarus Vekiarides Exec...
Upcoming SlideShare
Loading in …5
×

Right data right place right time: automated data tiering - Mike Klemm, Distinguished Engineer and Lazarus Vekiarides Exec. Director, EqualLogic

1,025 views

Published on

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,025
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
24
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • So data growth means more of a good thing, at least potentially. The big challenge that often keeps us from leveraging all this good data is inefficient data management. [click] The impact of this inefficiency can be huge, when you consider that storage typically accounts for some 17% of total IT budgets. [click] When you then consider that up to 75% of storage is underutilized, [click] that around 46% of your storage team’s time is spent on admin operations, [click] and that your data’s growth will likely speed up, not slow down over time, you start to gain a real appreciation for the costs – capital, operating – of inefficient storage.
  • No matter whether the mix of dynamic data and more inert data you manage is 50-50 or 60-40 or even 70-30, the ability to tailor your storage to your different types of data can make a big difference. For one thing, the per-gigabyte cost for solid state disk storage can reach 30 times the per-gigabyte cost of a Tier 2 or Tier 3 storage technology such as SATA or nearline SAS. So that’s our first incentive for tiering our storage: saving costs. [click] Take the unstructured data, the files, documents, images, maybe video, and put it on storage that offers high capacity and low cost. But when you do that, you’re also freeing up your high-performance SSD or 15,000 RPM SAS storage [click], which can then better support your dynamic applications, your demanding databases. So does this all make sense? By tiering your shared storage, you have a means both for reducing costs and for improving performance. Why haven’t we done this all along? Maybe it’s because tiering storage the old-fashioned way was barely worth the effort. To tier manually takes inventorying your applications, [click] and let’s do things right by using a three-year planning window. [click] What kind of IOPS performance do these apps require? Maybe there are minimum performance criteria – like SLAs – that you’ll need to support. [click] What kind of data growth are we looking at in this planning window? [click] Once we have all this, it’s time to design our tiering scheme. Once this is implemented and given a chance to run a while, [click] it only makes sense to monitor things. Make sure you achieved in the tiering what you set out to achieve. [click] To the extent you may not have – things change, you’re working with a whole set of moving targets – you’ll need to make adjustments, tweak things here and there. Before too long, you’re going to do it all again. [click] It’s an inexact science at best, and it can take a lot of work and a lot of time.
  • To get on top of these trends – indeed, to get the most of your rich, enabling data – we believe you need to virtualize your storage. Storage virtualization decouples IT and business operations from the complexity and the nuts and bolts of storage hardware and networking. With virtualized storage, you can unleash all your storage specialists on your team on more strategic initiatives, as well as enable the smart generalists on your team to add basic storage administration to their scope of responsibilities. Dell’s particular approach to storage virtualization doesn’t stop at that decoupling, by the way. It also involves driving huge levels of intelligence into our storage systems, which paves the way for all kinds of automation, including the automated tiering we’re talking about today. [click] Specifically, automated tiering can help reduce complexity, reduce costs, and maximize application performance.
  • Goal: Interface load is balancedInternal communication is balancedI/O is directed to the most used arrayBest network paths are preferredUse case 1: setup and spreadingUse case 2: A network port gets overloaded
  • No matter whether the mix of dynamic data and more inert data you manage is 50-50 or 60-40 or even 70-30, the ability to tailor your storage to your different types of data can make a big difference. For one thing, the per-gigabyte cost for solid state disk storage can reach 30 times the per-gigabyte cost of a Tier 2 or Tier 3 storage technology such as SATA or nearline SAS. So that’s our first incentive for tiering our storage: saving costs. [click] Take the unstructured data, the files, documents, images, maybe video, and put it on storage that offers high capacity and low cost. But when you do that, you’re also freeing up your high-performance SSD or 15,000 RPM SAS storage [click], which can then better support your dynamic applications, your demanding databases.
  • No matter whether the mix of dynamic data and more inert data you manage is 50-50 or 60-40 or even 70-30, the ability to tailor your storage to your different types of data can make a big difference. For one thing, the per-gigabyte cost for solid state disk storage can reach 30 times the per-gigabyte cost of a Tier 2 or Tier 3 storage technology such as SATA or nearline SAS. So that’s our first incentive for tiering our storage: saving costs. [click] Take the unstructured data, the files, documents, images, maybe video, and put it on storage that offers high capacity and low cost. But when you do that, you’re also freeing up your high-performance SSD or 15,000 RPM SAS storage [click], which can then better support your dynamic applications, your demanding databases.
  • Server-side caching solution that creates a large-scale, low latency cache by pooling DELL Express Flash SSD’s both within a server and between servers across a networkThe cache pool is mounted over a Compellent SAN and will accelerate writes, read-after-writes and re-reads.The size of the cache pool is limited only by the total amount of SSD capacity in the server or clusterThe cache pool is highly available and resilient to failuresConnecting the servers using a high-speed fabric such as 10/40Gigabit Ethernet or Infiniband allows this cache pool to be accessible several orders of magnitude faster than traditional storage.cache is size of total cache in the clusterwrite-back cache capability provides HALUN management coordinated between server and storageCache integrated into storage amangementdon't need to configure separete from storageConfigured as part of storagein failure situations - stil have access to cached data being written from a secondary serverAccess of data when it doesn't go through the cache - preventing the customer from messing up the dataRDMA over ethernet could also use infinibandExtension of fluid data tieringCluster-wide cache poolH/A through redundant cache writesEase of managementFull integration with array data protection
  • In addition to Dell’s storage solutions, our portfolio of clients, servers, networking and software products can help you approach your IT initiatives comprehensively and flexibly. And our Dell Storage Solutions incorporate a global team of over 30,000 trained and experienced storage and IT professionals to help put you on the path to success.
  • Consulting Services -- Dell Data Protection experts help companies of all sizes by providing an objective data quality assessment through the process of discovery and analysis. This objective view of data values leads to design transformations that optimize your storage ecosystem through efficient tiering. Take the first step in this transformational journey from the data center to the Cloud by scheduling an initial whiteboarding session with our services professionals.
  • Right data right place right time: automated data tiering - Mike Klemm, Distinguished Engineer and Lazarus Vekiarides Exec. Director, EqualLogic

    1. 1. Right data right place righttime: automated datatieringMike KlemmDistinguished Engineer
    2. 2. Right data right place righttime: automated datatieringLazarus VekiaridesExec. Director, EqualLogicSoftware Engineering
    3. 3. Fluid Data tenets: engineered to move yourbusiness forward Better Best Innovative Maximum Dynamic Native cloud Scale-out Selftogether ownership licensing efficiency tiering integration designs protecting experience
    4. 4. Data growth and its impacts WW Enterprise Storage Capacity Shipped (Petabytes)2 48% CAGR growth in storage Storage capacity budget is requirements 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 hardware1 Average Disk Allocation Levels 46% (upper/lower range shown)2 100 Total Hours 80 Provisioning 60 40 time of Migrationsamount of 20 storage staff Backupstorage typically 0 spent onunderutilized operations1 Storage Administration 1 Horizon Information Strategies, Storage Facts, Figures, Best Practices and Estimates, April 2011 2 IDC, Worldwide Enterprise Storage Systems 2011-2015 Forecast: “Emerging” Once Again is a Keyword in the Storage Market, #228255, May 2011
    5. 5. Storage tiering provides two benefits Per-GB costs for high-capacity storage can beReduces costs 1/30th the cost of high-performance storage *• Aligns inactive data with capacity- optimized storage Performance 30X Tier 1 Improves Cost 3X performance Tier 2 • Aligns demanding X applications with Tier 3 performance optimized storage *Cost multiples are estimates only, based on recent component cost data, and subject to change. 5
    6. 6. Not all tiering is created equal Metadata:• Complete virtualization of storage information about each block• Granular approach• Intelligent automation• Non-disruptive data movement• Performance and capacity Read optimization 20 times since 12:05:09 . . . Storage pool Volume
    7. 7. Dell storage tiering approaches EqualLogic Load Balancing Capabilities Use cases • Optimizes data placement and • Highly virtualized, IP-based network load storage with a peer-scale architecture • Balances load in a pool of arrays – Capacity • Ideal for IT generalist – Performance • Balances load within a hybrid arrayCompellent Data Progression • Optimizes data placement across • Multi-protocol, core storage 3 tiers for the enterprise and • Balances across multiple axes critical workloads – Drive type • Custom-tuned by storage – RAID level administrator – Platter location
    8. 8. EqualLogic automated tieringNetworkI/O balancing and MPIOCapacityManages disk utilizationAutomatic performanceOptimizes high I/O data distributionthroughout the poolHybrid arrayDistributes high I/O data throughoutthe array
    9. 9. Automatic load balancingSpreads data across allavailable resources EqualLogic SAN• Dynamically adjust to workloads• Real time I/O balancing SSD NL-SAS• Move hot and cold data• Improves performance Array 1 Array 2• Capacity balancing Storage pool• Access frequency vs. array latency
    10. 10. Hybrid arrays: auto tiering within the arrayPrioritize placement forapplication optimizationand maximum value• First place data on SSD• Move data to HDD• Frequent access = hot data• Move hot data to SSD• A portion of SSD can be used as controller write cache extension• Very different than SSD cache
    11. 11. Virtualized, automated SAN-wide tiering Solution • Tier 1: 1 PS6010XVS • Tier 2: 5 PS6510XV • Tier 3: 3 PS6510X • 250 VMware vSphere™ VMs running Microsoft™ SQL Server® and other “Dell EqualLogic greatly simplifies business applications the process of adding a new SAN…. The firmware automatically begins Results load balancing to the new • 2X improvement in query performance array, with no downtime.” • 80X faster creation of new customer environments Robert Scarbrough • 100% availability over 16 months – Vice President, Technical zero need for planned downtime Operations OptumInsight
    12. 12. Compellent automated tieringMetadataTracks page usage within volumesMigratesBetween tiers, RAIDs and within drivesWritesDirectly to tier 1
    13. 13. Data progressionDaily data movement• Default start time is 7 p.m.• Data is moved per page• Historical Replay pages eligible to move to lowest tier immediately• Data progression will run against multiple volumes at the same timeRAID restripe occurs• Adding additional drives 13
    14. 14. Fast Track• Places the busiest pages on Full disk drive With fast track the outer portion of the drive – the “Fast Tracks”• Uses the same data movement engine as data progression• Data is moved when progression runs The most active blocks are dynamically and automatically placed on outer edge 14
    15. 15. Efficient operations at cloud scale Solution • Compellent SAN: – Tier 1: 9 SSDs – Tier 2: 15K RPM Fibre Channel – Tier 3: 7,200 RPM SATA drives • 29 VMs running SAP applications on 4 “Thanks to the way the Dell PowerEdgeR710 servers architecture manages data at the block level, we discovered we could perform I/O writes to Results the storage system very • Performance gains from 80% to 400% rapidly.” • Full backups down from 16 to 6 hours • System-wide snapshot process now Kai Sookwongse takes 30 seconds Mazda North American Operations
    16. 16. How Fluid Cache tier works Low-latency interconnect Storage network • Extension of fluid data tiering Fluid cache SW • Cluster-wide cache pool PCIE SSD • H/A through redundant cache writes • Ease of management • Full integration with array data protection
    17. 17. Accelerating efficienciesHigh Availability Unprecedented Performance• Fault tolerant between nodes • Accelerate applications by up to 30X* over traditional SAN with rotating media • Increase the number of users per server by 10XEfficient Better together• Double the number of VMs per server • Systems management for server and SAN• Reduce infrastructure support by 1/6 for work in concert same user count and SLAs • Largest working data set for cache with full coherency to SAN Confidential * Garner #G00231588 IOPS HDD vs. SSD
    18. 18. Let Dell help you fulfill your whole IT visionwith our extensive enterprise portfolio Storage Servers Networking Services Fluid Data PowerEdge servers Virtual Network A architecture is feature customer- Architecture: modular, adaptive designed to inspired design, innovation drives and flexible automatically and advanced systems responsiveness, agi approach, configur management, and intelligently innovations in lity, and efficiency able for any optimize data energy efficiency for next- organizational size everywhere generation or budget networks
    19. 19. Dell Fluid Data architectureThe right data in the right place at the right time for the right cost 100X Fluid Cache tier Tier 0 30X Tier 1 3X Tier 2 X = relative performance Tier 3 Application Primary Offsite Backup Object- Cloud servers storage storage to disk and based tape archive
    20. 20. What next? Visit solutions expo Schedule Solutions Center visit Schedule an EBC Whiteboard Consulting Visit services kiosks in expoDell World 2012
    21. 21. Please help Dell meet your needsby filling out the Session Evaluation Surveys On paper Content On the Dell World app style • Forms in room • Turn in on the way out guidelines 1. Select My Schedule Session Evaluation Survey 2. Select session to evaluate 3. Select Surveys 4. Select survey title Or 5. Simply complete the survey Confidential

    ×