Data Domain Overview
  • Note to Presenter: View in Slide Show mode for hyperlink in footer to work.
    According to the latest TheInfoPro Wave Storage Study, 48 percent of Fortune 1000 respondents have backup deduplication in use and another 40 percent have it either in pilot or in their future plans. That’s about nine in 10 respondents either using deduplication or moving to it, giving deduplication a “Technology Heat Index” rank of 1. In other words, the move is on from a tape-centric backup architecture to a disk-centric backup architecture based on deduplication technologies.
    Note to Presenter: Details of the May 2011 release of TheInfoPro Wave 15 Storage Study can be found at this URL: http://www.theinfopro.com/2011/05/latest-it-market-study-from-theinfopro-f1000-enterprises-2011-storage-spend-continues-at-a-strong-pace/. TheInfoPro’s “Technology Heat Index” is widely regarded as an effective measure of user “demand” for a technology and, from a vendor’s perspective, a good indicator of the relative size of the market opportunity.
  • This was expected to be a $2.7B market in 2011, and it hasn’t really been tracked until now. EMC is in a clear leadership position with over 62% market share. Of all the backup market segments, that’s a great place to be, since this segment is growing the fastest and is forecast to be over a $5B market by 2015. This means the market is embracing these solutions.
  • Another important differentiator for Data Domain systems is the Data Invulnerability Architecture. Data Domain Data Invulnerability Architecture lays out the industry's best defense against data integrity issues by providing unprecedented levels of data protection, data verification, and self-healing capabilities that are unavailable in conventional disk or tape systems.
    There are three key areas of data integrity protection described on this slide.
    First is end-to-end data verification at backup time. As illustrated by the graphic at the right, end-to-end verification means reading data after it is written and comparing it to what was sent to disk, proving that it is reachable through the file system to disk and that the data is not corrupted. Specifically, when the Data Domain Operating System receives a write request from backup software, it computes a checksum over the data. After analyzing the data for redundancy, it stores the new data segments and all of the checksums. After all the data has been written to disk, Data Domain Operating System verifies that it can read the entire file from the disk platter and through the Data Domain file system, and that the checksums of the data read back match the checksums of the written data. This confirms the data is correct and recoverable from every level of the system. If there are problems anywhere along the way (for example, if a bit has flipped on a disk drive), they will be caught. Since most restores happen within a day or two of backups, systems that verify and correct data integrity slowly over time will be too late for most recoveries.
    Second is a self-healing file system. Data Domain systems actively re-verify the integrity of all data every week in an ongoing background process. This scrub process will find and repair defects on the disk before they can become a problem. In addition, real-time error detection ensures that all data returned to the user during a restore is correct. On every read from disk, the system first verifies that the block read from disk is the block expected. It then uses the checksum to verify the integrity of the data. If any issue is found, the Data Domain Operating System will self-heal and correct the data error.
    Third, in addition to data verification and self-healing, there is a collection of other capabilities. Data Domain with RAID 6 provides double disk failure protection; NVRAM enables fast, safe restart; and snapshots provide point-in-time file system recoverability.
    Backups are the data store of last resort. Data Domain Data Invulnerability Architecture provides extra levels of data integrity protection to detect faults and repair them, so that backup data and recovery are not at risk.
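The write-then-read-back check described in these notes can be sketched in a few lines. This is only an illustration under stated assumptions: SHA-256 stands in for whatever checksum the Data Domain Operating System uses, the fixed `SEGMENT_SIZE` and all function names are hypothetical, and an ordinary file read may be served from the operating system cache rather than the disk platter, which a real implementation would have to rule out.

```python
import hashlib
import os
import tempfile

SEGMENT_SIZE = 4096  # hypothetical fixed segment size, for illustration only


def checksum(data: bytes) -> str:
    """Stand-in checksum; SHA-256 is an assumption, not the product's algorithm."""
    return hashlib.sha256(data).hexdigest()


def verify(path: str, expected: list[str]) -> bool:
    """Re-read the file and check every segment against its stored checksum."""
    with open(path, "rb") as f:
        for want in expected:
            if checksum(f.read(SEGMENT_SIZE)) != want:
                return False
        return f.read(1) == b""  # no unexpected trailing data


def write_and_verify(path: str, payload: bytes) -> list[str]:
    """Checksum the incoming data, write it, then read it back and compare."""
    segments = [payload[i:i + SEGMENT_SIZE]
                for i in range(0, len(payload), SEGMENT_SIZE)]
    expected = [checksum(s) for s in segments]  # computed when the write arrives
    with open(path, "wb") as f:
        f.write(payload)
        f.flush()
        os.fsync(f.fileno())  # ask the OS to push the data to stable storage
    if not verify(path, expected):
        raise IOError("read-back verification failed")
    return expected


def demo() -> tuple[bool, bool]:
    """Return (clean file verifies, file with one flipped bit verifies)."""
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "backup.bin")
        sums = write_and_verify(path, os.urandom(3 * SEGMENT_SIZE + 100))
        clean = verify(path, sums)
        with open(path, "r+b") as f:  # simulate a bit flipped on disk
            f.seek(5000)
            byte = f.read(1)
            f.seek(5000)
            f.write(bytes([byte[0] ^ 0x01]))
        return clean, verify(path, sums)
```

Calling `demo()` verifies the freshly written file and then shows the same check rejecting the file after a single bit is flipped. The same `verify` routine could be rerun on a schedule, which is roughly the role the notes give to the weekly background scrub.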
  • In addition to DD Boost, EMC offers five additional Data Domain software options that can enhance the value of a Data Domain system in your environment.
    Note to Presenter: Click now in Slide Show mode for animation.
    The first is DD Retention Lock software, which enables you to easily implement deduplication with file locking to satisfy IT governance and compliance standards, including SEC 17a-4(f), for archive data.
    Note to Presenter: Click now in Slide Show mode for animation.
    Next is DD Replicator software, which provides fast, network-efficient, encrypted replication for disaster recovery, remote office data protection, multi-site tape consolidation, and long-term offsite retention. DD Replicator asynchronously transfers only the compressed, deduplicated data over the WAN, making network-based replication cost-effective, fast, and reliable. In addition, you can replicate up to 270 remote sites into a single Data Domain system for consolidated protection of your distributed enterprise.
    Note to Presenter: Click now in Slide Show mode for animation.
    Next is DD Virtual Tape Library software, which eliminates tape-related failures by enabling all Data Domain systems to emulate multiple tape devices over a Fibre Channel interface. This software option provides easy integration of deduplication storage in open systems and IBM i environments.
    Note to Presenter: Click now in Slide Show mode for animation.
    Next is DD Extended Retention software, which enables long-term retention of backup data on the DD860 or DD990 with up to 65 PB of logical capacity.
    Note to Presenter: Click now in Slide Show mode for animation.
    Finally, DD Encryption software protects backup and archive data stored on Data Domain systems with encryption that is performed inline, before the data is written to disk. Encrypting data at rest satisfies internal governance rules and compliance regulations and protects against theft or loss of a physical system. The combination of inline encryption and deduplication provides the most secure data-at-rest encryption solution available.
  • Here’s a look at the latest Data Domain product family, including the new DD990. The capabilities previously available in a DD Archiver are now available only with the DD Extended Retention software option, on two platforms. As you can see, the capacity supported for the DD860 and DD990 now includes a line dedicated to DD Extended Retention.
  • Thank you.
  • OPTIONAL SLIDE
    EMC Global Services is a large component of your total EMC experience. EMC Global Services allows you to…
    Save money by:
      • Significantly lowering your implementation and operating expenditure costs
      • Filling internal resource gaps for less
      • Protecting your investments in EMC solutions
    Accelerate time to value by:
      • Reducing deployment time
      • Accelerating return on investment for new projects
      • Easing the burden of compliance while protecting critical business information
    Mitigate risk and get better results by:
      • Configuring the solution to meet your requirements
      • Improving your service levels and reducing your management costs
      • Using EMC best practices and unmatched product expertise = superior customer experience
      • Reducing disruption while taking advantage of the features and benefits of the latest EMC products and solutions

Data Domain Overview Presentation Transcript

  • EMC DATA DOMAIN OVERVIEW
    Daniel Souza
    EMC² Products and Storage Support & Unix And SAN
    © Copyright 2012 EMC Corporation. All rights reserved.
  • EMC Data Domain: Leadership and Innovation
    A history of industry firsts
    [Timeline, 2003 to 2012. Milestones shown:]
      • First deduplication NAS
      • First deduplication virtual tape library
      • Largest deduplication array
      • Fastest backup controller
      • First long-term retention system for backup and archive
      • First deduplication volume replication
      • First deduplication directory replication
      • Cascaded replication
      • First deduplication nearline storage
      • First distributed processing
      • First inline deduplication for compliant archiving
  • Deduplication Dramatically Reduces Storage Capacity Requirements
    Deduplication: 10–30 times less data stored versus fulls + incrementals with typical retention policies
    [Chart: Data Stored (0 to 30) versus Weeks in Use (1 to 20); series: Deduplication storage, Traditional storage]
  • Backup Data Reduction/Deduplication
    Time series of large enterprise implementation
    Over three years, in-use rates for backup with deduplication have risen from 15% to 48%
    [Stacked bar chart; values as printed, left to right]
    2H 07    15%  15%  14%  31%  25%
    2H 08    24%  12%  16%  28%  21%
    1H 09    27%   8%  15%  25%  26%
    2H 09    40%   4%  14%  22%  20%
    1H 10    46%   6%  14%  17%  18%
    1H 11    48%   7%  16%   7%  10%  13%
    Legend: In Use Now, In Pilot/Evaluation, In Near-term Plan, In Long-term Plan, Past Long-term Plan, Not in Plan
    Source: Wave 15 Storage Study – Q2 2011, published 5/16/11, large-enterprise sample; H '07, n=151; 2H '08, n=127; 1H '09, n=147; 2H '09, n=182; 1H '10, n=146; 1H '11, n=31; TheInfoPro (www.theinfopro.com)
  • Backup Data Reduction/Deduplication
    Large enterprise
    The “in-use” rating for EMC is now over 3x that of its nearest competitor
    [Bar chart: EMC and Competitors 1 through 7; horizontal axis 0% to 70%]
    Legend: In Use Now (Not Including Pilots), In Pilot/Evaluation (Budget Has Already Been Allocated), Near-term Plan, Long-term Plan, Past Long-term Plan (> 18 Months Out)
    Source: Wave 15 Storage Study – Q2 2011, published 5/16/11, large-enterprise sample, n=31, TheInfoPro (www.theinfopro.com)
  • Purpose-Built Backup Appliances
    Open systems + mainframe
    Worldwide Supplier Revenue for the First Half of 2011, Total PBBA Market
    [Pie chart: EMC 62%; other slices: IBM, HP, Quantum, Symantec, Others]
    First Half 2011 Total Market: $1.2B
    Source: Worldwide Purpose-Built Backup Appliance 2011–2015 Forecast Update: Explosive Growth in 2011, December 2011, IDC, Doc # 232070.
  • With Data Domain Deduplication Storage Systems, You Can…
      • Retain longer: Keep backups onsite longer with less disk for fast, reliable restores, and eliminate the use of tape for operational recovery
      • Replicate smarter: Move only deduplicated data over existing networks (WAN) with up to 99% bandwidth efficiency for cost-effective disaster recovery
      • Recover reliably: Continuous fault detection and self-healing ensure data recoverability to meet service level agreements
  • Deduplication Fundamentals
  • Data Domain Basics
    Easy integration with existing environment
      • Control Tier: Backup and Archive Applications (EMC, Symantec, CommVault, IBM, HP, Veeam, Quest)
      • Target Tier: DD890 appliance, reached over Ethernet (CIFS, NFS, NDMP, DD Boost) or as a Virtual Tape Library (VTL) over Fibre Channel
      • Disaster Recovery Tier: DD890 appliance, fed by Replication
  • Data Deduplication: Technology Overview
    Store more backups in a smaller footprint
    Segments in each backup:
    Friday Full Backup           A B C D A E F G
    Mon Incremental              A B H
    Tues Incremental             C B I
    Weds Incremental             E G J
    Thurs Incremental            A C K
    Second Friday Full Backup    B C D E F L G H
    Unique segments stored       A B C D E F G H I J K L

    Backup Data                Logical    Estimated Reduction    Physical
    Friday Full                1 TB       2–4x                   250 GB
    Monday Incremental         50 GB      7–10x                  5 GB
    Tuesday Incremental        50 GB      7–10x                  5 GB
    Wednesday Incremental      50 GB      7–10x                  5 GB
    Thursday Incremental       50 GB      7–10x                  5 GB
    Second Friday Full         1 TB       50–60x                 18 GB
    TOTAL                      2.2 TB     7.6x                   288 GB
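The segment diagram on this slide can be reproduced with a small content-addressed store. The sketch below is a toy under stated assumptions: each letter stands for one data segment, SHA-256 is an assumed fingerprint function, and the names `ingest`, `restore`, `store`, and `recipes` are hypothetical. It does not model variable-length segmentation or local compression.

```python
import hashlib

# Backup streams from the slide, one letter per data segment.
BACKUPS = {
    "Friday full": "ABCDAEFG",
    "Mon incremental": "ABH",
    "Tues incremental": "CBI",
    "Weds incremental": "EGJ",
    "Thurs incremental": "ACK",
    "Second Friday full": "BCDEFLGH",
}


def ingest(backups: dict[str, str]) -> tuple[dict[str, str], dict[str, list[str]]]:
    """Store each unique segment once; keep a per-backup recipe of fingerprints."""
    store: dict[str, str] = {}          # fingerprint -> segment data
    recipes: dict[str, list[str]] = {}  # backup name -> ordered fingerprints
    for name, stream in backups.items():
        recipe = []
        for segment in stream:
            fp = hashlib.sha256(segment.encode()).hexdigest()  # assumed fingerprint
            store.setdefault(fp, segment)  # written only if not already present
            recipe.append(fp)
        recipes[name] = recipe
    return store, recipes


def restore(name: str, store: dict[str, str], recipes: dict[str, list[str]]) -> str:
    """Rebuild a backup by following its recipe through the segment store."""
    return "".join(store[fp] for fp in recipes[name])


store, recipes = ingest(BACKUPS)
logical = sum(len(stream) for stream in BACKUPS.values())  # segments received
unique = "".join(sorted(store.values()))                   # segments kept
print(logical, len(store), unique)
```

The six backups contain 28 segments in total, of which only the 12 distinct segments A through L are stored, and every backup can still be rebuilt from its recipe. The toy ratio (28 to 12) is smaller than the slide's 7.6x because the slide's estimate is based on data sizes rather than segment counts.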
  • Retain: Store More for Longer with Less
    Over one year of retention in 3U of Data Domain deduplication storage
    Backup Data            Cumulative Logical    Estimated Reduction    Physical
    First Full             1 TB                  4x                     250 GB
    Week 1 (April 7)       2.2 TB                8x                     288 GB
    Week 2 (April 14)      3.4 TB                10x                    326 GB
    Week 3 (April 21)      4.6 TB                13x                    364 GB
    Month 1 (April 28)     5.8 TB                14x                    402 GB
    Month 2 (May 31)       10.6 TB               19x                    554 GB
    Month 3 (June 30)      15.4 TB               21x                    706 GB
    TOTAL                  15.4 TB               21x                    706 GB
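The cumulative figures in this table follow from the weekly cycle on the previous slide: after the first full, each week adds four 50 GB incrementals and one 1 TB full logically, but only 4 × 5 GB + 18 GB physically. The sketch below recomputes the rows under two assumptions that are not stated on the slide: 1 TB is treated as 1000 GB, and "Month 2" and "Month 3" are taken as weeks 8 and 12, which is what the printed totals imply.

```python
FIRST_FULL_LOGICAL_GB = 1000        # 1 TB first full backup
FIRST_FULL_PHYSICAL_GB = 250        # stored size of the first full
WEEKLY_LOGICAL_GB = 4 * 50 + 1000   # four 50 GB incrementals plus one 1 TB full
WEEKLY_PHYSICAL_GB = 4 * 5 + 18     # 5 GB per incremental, 18 GB per repeat full


def cumulative(weeks: int) -> tuple[int, int, float]:
    """Return (logical GB, physical GB, reduction factor) after `weeks` weekly cycles."""
    logical = FIRST_FULL_LOGICAL_GB + weeks * WEEKLY_LOGICAL_GB
    physical = FIRST_FULL_PHYSICAL_GB + weeks * WEEKLY_PHYSICAL_GB
    return logical, physical, logical / physical


ROWS = [("First full", 0), ("Week 1", 1), ("Week 2", 2), ("Week 3", 3),
        ("Month 1", 4), ("Month 2", 8), ("Month 3", 12)]

for label, weeks in ROWS:
    logical, physical, factor = cumulative(weeks)
    print(f"{label:<10} {logical / 1000:>5.1f} TB  {physical:>4} GB  {factor:.1f}x")
```

The logical and physical columns match the slide exactly. The reduction factors on the slide are whole numbers, so they can differ from the computed quotient by rounding; for Month 3 the division gives about 21.8, where the slide prints 21x.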
  • Data Integrity: Data Invulnerability Architecture
    End-to-end data verification
      • Generate checksum; deduplication, write to disk; verify checksum
      • Verify the file system metadata integrity
      • Verify user data integrity
      • Verify stripe integrity
    Self-healing file system
      • Cleaning expired data
      • Defrag
      • Verify
    Other
      • RAID 6
      • NVRAM
      • Snapshots
    [Diagram layers: File System, Deduplication, Local Compression, RAID]
  • Network-Efficient Replication for True Disaster Recovery
    Lowers WAN costs; improves service level agreements
    Flexible replication
      • One-to-many
      • Many-to-one
      • Bi-directional
      • System-to-system
      • Cascaded
    [Diagram: backup and archive data flows from Data Domain systems at remote sites (source) across the WAN to a Data Domain DD890 at the data center hub (destination); each link carries 1–5% of the data]
    Supports hundreds of remote sites
    95–99% cross-site bandwidth reduction
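One way to see why only a small fraction of the data needs to cross the WAN is to let the source ask the destination which segments it already holds and send only the rest. The sketch below illustrates that general idea; it is not DD Replicator's actual protocol. The `Destination` class, the `replicate` function, the SHA-256 fingerprints, and the toy data are all assumptions, and the cost of sending the fingerprints themselves is ignored.

```python
import hashlib


def fingerprint(segment: bytes) -> str:
    """Assumed segment fingerprint (SHA-256), used to detect duplicates."""
    return hashlib.sha256(segment).hexdigest()


class Destination:
    """Hypothetical replica that stores each segment once, keyed by fingerprint."""

    def __init__(self) -> None:
        self.segments: dict[str, bytes] = {}

    def missing(self, fps: list[str]) -> set[str]:
        """Report which of the offered fingerprints are not stored yet."""
        return {fp for fp in fps if fp not in self.segments}

    def receive(self, segments: dict[str, bytes]) -> None:
        self.segments.update(segments)


def replicate(backup: list[bytes], dest: Destination) -> float:
    """Send only segments the destination lacks; return the fraction of bytes sent."""
    by_fp = {fingerprint(s): s for s in backup}
    need = dest.missing(list(by_fp))
    payload = {fp: by_fp[fp] for fp in need}
    dest.receive(payload)
    total = sum(len(s) for s in backup)
    sent = sum(len(s) for s in payload.values())
    return sent / total


dest = Destination()
monday = [bytes([i]) * 1024 for i in range(100)]              # 100 distinct segments
tuesday = monday[:98] + [b"\xfe" * 1024, b"\xff" * 1024]      # 2 segments changed
first = replicate(monday, dest)
second = replicate(tuesday, dest)
print(first, second)
```

With this toy data the first replication sends everything, and the second sends only the two changed segments, that is, 2% of the bytes. That figure falls in the slide's 1–5% range only because the example was built that way; real ratios depend on how much the data changes between backups.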
  • DD Boost Software
      • Distributes parts of deduplication process to backup server or application clients
      • Speeds backups by up to 50 percent
      • Enables more efficient resource utilization
      • Provides application control of Data Domain replication process
      • Supports majority of backup software market and native utilities in industry-leading databases
        – EMC Avamar and NetWorker
        – Symantec NetBackup and Backup Exec
        – EMC Greenplum and Oracle RMAN  New!
        – Quest vRanger
  • Additional Data Domain Software Options
    Data Domain Retention Lock
      • Secure data retention for file and email archive data
      • Satisfies internal governance and compliance regulations
    Data Domain Replicator
      • Network-efficient and encrypted
      • Consolidate up to 270 remote sites into a single system
    Data Domain Virtual Tape Library
      • Easily integrates with Fibre Channel
      • Supports open systems and IBM i operating environments
    Data Domain Extended Retention
      • Long-term retention of backup data
      • Up to 65 PB logical capacity
    Data Domain Encryption
      • Inline encryption of data at rest
      • Protects against theft or loss of a physical system
  • Industry’s Most Scalable Inline Deduplication Systems
    Data Domain Software Options
      • DD Boost
      • DD Replicator
      • DD Encryption
      • DD Retention Lock
      • DD Extended Retention
      • DD Virtual Tape Library
    Market segments shown: Small Enter./ROBO, Midsize Enterprise, Large Enterprise

    Model               DD160          DD620          DD640          DD670          DD860          DD890          DD990
    Speed (DD Boost)    1.1 TB/hr      2.4 TB/hr      3.4 TB/hr      5.4 TB/hr      9.8 TB/hr      14.7 TB/hr     31.0 TB/hr
    Speed (other)       667 GB/hr      1.1 TB/hr      2.3 TB/hr      3.6 TB/hr      5.1 TB/hr      8.1 TB/hr      15.0 TB/hr
    Logical capacity    40–195 TB      83–415 TB      0.32–1.6 PB    0.6–2.7 PB     1.4–7.1 PB     2.9–14.2 PB    5.7–28.5 PB
                                                                                    5.7–28.5 PB¹                  13–65 PB¹
    Usable capacity     Up to 3.98 TB  Up to 8.3 TB   Up to 32.2 TB  Up to 55.9 TB  Up to 142 TB   Up to 285 TB   Up to 570 TB
                                                                                    Up to 570 TB¹                 Up to 1.3 PB¹
    ¹ With DD Extended Retention software option
  • Deduplication Storage Evaluation Criteria
  • Methodology: Inline vs. Post-Process Deduplication
    INLINE: Deduplication Before Storing
      • Other activities unimpeded
        − Predictable
        − Simpler
    POST-PROCESS: Deduplication After Storing
      • 3x disk accesses to shared store
      • The more processes, the more resource contention
        − Copy to tape: Too slow to stream tape
        − Recovery: Service level agreement predictability
        − Replication: Poor time-to-disaster-recovery
        − Deduplication: If interleaved with backup or restore
      • More administration to fight these issues
  • Performance: CPU-Centric vs. Spindle-Bound
    Data Domain improvement since 2004: Throughput ~200x; Capacity ~450x
    [Chart: Throughput MB/s (50 to 6,000) versus Number of Disk Spindles (50 to 200); series: Data Domain; Most deduplication vendors (Fibre Channel, SATA)]
  • Why Data Domain?
    Less disk to resource, less to manage
      • CPU-centric deduplication
      • Inline deduplication
    Simple, mature, and flexible
      • Simple, mature appliance
      • Any fabric, any software, backup or archive applications
    Resilience and disaster recovery
      • Storage of last resort
      • Fast time-to-disaster recovery (DR) readiness
      • Cross-site global compression
        – Data center or remote office
  • Data Domain Infrastructure and Ecosystem
    Supports a variety of workloads and data types
    Workloads and data sources (backup and archive): VMware, Microsoft, Microsoft SharePoint, Oracle, SAP, IBM i, midrange and mainframe, EMC DLm1000, primary storage (NAS, SAN, DAS)
    Backup Applications: EMC, CA, IBM, Symantec, HP, Atempo, CommVault, Vizioncore, BakBone
    Archive Applications: EMC, F5 Networks, Symantec, CommVault
    Disaster Recovery: Network Replication over WAN
  • Enterprise Recoverability Readiness at Disaster Recovery Site
    Data Domain inline DR-ready deduplicated replication
      • Backup; replicate during backup
    “Adaptive” post-process deduplicated DR-ready replication
      • Backup to cache: backup time 1.7 times longer than Data Domain
      • Deduplicate and replicate at less than 50% of ingest speed; two times longer if uncompressed at fixed bandwidth
    “Scheduled” post-process deduplicated DR-ready replication
      • Backup to cache: backup time 1.1 times longer than Data Domain
      • Deduplicate and replicate at less than 50% of ingest speed; two times longer if uncompressed at fixed bandwidth
    VTL/tape/truck
      • Backup to VTL, copy to tape, truck to storage, truck from storage, recall tapes; DR readiness shown as “?”
  • EMC Global Services
    Strategize, Design, Implement, Manage
    CONSULTING
      • Strategic Observation service establishes a roadmap/vision to meet your recovery objectives
      • Operational Readiness service recommends a Reference Architecture that leverages EMC deduplication technologies and optimizes your implementation
    TECHNOLOGY DEPLOYMENT
      • Best practice methodologies from architecture through integration
      • Assessment, Design/Implementation, Operational Assurance, Health Check, Data Migration
    MANAGED SERVICES
      • Residency Services provide onsite or remote skilled professionals with proven best practices and technology expertise
      • Remote Managed Services provide cost-effective, ITIL-based, 24x7 intelligent remote monitoring and operational infrastructure management
    MAINTENANCE AND SUPPORT
      • 360° global, proactive, and preemptive service procedures and solution support
    EDUCATION
      • Open Storage Technology education, EMC technology-specific learning paths, EMC Proven Professional Certification
  • Why EMC Global Services
    Save money
      • Significantly lower implementation and operating expenditures
      • Fill internal resource gaps for less
      • Protect investments in EMC solutions
    Accelerate time to value
      • Reduce deployment time
      • Accelerate return on investment for new projects
      • Ease the burden of compliance while protecting critical business information
    Mitigate risk and get better results
      • Configure the solution to meet your requirements
      • Improve service levels; reduce management costs
      • EMC best practices and unmatched product expertise = superior customer experience
      • Reduce disruption while taking advantage of the features and benefits of the latest EMC products and solutions