SOLUTION PROFILE

IBM ProtecTIER Adds Native Replication

December 2009

Ever since its acquisition of Diligent in 2008, IBM has been a top player in the enterprise-level deduplication market. The ProtecTIER deduplication product line, which includes both easy-to-deploy appliances and highly scalable gateway options, proved to be a good acquisition for both companies and allowed IBM to supplement its strong tape backup business.

However, we were initially concerned about a lack of native replication in ProtecTIER. Diligent had not added the feature and IBM did not immediately provide it post-acquisition. The main reason for the delay was that in enterprise accounts, where ProtecTIER's high performance and petabyte scalability played very well, customers already practiced array-based replication and had little interest in using another method of replication for ProtecTIER. But as IBM moved down-market, it found that mid-market customers rarely used array-based replication, citing expense and complexity. To serve this market (and any enterprise customers who wanted it), IBM added native replication capabilities to the ProtecTIER product family.

This Product Profile will detail where ProtecTIER is today, how native replication increases its value to enterprise and mid-market customers, and how customers are using ProtecTIER to protect their critical data.

Challenge

Tape still forms the backbone of many backup infrastructures. In these environments, the traditional backup process begins with backing up data from production servers onto tape or disk for short-term storage and retrieval. From there the backups are typically copied to a tape library. Tapes are either placed into 30-60 day retention cycles or are physically transported to a remote location for long-term archiving.

With their fast disk and cache, VTLs used to keep tape-based backup a workable solution. However, the storage environment has changed dramatically over the past few years, and VTL-centric backup plans have not changed with it. This has serious implications for backup and disaster recovery (DR).

Many of the challenges center on never-ending data growth. Corporate data is growing at a minimum rate of 60% a year, and many environments experience breathtaking growth rates of 100% or more year over year. These large volumes of data make it extremely difficult to back up within acceptable windows.

Copyright The TANEJA Group, Inc. 2009-2010. All Rights Reserved. 87 Elm Street, Suite 900, Hopkinton, MA 01748. Tel: 508-435-5040 Fax: 508-435-1530 www.tanejagroup.com
Large backups also require large media purchases and take up costly real estate in remote archiving locations. Reliability suffers as media demand grows, and as tape management requires more manual intervention. Increased tape media handling also heightens security risks as IT struggles to efficiently manage and transport tape.

Cost presents another major factor. Storing large amounts of data off-site is expensive in terms of capacity, maintenance overhead, and real estate. And finally, DR testing, never easy under the best of conditions, becomes exceptionally difficult with tape-based backup. When testing requires restoring backups from tape, chances are it simply is not going to happen.

Deduplication and IBM

Deduplication technologies offer tremendous advantages in solving these thorny problems, and enterprise and mid-market customers are embracing the technology. But deduplication technologies have never been one size fits all and never will be. There are too many ways to engineer where the deduplication process occurs, how fast it operates, how robust it is, and how secure the data remains.

IBM's ProtecTIER integrates deduplication with IBM's storage and system expertise to benefit enterprise and mid-market customers. ProtecTIER can scale up to 1PB of physical storage housing up to 25PB or more of compressed data behind a single server. High speed connections enable data to flow at sustained speeds up to 1000MB/sec and achieve size reductions of 25:1 or more. ProtecTIER is available in configurations ranging from an entry-level 7TB plug-n-play appliance up to an ultra-high performance active-active cluster gateway solution that is scalable to 1 petabyte of usable capacity. The back-end disk may be IBM or non-IBM, which allows IT to leverage heterogeneous storage resources.

But IBM ProtecTIER is hardly operating in a competitive vacuum. Many deduplication vendor battles are fought as much over price as features, especially in the cost-sensitive mid-market. IBM's competitive advantage is not so much a small price tag as it is an excellent ROI. The business need for enterprise-class deduplication is undeniable. Customers struggle to back up large data files within reasonable backup windows. This need for both capacity and performance drives customers to look at IBM ProtecTIER, not for the cheapest per-device price tag, but for very strong ROI from a system perspective. To match the kind of power and scalability that one ProtecTIER system provides, many competing vendors have to sell multiple deduplication systems. This perspective levels the playing field in favor of IBM ProtecTIER even in cost-conscious environments. (And who isn't cost-conscious these days?)
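The capacity and throughput figures quoted above lend themselves to a quick back-of-the-envelope check. The following is a minimal sketch, not an IBM sizing tool: the function names, the 20TB nightly backup size, the half-rate comparison system, and the use of decimal units are all assumptions made for illustration. Only the 1PB, 25:1, and 1000MB/sec figures come from this profile.

```python
MB_PER_TB = 1_000_000  # decimal units (assumption; the profile does not state which it uses)


def logical_capacity_pb(physical_pb: float, dedup_ratio: float) -> float:
    """Nominal backup data that fits on physical_pb of disk at a given reduction ratio."""
    return physical_pb * dedup_ratio


def backup_window_hours(backup_tb: float, throughput_mb_per_sec: float) -> float:
    """Hours needed to ingest backup_tb at a sustained throughput."""
    seconds = backup_tb * MB_PER_TB / throughput_mb_per_sec
    return seconds / 3600


if __name__ == "__main__":
    # 1PB of physical disk at the quoted 25:1 reduction ratio.
    print(f"Logical capacity: {logical_capacity_pb(1.0, 25.0):.0f} PB")

    nightly_tb = 20.0  # hypothetical nightly backup size, not from the profile
    # 1000 MB/sec is the quoted sustained rate; 500 MB/sec is a hypothetical
    # system running at half that rate, shown for comparison.
    for rate in (500.0, 1000.0):
        hours = backup_window_hours(nightly_tb, rate)
        print(f"{nightly_tb:.0f} TB at {rate:.0f} MB/sec: {hours:.1f} hours")
```

Under these assumptions, doubling sustained throughput halves the backup window, which is why throughput matters as much as capacity when backups must fit a fixed overnight window.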
Let us look at some of the primary features in ProtecTIER, including new native replication capabilities.

• Highly efficient design. The ProtecTIER deduplication engine, called HyperFactor, has a highly efficient index design that allows it to scale up to 1PB of base capacity without impacting indexing performance. This avoids a considerable problem for competing products that are based on hashing or content-aware algorithms.

• High performance from aggregated throughput. In non-aggregated deduplication systems, single-system throughput is a number measured against a single repository. Each node supports its own repository, and when another node is added a second, separate repository comes online. ProtecTIER instead leverages a global repository using multiple gateways. This enables ProtecTIER to scale its system with the addition of each new node. For example, a single node can sustain speeds of 500MB/sec or more, while a two-node cluster can sustain 1000MB/sec or more against one repository, as opposed to independent node-based data silos.

• Flexible systems. IBM sells different implementations of ProtecTIER to match different environments. The ProtecTIER 2-Node Cluster offers high performance and high capacity in an active-active cluster and is especially effective in environments with high performance requirements. The flexible ProtecTIER Gateway offers high inline deduplication performance, serves customers with specific design requirements, and supports both IBM and non-IBM storage systems.

• Highly reliable. ProtecTIER does not use a hash algorithm, avoiding the possibility of data loss from hash collisions. ProtecTIER's non-disruptive inline architecture makes sure that deduplication runs smoothly without interrupting existing backup processes. In addition, ProtecTIER is built on premium Fibre Channel drives and highly available cluster configurations, making it exceptionally reliable.

ProtecTIER and Replication

In addition to these enterprise-class features, IBM has added native replication to ProtecTIER. Replication serves to improve DR operations and enables more data to be protected at a lower cost, and is fast becoming a necessity for data protection.

Replication allows for fine-grained and fast recovery, and when it comes to recovering from backup tape it is orders of magnitude faster than going through the backup application's recovery process. However, replicating backup is only practical if the size of the backup has been controlled via deduplication. The great mass of primary data otherwise requires massive bandwidth and huge capacity,
making it far too awkward to replicate as a matter of course. ProtecTIER replicates only unique new deduplicated data, making ProtecTIER-based replication an important piece of a disaster recovery solution.

Mid-market customers are embracing this approach to replication, which is far simpler and less resource-intensive than array-based replication. (Not to mention considerably less expensive than buying a costly array for its replication features.)

ProtecTIER replication is also far more efficient than transporting physical tape to off-site locations. With ProtecTIER native replication, this market can now practice a number of DR scenarios. Examples include replicating within a cluster or across the data center, to devices on opposite ends of a campus, to a remote data center, or to an externally hosted DR site.

ProtecTIER's powerful deduplication capabilities reduce backup disk capacity by up to 25 times or more. This also reduces the bandwidth required to replicate data by 90% or greater. Native replication is built into all new ProtecTIER gateways and appliances and is available as a software upgrade on all existing models. (Older ProtecTIER models may need to add a second NIC to use replication.) ProtecTIER replication does one-to-one replication from one site to another, and IBM's roadmap includes many-to-one and many-to-many capabilities in the near future.

ProtecTIER works at the cartridge level to clone cartridges to the DR site via the backup application. Replication is policy-driven and its inline architecture lets it replicate concurrently with backup and deduplication operations. It can also be scheduled to run during set replication windows. ProtecTIER replication keeps bandwidth demands to a minimum by deduplicating replicated data at both the primary and secondary sites.

From the policy interface, IT assigns repositories or cartridges to be replicated and schedules the replication time frame. IT may prioritize policies for a granular level of control. ProtecTIER native replication emulates moving tapes in and out of VTL import/export slots, and IT may use the slots via the backup application to virtually move cartridges from one library or site to another. IT may define which cartridges remain visible to the replication operation and may assign locations to the cartridges. Customers may also choose to clone remote ProtecTIER virtual cartridges to tape for long-term storage at the DR site.

In case of disaster, an affected primary site will fail over to the DR site. IT can use the ProtecTIER interface screen to maintain full access to management and monitoring during the disaster. When the primary site resumes normal operations, the DR site will fail back to the primary.
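The 90% bandwidth reduction cited above can be turned into a rough replication-window estimate. This is a hedged sketch rather than a description of how ProtecTIER schedules replication: the function names, the 5TB nightly backup, and the 1000 Mbit/sec WAN link are hypothetical values chosen for illustration, and protocol overhead is ignored.

```python
def replicated_tb(backup_tb: float, bandwidth_reduction: float) -> float:
    """Data actually sent over the wire after deduplication removes a fraction of it."""
    if not 0.0 <= bandwidth_reduction < 1.0:
        raise ValueError("bandwidth_reduction must be in [0, 1)")
    return backup_tb * (1.0 - bandwidth_reduction)


def transfer_hours(data_tb: float, link_mbit_per_sec: float) -> float:
    """Hours to move data_tb over a link, ignoring protocol overhead (decimal units)."""
    megabits = data_tb * 1_000_000 * 8  # TB -> MB -> megabits
    return megabits / link_mbit_per_sec / 3600


if __name__ == "__main__":
    nightly_tb = 5.0    # hypothetical nightly backup size
    link_mbit = 1000.0  # hypothetical WAN link speed

    raw_hours = transfer_hours(nightly_tb, link_mbit)
    dedup_hours = transfer_hours(replicated_tb(nightly_tb, 0.90), link_mbit)
    print(f"Without deduplication: {raw_hours:.1f} hours")
    print(f"With 90% reduction:    {dedup_hours:.1f} hours")
```

With these assumed inputs the transfer drops from roughly 11.1 hours to roughly 1.1 hours, which illustrates why replicating backups becomes practical only once deduplication has reduced the data.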
ProtecTIER in the Real World

Nonprofit Fox Chase Cancer Center is one of the leading cancer research and treatment centers in the United States. The hospital and the research center make for an extremely demanding data protection environment.

The hospital generates critical patient care and case management records, and the research center continually produces data from nearly a hundred ongoing research programs. IT is charged with economically protecting this large volume of critical and highly regulated data.

IBM ProtecTIER is a strong component of IT's data protection success. Before ProtecTIER, Fox Chase was generating nearly a terabyte of new data a month and backups were overrunning 10-12 hour backup windows. Even with these long windows operating all night and on weekends, growing data was pushing backup times to unacceptable levels.

ProtecTIER enabled them to cut their backup time from an unacceptable 13 hours or more to just 6 hours. ProtecTIER's flexible virtual tape interface let Fox Chase saturate its Ethernet network with backup traffic without experiencing performance loss.

The customer also achieved strong storage compression in the primary data center, where ProtecTIER stores 250TB of data in a 22TB allocation. Fox Chase keeps several weeks of backups on the ProtecTIER system, which has proved extremely helpful for the immediate recovery of end-user files. Finally, this very dynamic data center changes frequently. ProtecTIER's architecture scales throughput and capacity to manage dynamic environments like this.

ProtecTIER Native Replication

Now that IBM has added native replication to ProtecTIER, Fox Chase plans to expand its ProtecTIER installation and to add replication as part of a comprehensive disaster recovery and business continuity revamp.

Fox Chase maintains two data centers 12 miles apart. Fox Chase plans to add a second ProtecTIER to the secondary site, where almost 50% of its servers are now located. Fox Chase will use each site's ProtecTIER for local backup, and each will then replicate its backup to the opposite site. This setup will maintain full data protection and failover/failback access from either location.

Fox Chase must allow for two different critical data types, each with its own set of backup and data protection challenges. The first type is very large files containing study and experimental data from its research arm. The hospital generates the second type of data, which includes patient and medical care data that is highly regulated and very sensitive.

The first type requires high capacity and fast throughput to store and replicate the very large files within acceptable windows. High compression ratios enable
Fox Chase to back up these large backup objects at high speed, and to use highly compressed storage for lower capacity costs. The storage arrays will do array-based, live frame-by-frame replication of the critical clinical data. ProtecTIER will provide backup replication for both file types.

Benefits of ProtecTIER with Native Replication

ProtecTIER offers strong benefits to enterprise and mid-market customers. Together they form ProtecTIER's high ROI in the data protection environment.

1. Benefit #1: High performance backup and restore. ProtecTIER is a game-changing technology for backup and restore. It greatly increases the speed and reliability of backup operations, which in turn supports service level agreements, recovery point objectives, and backup performance.

2. Benefit #2: Secure and available offsite archives. Instant electronic vaulting between primary and remote sites protects data integrity and recoverability. ProtecTIER only replicates a small amount of unique new data, resulting in extremely efficient bandwidth use. Electronic data movement replaces physical transport and its attendant risk of data loss. IT management overhead can be a real issue with remote data management, so ProtecTIER enables IT to manage local and remote ProtecTIER installations from a single local console.

3. Benefit #3: Strong disaster recovery and business continuity. ProtecTIER disk-based archives are optimized for DR and business continuity. Failover and failback options between primary and remote sites ensure continuity during disaster recovery. Fast ProtecTIER-to-ProtecTIER connections greatly increase the speed of data restores over the WAN. ProtecTIER replication's compact bandwidth also enables more frequent testing of DR plans.

4. Benefit #4: High scalability and throughput. ProtecTIER's architecture achieves very high sustained throughput for extremely fast processing and replication. Clusters and a global repository allow single nodes to sustain throughput in the 500MB/sec range with peak throughputs topping 650MB/sec. By adding a second node and supporting a global repository, IBM can push the sustained throughput rate into the 1000MB/sec range, with peak throughputs even higher.

5. Benefit #5: Native Replication. Automated data replication to the remote site yields dramatic improvements to the backup and recovery process. Inline deduplication enables ProtecTIER to replicate concurrently with backup operations.
Concurrent deduplication saves significant backup and replication time and dramatically lowers bandwidth requirements. IT can also replace risky physical tape transport by cloning virtual tape cartridges to physical media at the remote site.

Taneja Group Opinion

No one has ever accused IBM of doing things by halves, and ProtecTIER with native replication is no exception. IBM ProtecTIER is already deployed across hundreds of installations in demanding verticals like healthcare, financial services and telecommunications. ProtecTIER's in-line, single system throughput and petabyte-level scalability make it highly attractive to these customers. Features like clustering and global repositories further demonstrate IBM's commitment to developing the ProtecTIER line.

By adding native replication to the ProtecTIER mix, IBM makes a compelling case for small enterprise and mid-market adoption as well. These price-conscious customers can be leery of what they perceive as an initially higher cost. However, IBM's ROI proposition is a compelling one with a strong dollar-to-value message. For all of these reasons, we expect to see IBM continue to increase its large customer account base and make serious inroads into the downstream data protection market.

NOTICE: The information and product recommendations made by the TANEJA GROUP are based upon public information and sources and may also include personal opinions both of the TANEJA GROUP and others, all of which we believe to be accurate and reliable. However, as market conditions change and are not within our control, the information and recommendations are made without warranty of any kind. All product names used and mentioned herein are the trademarks of their respective owners. The TANEJA GROUP, Inc. assumes no responsibility or liability for any damages whatsoever (including incidental, consequential or otherwise), caused by your use of, or reliance upon, the information and recommendations presented herein, nor for any inadvertent errors which may appear in this document.

TSL03007-USEN-00