Drp For Menora


Published in: Business, Technology
  • Drp For Menora

    1. 1. DRP BCP for Menora Pini Cohen EVP & Senior Analyst Architecture & Infrastructure Strategies [email_address]
    2. 2. Market Status and Trends DRP BCP <ul><li>The Second Lebanon War has raised the BCPDRP issue. Organizations realize that all of Israel might be under missile threat </li></ul><ul><li>This has two implications: </li></ul><ul><ul><li>Better shelters are being considered for data centers </li></ul></ul><ul><ul><li>Out-of-Israel BCPDRP solutions are being considered </li></ul></ul><ul><li>DRP is increasingly tied to regulation. Israel Standard for Risk Management #5300 is starting to affect the DRPBCP market. We expect more vertical regulations to follow </li></ul><ul><li>Interesting project – EDS at FIBI with HDS – DRP outside Israel with 10MB/SEC line and journaling </li></ul>
    3. 3. Agenda <ul><li>Current BCPDRP situation </li></ul><ul><li>Definitions </li></ul><ul><li>Categories </li></ul><ul><li>Pitfalls </li></ul><ul><li>Justification </li></ul><ul><li>BCPDRP service levels </li></ul><ul><li>Related Technologies </li></ul><ul><li>DRP related technologies </li></ul>
    4. 4. Gaps in Best Practices <ul><li><40% (less than!) of BC plans address resumption </li></ul><ul><ul><li>Moving from backup site to primary site </li></ul></ul><ul><li><50% (less than!) of BC plans address </li></ul><ul><ul><li>Personnel transportation </li></ul></ul><ul><ul><li>Telecommunication and network outages </li></ul></ul><ul><li>Only 66% of organizations regularly validate data backups </li></ul><ul><li>Only 65% of BC plans include policies on speaking to the media </li></ul><ul><li>One-third of plans do not include recovery “sequences” for business functions </li></ul><ul><li>75% of recovery configurations are “out of sync” with their production configurations </li></ul><ul><li>Significant lack of senior-executive participation in planning </li></ul><ul><li>Only 25% of organizations include BC training for all key personnel </li></ul>
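    5. 5. DRBCP Facts As datacenters consume more electricity, the chance of fire increases. Primary Reason for DR/BC: <ul><li>Regulatory Compliance: 33% </li></ul><ul><li>Stakeholder Protection: 34% </li></ul><ul><li>Past Business Interruption: 16% </li></ul><ul><li>Public Relations: 7% </li></ul><ul><li>Other: 10% </li></ul>Causes of Data Loss: <ul><li>Hardware System: 44% </li></ul><ul><li>Human Error: 32% </li></ul><ul><li>Software: 14% </li></ul><ul><li>Virus: 7% </li></ul><ul><li>Natural Disaster: 3% </li></ul>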
    6. 6. Organizations Without Documented Plans Source: Cutter Emergency Preparedness: The Need for Enterprise-Wide Efforts
    7. 7. What do you consider to be your organization’s biggest threat with regard to being able to continue business? Source: Cutter Emergency Preparedness: The Need for Enterprise-Wide Efforts
    8. 8. Approximately what percentage of your organization’s total IT budget is dedicated to emergency preparedness, business continuity, and disaster recovery? Source: Cutter Emergency Preparedness: The Need for Enterprise-Wide Efforts
    9. 9. Agenda <ul><li>Current BCPDRP situation </li></ul><ul><li>Definitions </li></ul><ul><li>Categories </li></ul><ul><li>Pitfalls </li></ul><ul><li>Justification </li></ul><ul><li>BCPDRP service levels </li></ul><ul><li>DRP related technologies </li></ul>
    10. 10. Definitions <ul><li>BCM (Business Continuity Management): the development, maintenance and implementation of strategies, plans and actions to ensure the continued availability of critical business processes and services </li></ul><ul><li>Disaster Recovery Planning (DRP) is a subset of Business Continuity Planning (BCP) that focuses on the processes, plans, and procedures to recover an organization’s technology infrastructure to support critical business processes and functions within a predetermined time after a disaster or unplanned disruption in operations. </li></ul>Source: www.tenders.tas.gov.au/domino/dtf/dtf.nsf/0/1ed10d1fa4fcdb39ca256f1d00060555/$FILE/BCM%20Seminar%20220306.ppt
    11. 11. Example of BCM activities considerations <ul><li>Evacuation </li></ul><ul><li>Setting up an information centre, to register employees and make an inventory of missing or wounded people </li></ul><ul><li>Care for employees, families and victims, and the community </li></ul><ul><li>Setting up communication and IT networks </li></ul><ul><li>Creating alternative office space </li></ul><ul><li>Relocation - Transportation & hotels near the alternative office space </li></ul><ul><li>Managing / Recovering day to day business </li></ul><ul><li>Security (physical security) </li></ul><ul><li>Reputation Management </li></ul>Source: www.tenders.tas.gov.au/domino/dtf/dtf.nsf/0/1ed10d1fa4fcdb39ca256f1d00060555/$FILE/BCM%20Seminar%20220306.ppt with STKI modifications
    12. 12. The Ongoing Trend: Eliminating Data Loss on Recovery <ul><li>IT must deliver current-state recovery capability by application or LOB </li></ul><ul><li>For each application, business leaders have traditionally defined: </li></ul><ul><ul><li>RTOs: Recovery time objectives </li></ul></ul><ul><ul><li>RPOs: Recovery point objectives </li></ul></ul><ul><ul><li>LBO: Lost business opportunity </li></ul></ul><ul><ul><li>Required recovery classes </li></ul></ul><ul><ul><li>Financial impact of outages (e.g., hour, day, week, month) </li></ul></ul><ul><li>IT should provide a categorization framework for the evaluation </li></ul>[Figure: crisis classification timeline. Labels: Recovery Point, Failure Point, Restart Point, Lost Data (RPO), RTO, Lost Business Opportunity, Time]
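The RPO and RTO defined on this slide can be checked against a real or simulated outage. The sketch below is a minimal illustration, not part of the original deck: the function names and the sample timestamps are hypothetical, and it assumes the usual reading in which lost data is measured from the recovery point to the failure point and downtime from the failure point to the restart point.

```python
from datetime import datetime, timedelta


def recovery_metrics(recovery_point, failure_point, restart_point):
    """Return (data_loss_window, downtime) for one outage.

    data_loss_window: time between the last recoverable copy and the failure.
    downtime: time between the failure and the restart of service.
    """
    if not recovery_point <= failure_point <= restart_point:
        raise ValueError("expected recovery_point <= failure_point <= restart_point")
    return failure_point - recovery_point, restart_point - failure_point


def meets_objectives(recovery_point, failure_point, restart_point, rpo, rto):
    """True if the outage stayed within the RPO and RTO targets."""
    data_loss, downtime = recovery_metrics(recovery_point, failure_point, restart_point)
    return data_loss <= rpo and downtime <= rto


# Hypothetical outage: last replicated copy at 09:50, failure at 10:00,
# service restarted at 13:00 on the same day.
last_copy = datetime(2008, 1, 1, 9, 50)
failure = datetime(2008, 1, 1, 10, 0)
restart = datetime(2008, 1, 1, 13, 0)

data_loss, downtime = recovery_metrics(last_copy, failure, restart)
print("data loss window:", data_loss)
print("downtime:", downtime)
```

Running the same check per application or line of business gives the kind of measurable evaluation the slide asks IT to provide.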
    13. 13. Market status and Trends <ul><li>IBM BCRS selected clients: Menora, Dexia, Ayalon, Mekorot, Novolog, Yashir Insurance, New Kopel, Saifan, Poalim Sahar, Given Imaging, Postil, Danel, Psagot, Albat, Harel Insurance, IEC, Amidar, Technion, Keren Makafet, Tambor, Mehish, Malam IDBI </li></ul><ul><li>Med1 – Hosting selected clients: Teva, Leumit Health Services, Asuta, Migdal, Brom, Broadcom, TI, Tadiran Kesher, Clal Insurance, Cafronet </li></ul><ul><li>Med1 – DRP selected clients: Leumi Card, Bank Israel, YES, ZIM, AudioCodes, Partner (transmission pop), Meshek and Calcala Company, Netivey Gas, Smarteam, AIG, Shva, Bituach Haklai </li></ul><ul><li>HP hosting (DRP – selected clients): Tel Aviv Stock Exchange, Oradio, BNP Bank, Mashbir Machsanei Ofna, Bank Yahav, Elihu Insurance, HSBC Bank, Mashbir Haklai </li></ul>All Rights Reserved @STKI Moshav Bnei Zion, Israel +972 9 74 444 74 www.stki.info
    14. 14. BCM - A Viewpoint <ul><li>BEING PROPERLY PREPARED IS A COMPLEX SCIENCE </li></ul>Marsh www.tenders.tas.gov.au/domino/dtf/dtf.nsf/0/1ed10d1fa4fcdb39ca256f1d00060555/$FILE/BCM%20Seminar%20220306.ppt
    15. 15. Agenda <ul><li>Current BCPDRP situation </li></ul><ul><li>Definitions </li></ul><ul><li>Pitfalls </li></ul><ul><li>Justification </li></ul><ul><li>BCPDRP service levels </li></ul><ul><li>DRP related technologies </li></ul>
    16. 16. Pitfalls and mistakes – BCP BCM <ul><li>IT is in charge </li></ul><ul><li>BCP vs. DRP - Lack of integration with DR (e.g., data center plans not providing voice/data network, systems, and personnel support for work-area recovery) </li></ul><ul><li>Lack of education and awareness of explicit roles and responsibilities </li></ul><ul><li>Lack of maintenance/updates </li></ul><ul><li>Business executives believe backed-up information can be recovered quickly </li></ul><ul><li>Users test the ability to recover systems separately but do not check consistency of systems and data </li></ul>
    17. 17. Agenda <ul><li>Current BCPDRP situation </li></ul><ul><li>Definitions </li></ul><ul><li>Pitfalls </li></ul><ul><li>Justification </li></ul><ul><li>BCPDRP service levels </li></ul><ul><li>DRP related technologies </li></ul>
    18. 18. Business Case for BCP & DRP <ul><li>What is the impact to the company if an unplanned outage caused the company to delay delivering a critical product release to a customer? </li></ul><ul><li>What is the impact to the company if an unplanned outage left the company unable to provide Customer Care support? </li></ul><ul><li>What is the impact to the company if an unplanned outage caused the company to delay processing Cash Receipts? </li></ul><ul><li>What is the impact to the company if an unplanned outage caused the company to delay processing and printing Customer Bills? </li></ul><ul><li>What is the impact to the company if an unplanned outage caused the company to delay Financial Reporting? </li></ul><ul><li>What is the impact to the company if an unplanned outage caused the company to delay Payroll for employees? </li></ul>
    19. 19. Agenda <ul><li>Current BCPDRP situation </li></ul><ul><li>Definitions </li></ul><ul><li>Pitfalls </li></ul><ul><li>Justification </li></ul><ul><li>BCPDRP service levels </li></ul><ul><li>DRP related technologies </li></ul>
    20. 20. Recovery Objectives and Costs <ul><li>Mission-critical: Platinum </li></ul><ul><ul><li>RTO & RPO: Zero </li></ul></ul><ul><ul><li>Cost: 6x-8x, can be as high as 20x </li></ul></ul><ul><li>Business-critical: Gold </li></ul><ul><ul><li>RTO: <8 hours </li></ul></ul><ul><ul><li>RPO: <15 minutes </li></ul></ul><ul><ul><li>Cost: 4x-5x </li></ul></ul><ul><li>Basic: Silver </li></ul><ul><ul><li>RTO: <72 hours for critical </li></ul></ul><ul><ul><li>RPO: <24 hours </li></ul></ul><ul><ul><li>Cost: 1.6x </li></ul></ul><ul><li>Standalone: Bronze </li></ul><ul><ul><li>Best-effort recovery </li></ul></ul><ul><ul><li>RTO: >1 week </li></ul></ul><ul><ul><li>RPO: 1 week </li></ul></ul><ul><ul><li>Cost: x </li></ul></ul>Establish a realistic and measurable framework. What’s in a “Nine”? Downtime per year by availability: <ul><li>99.999% = 5 min </li></ul><ul><li>99.99% = 53 min </li></ul><ul><li>99.9% = 8.8 hr </li></ul><ul><li>99.0% = 3.6 days </li></ul>
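The “What’s in a Nine?” figures follow from simple arithmetic: the allowed downtime is the unavailable fraction of a year. The sketch below is an illustration rather than anything from the deck; it assumes a 365-day year and a hypothetical function name. The arithmetic gives roughly 5.3 minutes for 99.999%, 52.6 minutes for 99.99%, 8.8 hours for 99.9%, and 3.65 days for 99%.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes, ignoring leap years


def downtime_minutes_per_year(availability_percent):
    """Allowed downtime per year, in minutes, for a given availability."""
    if not 0 <= availability_percent <= 100:
        raise ValueError("availability must be between 0 and 100 percent")
    return (100 - availability_percent) / 100 * MINUTES_PER_YEAR


for pct in (99.999, 99.99, 99.9, 99.0):
    minutes = downtime_minutes_per_year(pct)
    print(f"{pct}% -> {minutes:.1f} min/year "
          f"({minutes / 60:.1f} h, {minutes / 1440:.2f} days)")
```

Each additional nine cuts the downtime budget by a factor of ten, which is one way to read the steep cost multipliers between the Silver, Gold, and Platinum tiers.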
    21. 21. BC & DRP Service Level Model
    22. 22. BC & DRP Service Level Model continued
    23. 23. Agenda <ul><li>Current BCPDRP situation </li></ul><ul><li>Definitions </li></ul><ul><li>Pitfalls </li></ul><ul><li>Justification </li></ul><ul><li>BCPDRP service levels </li></ul><ul><li>DRP related technologies </li></ul>
    24. 24. Related Technologies <ul><li>Related Technologies (DRP, High Availability, Backup) </li></ul><ul><ul><li>Clustering </li></ul></ul><ul><ul><li>Replication – Synchronous, Semi-Synchronous, Asynchronous </li></ul></ul><ul><ul><li>Oracle RAC – Real Application Clusters </li></ul></ul><ul><ul><li>Geo-Clustering </li></ul></ul><ul><ul><li>Boot from SAN </li></ul></ul><ul><ul><li>CDP – Continuous Data Protection </li></ul></ul><ul><ul><li>Server Virtualization and P2V </li></ul></ul><ul><ul><li>Backup to disk – VTL – De-Duplication </li></ul></ul><ul><li>Interesting new concept by Israeli Startup – Continuity Software </li></ul>
    25. 25. Site-Level Clustering <ul><li>Addresses unplanned downtime (not planned) </li></ul><ul><li>Campus environment (<5 miles) </li></ul><ul><li>RTO, RPO: minutes </li></ul><ul><li>Potential failover performance impact </li></ul><ul><li>DR issue with single database </li></ul><ul><li>Complex implementation and scripts </li></ul>Example solutions: HP MC Service Guard, IBM HACMP, Sun (True) Cluster, Microsoft Cluster, Veritas Cluster, Oracle RAC, Sybase HA, Parallel Sysplex [Figure: Clustering Options. Mutual failover: the Database A node is failover for B and the Database B node is failover for A, over a cluster shareable disk group. N + 1 failover: master nodes for Databases A, B, and C share one node that is failover for A, B, and C, over a cluster shareable disk group]
    26. 26. Remote Logging <ul><li>Capture log data and transmit to remote site </li></ul><ul><ul><li>Pros </li></ul></ul><ul><ul><ul><li>Minimal database data loss at remote site (RPO) </li></ul></ul></ul><ul><ul><ul><li>Can be synchronized with other applications </li></ul></ul></ul><ul><ul><li>Cons </li></ul></ul><ul><ul><ul><li>Complex and long recovery times at remote site (RTO) </li></ul></ul></ul><ul><ul><ul><li>Complex to implement and maintain </li></ul></ul></ul><ul><ul><ul><li>Increased processing at both sites </li></ul></ul></ul><ul><ul><ul><li>No provision for non-database data </li></ul></ul></ul>Logging solutions are complex and time consuming. [Figure: Log Propagation. Labels: IMS, DB2, Oracle, CICS logs; Logs Applied to Database]
    27. 27. Synchronous, Storage-Based, Data Replication <ul><li>Data writes committed at both locations </li></ul><ul><li>Sites <50 miles apart </li></ul><ul><li>Database/application restart required </li></ul><ul><li>RTO = <4 hours, RPO = last committed transaction </li></ul><ul><li>Leveraged infrastructure and no quiesce required </li></ul><ul><li>Vendor-specific solutions </li></ul><ul><li>Many times the applications are started manually </li></ul>Examples: EMC SRDF, IBM PPRC, HDS HRC, Compaq DRM, HP Continuous Access XP [Figure: Synchronous Replication. Labels: Controller, Send, Less Than 50 Miles, Wait Until Complete]
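The difference between synchronous and asynchronous replication comes down to when a write is acknowledged. The following toy model is a sketch under stated assumptions: the class names are hypothetical, there is no real network or storage controller, and it does not represent any of the vendor products listed on the slide.

```python
class Site:
    """A storage site holding blocks by address."""

    def __init__(self):
        self.blocks = {}

    def store(self, address, data):
        self.blocks[address] = data


class SyncReplicator:
    """Acknowledge a write only after both sites have stored it."""

    def __init__(self, local, remote):
        self.local, self.remote = local, remote

    def write(self, address, data):
        self.local.store(address, data)
        self.remote.store(address, data)  # wait until the remote copy completes
        return "committed"


class AsyncReplicator:
    """Acknowledge after the local write; send to the remote site later."""

    def __init__(self, local, remote):
        self.local, self.remote = local, remote
        self.pending = []  # writes not yet sent to the remote site

    def write(self, address, data):
        self.local.store(address, data)
        self.pending.append((address, data))
        return "committed"

    def drain(self):
        """Send all queued writes to the remote site, oldest first."""
        while self.pending:
            address, data = self.pending.pop(0)
            self.remote.store(address, data)


sync = SyncReplicator(Site(), Site())
sync.write(0, b"txn-1")
sync.write(1, b"txn-2")

lagging = AsyncReplicator(Site(), Site())
lagging.write(0, b"txn-1")
lagging.write(1, b"txn-2")

print("sync remote blocks:", len(sync.remote.blocks))
print("async remote blocks before drain:", len(lagging.remote.blocks))
```

If the primary site is lost before `drain()` runs, the asynchronous remote copy is missing the queued writes, whereas the synchronous copy holds every committed transaction. The price is that each synchronous write waits on the remote site, which is why distance is limited.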
    28. 28. Active Active – Oracle RAC
    29. 29. Geographic cluster [Figure: Veritas Cluster Server (VCS) geographic cluster. Labels: Primary Site Node A, Primary Site Node B, Secondary Site Node C, Hardware, Shared Storage, VCS, DR Agent, Local Cluster Heartbeat, Global Cluster Heartbeat, Hardware-Based Data Replication] Source: eval.symantec.com/mktginfo/enterprise/other_resources/ent-vcs_for_vmware_11-2006.en-us.ppt with modifications
    30. 30. Continuous data protection (CDP) <ul><ul><li>“Any PIT” (point-in-time) recovery </li></ul></ul><ul><li>Some CDP flavors: </li></ul><ul><li>Replicate all writes (complete copy of server) </li></ul><ul><li>Replicate only application-related writes, for easier “bare metal restore” </li></ul><ul><li>Considerations </li></ul><ul><ul><li>Introduces a new process into your backup strategy </li></ul></ul><ul><ul><li>Typically requires as much as 1.5x your current capacity </li></ul></ul>[Figure labels: Server, Primary storage, Secondary storage, CDP appliance/server, Control data, Data, Backup/copy data] Source: www.forrester.com/Events/Content/0,5180,-1277,00.ppt with modification
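“Any PIT” recovery can be pictured as replaying a journal of timestamped writes up to a chosen moment. The sketch below is a simplified illustration with hypothetical names, not a description of any CDP product: it keeps every write in time order and rebuilds state for any requested point in time.

```python
class CdpJournal:
    """Journal of every write, kept in time order, for point-in-time restore."""

    def __init__(self):
        self.entries = []  # (timestamp, key, value)

    def record(self, timestamp, key, value):
        if self.entries and timestamp < self.entries[-1][0]:
            raise ValueError("writes must be recorded in time order")
        self.entries.append((timestamp, key, value))

    def restore(self, point_in_time):
        """Rebuild the state as it was at point_in_time (inclusive)."""
        state = {}
        for timestamp, key, value in self.entries:
            if timestamp > point_in_time:
                break
            state[key] = value
        return state


journal = CdpJournal()
journal.record(100, "report.doc", "draft 1")
journal.record(200, "report.doc", "draft 2")
journal.record(300, "report.doc", "corrupted")

# Roll back to just before the bad write at t=300.
print(journal.restore(299))
```

Because every write is retained rather than only the latest state, the journal grows with the change rate, which is consistent with the extra capacity the slide lists as a consideration.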
    31. 31. Many to one CDP solutions + “bare metal” restore [Figure: production (source) servers replicate to a high availability (target) server and an off-site disaster recovery (DR) server, over a direct offsite connection or a two-stage connection through the local HA server. Other labels: Optional Centralized Tape Backup, Server Consolidation, Data Migration, Snapshot] Source: www.synegi.com/docs/nsi-synegi-presentation.ppt
    32. 32. Virtualization and P2V recovery scenario – Phase 1: Initial P2V <ul><li>Your servers are: </li></ul><ul><ul><li>Protected </li></ul></ul><ul><ul><li>Ready for patch / upgrade tests </li></ul></ul>
    33. 33. How De-Duplication Works: compares latest backup data to stored data <ul><li>Eliminates duplicate data within Exchange, Oracle® and Microsoft® SQL databases </li></ul><ul><li>Eliminates storage of unchanged portions of files </li></ul><ul><li>Eliminates storage of duplicate files (single instance) </li></ul>[Figure labels: Current Data; Previously Stored Data; New Data Stored: none, only pointers to previous data] Source (with STKI modification): Sepaton http://www.go-dpa.com/media/TSM_Webinar.ppt
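The “only pointers to previous data” idea can be illustrated with a small content-addressed store. This is a minimal sketch, assuming fixed-size chunks identified by SHA-256 digests; the class name and chunk size are hypothetical, and commercial products such as the one cited use their own, more elaborate comparison methods.

```python
import hashlib


class DedupStore:
    """Stores each distinct chunk once; backups are lists of chunk digests."""

    def __init__(self, chunk_size=4):
        self.chunk_size = chunk_size
        self.chunks = {}  # digest -> chunk bytes

    def backup(self, data):
        """Store data; return (pointers, number of new bytes actually stored)."""
        pointers, new_bytes = [], 0
        for start in range(0, len(data), self.chunk_size):
            chunk = data[start:start + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            if digest not in self.chunks:
                self.chunks[digest] = chunk  # new data: store it
                new_bytes += len(chunk)
            pointers.append(digest)          # known data: pointer only
        return pointers, new_bytes

    def restore(self, pointers):
        """Rebuild the original bytes from a list of chunk digests."""
        return b"".join(self.chunks[digest] for digest in pointers)


store = DedupStore(chunk_size=4)
monday, stored_monday = store.backup(b"AAAABBBBCCCC")
tuesday, stored_tuesday = store.backup(b"AAAABBBBDDDD")  # one chunk changed
print("bytes stored Monday:", stored_monday)
print("bytes stored Tuesday:", stored_tuesday)
```

A second backup of unchanged data stores no new bytes at all, only pointers, which corresponds to the “New Data Stored: none” case in the figure.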
    34. 34. VDI – Overview <ul><li>Local users are very excited about this technology </li></ul><ul><li>Selected VMware VDI installations: Siemens </li></ul>[Figure labels: Thin Client or PC; Virtual Infrastructure 3; Server]
    35. 35. Market status and Trends <ul><li>Microsoft Application Virtualization (formerly SoftGrid) allows applications to be deployed in real time to any client from a virtual application server </li></ul><ul><li>It removes the need for local installation of the applications. </li></ul><ul><li>SoftGrid applications are also sandboxed from each other. </li></ul><ul><li>SoftGrid selected clients: Education (Bar Ilan, Afek College, Hebrew University), IDF </li></ul>Source: http://www.virtuall.nl/articles/ProductInformation/PQR%20- %20Reality%20of%20Virtualization%20Solutions%20v1.1.ppt
    36. 36. Summary <ul><li>The basic issues (and pitfalls) of DRPBCP are here to stay (management, roles, processes) </li></ul><ul><li>New technologies enable better options </li></ul><ul><li>Some of these technologies need to mature </li></ul><ul><li>The essence of DRP is process, not technology </li></ul>
    37. 37. Thank you Pini Cohen EVP & Senior Analyst Architecture & Infrastructure Strategies [email_address]