1 
The Modern Data Center 
Topology: 
The High Availability Mantra
2 
GreenField Software 
• Company 
– GreenField Software is a privately held, early-stage Indian (Kolkata-based) software company looking to be a globally recognized player in Cloud-based Intelligent Infrastructure Management
• Mission 
– GFS delivers pioneering Cloud-based Intelligent Infrastructure Management 
solutions to improve operational and energy efficiencies, safety and 
environmental conditions of facilities with critical infrastructure. 
• Vision 
– Our Cloud-based Intelligent Infrastructure Management solutions help our 
customers to 
• Optimize capex, reduce operating costs and mitigate risks of critical infrastructure 
failures 
• Improve Sustainability through improved energy management and safety of their 
employees and other stakeholders using the facilities
3 
Partners & Customers 
Oil & Gas
Media House 
Telecom 
Higher Education 
Financial Services 
Power Utility
4 
Today’s Topics 
• The Modern Data Center Overview 
• The High Availability (HA) Mantra 
• Operating Challenges 
• A Solution
5 
Modern Data Center 
Overview
6 
Multiple Classes of Data Centers 
• Internet Data Center
– Used by external clients connecting from the Internet
– Supports servers and devices required for B2C transaction-based applications (e-commerce)
• Extranet Data Center
– Provides support and services for external B2B partner transactions
– Accessed over secure VPN connections or private WAN links between the partner network and the enterprise extranet
• Intranet Data Center
– Hosts applications and services mostly accessed by internal employees with connectivity to the internal enterprise network
• Special Purpose Data Center
– For specialized application areas such as Geological & Geophysical for the Oil & Gas industry
May or may not be inter-connected
7 
Common Objective: Business Continuity 
• Disaster Recovery Data Center
– Each class may have a dedicated or shared DR center
– Usually located separately from the primary data center
• High Availability (HA) Data Center
– Each data center is provisioned with significant redundancies
– The DR center comes into play only when a disaster strikes
– Component or system failures within any DC should either be self-healing or be covered by redundancies within the DC
• Insurance Against Power & Network Outages
– Reliability through multiple service providers
– Internal back-ups
• Securing the Data Center
– Against malicious hacking that can bring down the data center and impact business continuity
– Implementing firewalls and virtual firewalls
8 
Common Complexity: Multitude of Assets 
• Divided between two worlds: IT & Facilities
• Includes Mission Critical Applications
• Like a manufacturing operation
• Raw Material: Power & Networks
• Processing: Data
• Output: Information Service
• Needs: Asset Management, Resource Optimization, a la Manufacturing
9 
The High Availability 
Mantra
10 
Today’s High Availability Data Center 
Extreme redundancies for 99.99% uptime -> Higher power consumption
Huge population of N+1/N+2 equipment -> Asset underutilization; too complex to manage with spreadsheets & Visio tools
Chain of inter-dependent equipment -> Multiple points of failure
kW per rack increases as more processing capacity is added -> Trade-offs: need to support more kW per rack versus extra space & heat loads
Growing heat loads, carbon emissions & e-waste -> Sustainability issues
High Availability is Inversely Proportional to Asset Utilization & Energy Efficiency
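The trade-off stated above can be made concrete with a little arithmetic. The sketch below is illustrative only: it assumes identical units that fail independently (a simplification that real facilities rarely satisfy), and the function names and example figures are hypothetical rather than taken from this deck.

```python
from math import prod


def series_availability(availabilities):
    """Availability of a chain where every element must be up at once."""
    return prod(availabilities)


def parallel_availability(unit_availability, units):
    """Availability of `units` redundant copies where any one suffices."""
    return 1.0 - (1.0 - unit_availability) ** units


def usable_fraction(n_required, n_spare):
    """Share of installed capacity usable at full design load under N+k."""
    return n_required / (n_required + n_spare)


# Hypothetical figures, for illustration only.
chain = [0.999, 0.999, 0.999]            # three inter-dependent devices in series
print(series_availability(chain))        # lower than any single element
print(parallel_availability(0.99, 2))    # a redundant pair raises availability
print(usable_fraction(4, 1))             # N+1 with N = 4
print(usable_fraction(4, 4))             # 2N with N = 4
```

Under these assumptions, each added spare raises availability but lowers the share of installed (and powered) capacity that does useful work, which is the inverse relationship the slide describes.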
11 
When HA fails - Tale of Two Disasters 
RBS
Tech fault at RBS and Natwest freezes millions of UK bank balances
RBS and Natwest have failed to register inbound payments for up to three days, customers have reported, leaving people unable to pay for bills, travel and even food. The banks - both owned by RBS Group - have confirmed that technical glitches have left bank accounts displaying the wrong balances and certain services unavailable. There is no fix date available.
Amazon
Amazon cloud outage takes down Netflix, Instagram, Pinterest, & more
With the critical Amazon outage, which is the second this month, we wouldn’t be surprised if these popular services started looking at other options, including Rackspace, SoftLayer, Microsoft’s Azure, and Google’s just-introduced Compute Engine. Some of Amazon’s biggest EC2 outages occurred in April and August of last year.
Which Will Be The Next One?
12 
What’s the High Availability Mantra? 
Availability %               Downtime per year   Downtime per month*   Downtime per week
99% ("two nines")            3.65 days           7.20 hours            1.68 hours
99.5%                        1.83 days           3.60 hours            50.4 minutes
99.8%                        17.52 hours         86.23 minutes         20.16 minutes
99.9% ("three nines")        8.76 hours          43.8 minutes          10.1 minutes
99.95%                       4.38 hours          21.56 minutes         5.04 minutes
99.99% ("four nines")        52.56 minutes       4.32 minutes          1.01 minutes
99.999% ("five nines")       5.26 minutes        25.9 seconds          6.05 seconds
99.9999% ("six nines")       31.5 seconds        2.59 seconds          0.605 seconds
99.99999% ("seven nines")    3.15 seconds        0.259 seconds         0.0605 seconds
Amazon data centers (built to Tier 4 standards, with an expected availability of 99.995%) have had two outages already in 2012 – each over 3 hours!
• Tier 3/Tier 4 are defined only by hardware redundancies
• Glaring gaps in operating procedures to prevent fatal human errors 
• Lack of purpose-built BCP software to predict failures 
• Lack of chain of custody to detect root cause
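The downtime figures in the availability table on this slide follow from simple arithmetic: allowed downtime is the unavailable fraction multiplied by the length of the period. The sketch below is a minimal illustration, assuming a 365-day year, a 30-day month and a 7-day week; the slide does not state its own month convention, so small differences from the per-month column are possible. The function name is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def downtime_seconds(availability_percent, period_days):
    """Allowed downtime, in seconds, over a period of `period_days` days."""
    unavailable_fraction = 1.0 - availability_percent / 100.0
    return unavailable_fraction * period_days * SECONDS_PER_DAY


for pct in (99.0, 99.9, 99.99, 99.995, 99.999):
    per_year_min = downtime_seconds(pct, 365) / 60
    per_month_min = downtime_seconds(pct, 30) / 60   # assumes a 30-day month
    per_week_min = downtime_seconds(pct, 7) / 60
    print(f"{pct}%: {per_year_min:.2f} min/year, "
          f"{per_month_min:.2f} min/month, {per_week_min:.2f} min/week")
```

Under these assumptions, 99.995% availability allows roughly 26 minutes of downtime per year, so a single outage of more than 3 hours consumes several years' worth of that budget.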
13 
Delivering the High Availability Promise 
Adequate Redundancies 
• Are there any points of failure, besides power and external networks, that can impact uptime? (Not everything is N+1)
• What are my redundancy paths? 
• Are the relationships & dependencies among critical assets clearly defined? 
• Can I do an impact analysis on the outage/downtime of any equipment? Can I predict 
the cascading effect of such an outage on other assets/applications in the data center? 
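One way to approach the impact-analysis questions above is to record, for each asset, the upstream assets that can supply it, and then propagate a failure through that map. The sketch below is a simplified illustration, not a description of any particular product: it assumes a single "any one source is enough" dependency type, and the asset names and the `cascade` function are hypothetical.

```python
def cascade(sources, failed):
    """Return every asset that ends up down once the `failed` assets go down.

    `sources` maps an asset to the upstream assets that can supply it.
    An asset goes down only when all of its listed sources are down, so
    an asset with two independent feeds survives the loss of one of them.
    """
    down = set(failed)
    changed = True
    while changed:
        changed = False
        for asset, upstream in sources.items():
            if asset not in down and upstream and all(u in down for u in upstream):
                down.add(asset)
                changed = True
    return down


# Hypothetical topology, for illustration only.
sources = {
    "pdu-a": ["ups-a"],
    "pdu-b": ["ups-b"],
    "rack-1": ["pdu-a", "pdu-b"],   # dual-corded: two redundancy paths
    "rack-2": ["pdu-a"],            # single-corded: no redundancy
    "web-app": ["rack-1"],
    "billing-app": ["rack-2"],
}

print(sorted(cascade(sources, {"ups-a"})))
```

In this example the loss of `ups-a` takes down `pdu-a`, `rack-2` and `billing-app`, while `rack-1` and `web-app` stay up through the second path. The same map also exposes single points of failure: any asset whose source list has only one entry.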
Preventing Failures 
• Can any failure be predicted so that proactive measures can be taken? Do I get alerts on threshold breaches so that I can take preventive action before a failure happens?
• Is there a history of a Move-Add-Change (MAC) that I should be aware of? 
• What is the impact of a MAC on space, power, cooling? 
• Where can new devices/servers best be placed (Floor -> Rack -> Cage)? How can this be determined from the current infrastructure and other dependencies to avoid a failure?
• How do I prevent a fatal human error?
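The threshold-breach alerts mentioned above can be sketched as a simple comparison of each reading against a warning and a critical limit. This is a minimal illustration under stated assumptions: the sensor names, the limits and the `Threshold`, `classify` and `alerts` names are all hypothetical, and real monitoring would add trends, hysteresis and notification routing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    warning: float
    critical: float


def classify(reading, threshold):
    """Map a sensor reading to 'ok', 'warning', or 'critical'."""
    if reading >= threshold.critical:
        return "critical"
    if reading >= threshold.warning:
        return "warning"
    return "ok"


def alerts(readings, thresholds):
    """Return (sensor, reading, level) for every reading that is not 'ok'."""
    found = []
    for sensor, value in readings.items():
        level = classify(value, thresholds[sensor])
        if level != "ok":
            found.append((sensor, value, level))
    return found


# Hypothetical sensors and limits, for illustration only.
thresholds = {
    "rack-1-inlet-temp-c": Threshold(warning=27.0, critical=32.0),
    "pdu-a-load-percent": Threshold(warning=70.0, critical=80.0),
}
readings = {"rack-1-inlet-temp-c": 28.5, "pdu-a-load-percent": 64.0}

print(alerts(readings, thresholds))
```

A warning level set below the critical limit is what gives an operator time to act before a failure, which is the point of the question on this slide.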
14 
Operating Challenges
15 
The High Availability Challenge 
Asset Over-Provisioning | Lack of HA Management Tool
• IT assets tracked by Systems Management Tool
• Facilities assets tracked by BMS
• The two are not inter-operable: unable to determine the missing link for HA
• Unable to track redundancy paths
• HA fails if any equipment or software in the critical path fails
• HA fails if there’s a fatal human error
• Health and history of equipment, or previous MAC impact, not tracked
• Too many assets; two classes of assets
• Absence of Software Portfolio (even if hardware assets are tracked)
• Move-Add-Change: decisions not based on simulations or analysis
• Absence of change management
• Absence of workflow approvals
• Unable to predict failures
• No chain of custody
Need to Predict Failures
16 
Beyond HA: Infrastructure & Operational Challenges 
Energy Problems | Operational Problems
• Low-level asset tracking
• Underutilization of many computing resources
• Running of old, inefficient equipment
• Decisions not based on analysis
• Cooling not optimized
• Floor & Rack Space: non-optimal placement of equipment
• Increasing demand for rack space
• Absence of capacity planning
• Higher power consumption & growing power bills
• Not monitoring power use at device level
• Dissipation of enormous heat
• Creation of hot spots
• Drastic reduction in expected life of computing equipment
• Failure of a data center
• Increase in CO2 emissions
17 
A Solution
18 
Solution That Bridges the Gap Between IT & Facilities 
IT System Performance Management
Building Management System
Data Center Infrastructure Management
Data Center Infrastructure Management (DCIM) Software
19 
Solution That Addresses The High Availability Challenge 
Asset Over-Provisioning | Lack of HA Management Tool
• IT assets tracked by Systems Management Tool
• Facilities assets tracked by BMS
• The two are not inter-operable: unable to determine the missing link for HA
• Unable to track redundancy paths
• HA fails if any equipment or software in the critical path fails
• HA fails if there’s a fatal human error
• Health and history of equipment, or previous MAC impact, not tracked
• Too many assets; two classes of assets
• Absence of Software Portfolio (even if hardware assets are tracked)
• Move-Add-Change: decisions not based on simulations or analysis
• Absence of change management
• Absence of workflow approvals
• Unable to predict failures
• No chain of custody
DCIM Helps to Predict Failures
20 
Solution That Addresses Infra & Operational Challenges 
Energy Problems | Operational Problems
• Low-level asset tracking
• Underutilization of many computing resources
• Running of old, inefficient equipment
• Decisions not based on analysis
• Cooling not optimized
• Floor & Rack Space: non-optimal placement of equipment
• Increasing demand for rack space
• Absence of capacity planning
• Higher power consumption & growing power bills
• Not monitoring power use at device level
• Dissipation of enormous heat
• Creation of hot spots
• Drastic reduction in expected life of computing equipment
• Failure of a data center
• Increase in CO2 emissions
DCIM Improves Energy & Operational Efficiencies
21 
Anatomy of DCIM Software: GFS Crane
22 
Thank You 
http://www.greenfieldsoft.com 
Email: sales@greenfieldsoft.com
23 
See also: 
Data Center Infrastructure Management: ERP for the Data Center Manager

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutions
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 

The Modern Data Center Topology

  • 1. The Modern Data Center Topology: The High Availability Mantra
  • 2. GreenField Software
      • Company
          – GreenField Software is a privately held, early-stage Indian (Kolkata-based) software company looking to be a globally recognized player in Cloud-based Intelligent Infrastructure Management.
      • Mission
          – GFS delivers pioneering Cloud-based Intelligent Infrastructure Management solutions to improve operational and energy efficiencies, safety and environmental conditions of facilities with critical infrastructure.
      • Vision
          – Our Cloud-based Intelligent Infrastructure Management solutions help our customers to:
              • Optimize capex, reduce operating costs and mitigate risks of critical infrastructure failures
              • Improve sustainability through improved energy management and safety of their employees and other stakeholders using the facilities
  • 3. Partners & Customers: Oil & Gas, Media House, Telecom, Higher Education, Financial Services, Power Utility
  • 4. Today’s Topics
      • The Modern Data Center Overview
      • The High Availability (HA) Mantra
      • Operating Challenges
      • A Solution
  • 5. Modern Data Center Overview
  • 6. Multiple Classes of Data Centers
      • Internet Data Center
          – Used by external clients connecting from the Internet
          – Supports servers and devices required for B2C transaction-based applications (e-commerce)
      • Extranet Data Center
          – Provides support and services for external B2B partner transactions
          – Accessed over secure VPN connections or private WAN links between the partner network and the enterprise extranet
      • Intranet Data Center
          – Hosts applications and services mostly accessed by internal employees with connectivity to the internal enterprise network
      • Special Purpose Data Center
          – For specialized application areas like Geological & Geophysical for the Oil & Gas industry
      May or may not be inter-connected
  • 7. Common Objective: Business Continuity
      • Disaster Recovery Data Center
          – Each class may have a dedicated or shared DR Center
          – Usually located separately from the Primary Data Center
      • High Availability (HA) Data Center
          – Each Data Center is provided with significant redundancies
          – The DR Center comes into play only when a disaster strikes
          – Component or system failures within any DC should either be self-healing or be covered by redundancies within the DC
      • Insurance Against Power & Network Outages
          – Reliability through multiple service providers
          – Internal back-ups
      • Securing the Data Center
          – Against malicious hacking that can bring down the Data Center, impacting business continuity
          – Implementing Firewalls/Virtual Firewalls
  • 8. Common Complexity: Multitude of Assets
      • Multitude of Assets
          – Divided between two worlds: IT & Facilities
          – Includes Mission Critical Applications
      • Like a manufacturing operation
          – Raw Material: Power & Networks
          – Processing: Data
          – Output: Information Service
      • Needs: Asset Management, Resource Optimization, a la Manufacturing
  • 9. The High Availability Mantra
  • 10. Today’s High Availability Data Center
      • Extreme redundancies for 99.99% uptime -> Higher power consumption
      • Huge population of N+1/N+2 equipment -> Asset underutilization; too complex to manage with spreadsheets & Visio tools
      • Chain of inter-dependent equipment -> Multiple points of failure
      • kW per rack increases as more processing capacity is added -> Trade-offs: need to support more per rack versus extra space & heat loads
      • Growing heat loads, carbon emissions & e-waste -> Sustainability issues
      High Availability is Inversely Proportional to Asset Utilization & Energy Efficiency
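The slide's two tensions, a chain of inter-dependent equipment creating multiple points of failure and N+1/N+2 redundancy leaving assets underused, can be illustrated with standard reliability arithmetic. The sketch below is not from the deck: it assumes component failures are independent, and the function names and figures are hypothetical.

```python
from math import prod


def series_availability(availabilities):
    """Availability of a chain in which every component must be up.

    Assumes independent failures, so the individual availabilities multiply.
    """
    return prod(availabilities)


def redundant_availability(unit_availability, units):
    """Availability of a group that stays up if at least one unit is up.

    Assumes independent failures: the group fails only if all units fail.
    """
    return 1 - (1 - unit_availability) ** units


def n_plus_k_utilization(n, k):
    """Best-case utilization when n units carry the load and k are spares."""
    return n / (n + k)


# Hypothetical figures, for illustration only.
chain = series_availability([0.999, 0.999, 0.999])   # three-link chain
pair = redundant_availability(0.99, 2)               # duplicated unit
ceiling = n_plus_k_utilization(4, 1)                 # N+1 with N = 4
```

Under these assumptions, three 99.9% components in a chain deliver less than 99.9% together, duplicating a 99% unit lifts it to roughly 99.99%, and an N+1 group with N = 4 can never exceed 80% utilization.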
  • 11. When HA fails - Tale of Two Disasters
      • RBS: "Tech fault at RBS and Natwest freezes millions of UK bank balances"
          – RBS and Natwest have failed to register inbound payments for up to three days, customers have reported, leaving people unable to pay for bills, travel and even food. The banks - both owned by RBS Group - have confirmed that technical glitches have left bank accounts displaying the wrong balances and certain services unavailable. There is no fix date available.
      • Amazon: "Amazon cloud outage takes down Netflix, Instagram, Pinterest, & more"
          – With the critical Amazon outage, which is the second this month, we wouldn’t be surprised if these popular services started looking at other options, including Rackspace, SoftLayer, Microsoft’s Azure, and Google’s just-introduced Compute Engine. Some of Amazon’s biggest EC2 outages occurred in April and August of last year.
      Which Will Be The Next One?
  • 12. What’s the High Availability Mantra?
      Availability %              Downtime per year   Downtime per month*   Downtime per week
      99% ("two nines")           3.65 days           7.20 hours            1.68 hours
      99.5%                       1.83 days           3.60 hours            50.4 minutes
      99.8%                       17.52 hours         86.23 minutes         20.16 minutes
      99.9% ("three nines")       8.76 hours          43.8 minutes          10.1 minutes
      99.95%                      4.38 hours          21.56 minutes         5.04 minutes
      99.99% ("four nines")       52.56 minutes       4.32 minutes          1.01 minutes
      99.999% ("five nines")      5.26 minutes        25.9 seconds          6.05 seconds
      99.9999% ("six nines")      31.5 seconds        2.59 seconds          0.605 seconds
      99.99999% ("seven nines")   3.15 seconds        0.259 seconds         0.0605 seconds
      Amazon Data Centers (built to Tier 4 standards and with an expected availability of 99.995%) have had two outages already in 2012, each over 3 hours!
      • Tier 3/Tier 4 just defined by hardware redundancies
      • Glaring gaps in operating procedures to prevent fatal human errors
      • Lack of purpose-built BCP software to predict failures
      • Lack of chain of custody to detect root cause
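The downtime figures in the table follow directly from the availability percentage. As a minimal sketch (not part of the original deck), the conversion can be written as below; the function name is hypothetical, and the 365-day year and 30-day month are assumptions, since the slide does not state which period lengths it used.

```python
SECONDS_PER_DAY = 24 * 60 * 60

# Assumed period lengths in days; the slide does not specify them.
PERIOD_DAYS = {"year": 365, "month": 30, "week": 7}


def downtime_seconds(availability_percent, period="year"):
    """Return the downtime allowed in one period at a given availability."""
    if not 0 <= availability_percent <= 100:
        raise ValueError("availability must be between 0 and 100")
    unavailable_fraction = 1 - availability_percent / 100
    return unavailable_fraction * PERIOD_DAYS[period] * SECONDS_PER_DAY


# "Four nines" allows about 52.56 minutes of downtime per year.
four_nines_minutes = downtime_seconds(99.99, "year") / 60

# The 99.995% figure quoted for Tier 4 allows about 26.28 minutes per year.
tier4_minutes = downtime_seconds(99.995, "year") / 60
```

On these assumptions, a single outage of over 3 hours uses up several years of the downtime budget implied by 99.995% availability.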
  • 13. Delivering the High Availability Promise
      • Adequate Redundancies
          – Are there any points of failure, besides power and external networks, that can impact uptime? (Not everything is N+1)
          – What are my redundancy paths?
          – Are the relationships & dependencies among critical assets clearly defined?
          – Can I do an impact analysis on the outage/downtime of any equipment? Can I predict the cascading effect of such an outage on other assets/applications in the data center?
      • Preventing Failures
          – Can any failure be predicted so that proactive measures can be taken? Do I get alerts on threshold breaches so that I can take preventive action before a failure happens?
          – Is there a history of a Move-Add-Change (MAC) that I should be aware of?
          – What is the impact of a MAC on space, power, cooling?
          – Where can new devices/servers best be placed (Floor -> Rack -> Cage)? How can this be determined from the current infrastructure and other dependencies so as to avoid a failure?
          – How do I prevent a fatal human error?
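The impact-analysis question on this slide (which other assets and applications go down when one piece of equipment fails?) can be sketched as a walk over a dependency map. This is an illustrative model only, not the deck's or any product's method: the asset names are hypothetical, and it assumes an asset stays up as long as at least one of its listed upstream sources is up.

```python
def cascade(sources, failed):
    """Return the assets that go down as a consequence of `failed`.

    `sources` maps each asset to the upstream assets it can draw on.
    Assumption: any one working upstream source keeps the asset up.
    """
    down = set(failed)
    changed = True
    while changed:  # repeat until no further asset is lost
        changed = False
        for asset, upstream in sources.items():
            if asset not in down and upstream and all(u in down for u in upstream):
                down.add(asset)
                changed = True
    return down - set(failed)


# Hypothetical topology: rack-1 is dual-fed, rack-2 has a single feed.
sources = {
    "pdu-a": ["ups-a"],
    "pdu-b": ["ups-b"],
    "rack-1": ["pdu-a", "pdu-b"],
    "rack-2": ["pdu-a"],
    "web-app": ["rack-1"],
    "billing-app": ["rack-2"],
}

single_failure = cascade(sources, {"ups-a"})
double_failure = cascade(sources, {"ups-a", "ups-b"})
```

In this toy topology, losing `ups-a` takes down `pdu-a`, `rack-2` and `billing-app`, while the dual-fed `rack-1` and `web-app` survive. That makes the single-fed rack the kind of "not everything is N+1" gap the slide asks about.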
  • 15. The High Availability Challenge
      Asset Over Provisioning / Lack of HA Management Tool / Need to Predict Failures
          – IT assets tracked by Systems Management Tool
          – Facilities assets tracked by BMS
          – Two not inter-operable: Unable to determine missing link for HA
          – Unable to track redundancy paths
          – HA fails if any equipment or software in critical path fails
          – HA fails if there’s fatal human error
          – Health and history of equipment, or previous MAC impact, not tracked
          – Too many assets; two classes of assets
          – Absence of Software Portfolio (even if hardware assets are tracked)
          – Move-Add-Change: Decisions not based on simulations, analysis
          – Absence of change management
          – Absence of workflow approvals
          – Unable to predict failures
          – No chain of custody
  • 16. Beyond HA: Infrastructure & Operational Challenges
      Energy Problems / Operational Problems
          – Low level asset tracking
          – Underutilization of many computing resources
          – Running of old inefficient equipment
          – Decisions not based on analysis
          – Cooling not optimized
          – Floor & Rack Space: Non-optimal placements of equipment
          – Increasing demand for rack space
          – Absence of capacity planning
          – Higher power consumption & growing power bills
          – Not monitoring power use at device levels
          – Dissipation of enormous heat
          – Creation of hot spots
          – Drastic reduction in expected life of computing equipment
          – Failing of a data center
          – Increase in CO2 emission
  • 18. Solution That Bridges the Gap Between IT & Facilities
      Diagram labels: IT System Performance Management; Building Management System; Data Center Infrastructure Management
      Data Center Infrastructure Management (DCIM) Software
  • 19. Solution That Addresses The High Availability Challenge
      Asset Over Provisioning / Lack of HA Management Tool
          – IT assets tracked by Systems Management Tool
          – Facilities assets tracked by BMS
          – Two not inter-operable: Unable to determine missing link for HA
          – Unable to track redundancy paths
          – HA fails if any equipment or software in critical path fails
          – HA fails if there’s fatal human error
          – Health and history of equipment, or previous MAC impact, not tracked
          – Too many assets; two classes of assets
          – Absence of Software Portfolio (even if hardware assets are tracked)
          – Move-Add-Change: Decisions not based on simulations, analysis
          – Absence of change management
          – Absence of workflow approvals
          – Unable to predict failures
          – No chain of custody
      DCIM Helps to Predict Failures
  • 20. Solution That Addresses Infra & Operational Challenges
      Energy Problems / Operational Problems
          – Low level asset tracking
          – Underutilization of many computing resources
          – Running of old inefficient equipment
          – Decisions not based on analysis
          – Cooling not optimized
          – Floor & Rack Space: Non-optimal placements of equipment
          – Increasing demand for rack space
          – Absence of capacity planning
          – Higher power consumption & growing power bills
          – Not monitoring power use at device levels
          – Dissipation of enormous heat
          – Creation of hot spots
          – Drastic reduction in expected life of computing equipment
          – Failing of a data center
          – Increase in CO2 emission
      DCIM Improves Energy & Operational Efficiencies
  • 21. Anatomy of a DCIM Software: GFS Crane
  • 22. Thank You
      http://www.greenfieldsoft.com
      Email: sales@greenfieldsoft.com
  • 23. See also: Data Center Infrastructure Management: ERP for the Data Center Manager