SlideShare a Scribd company logo
IBM SOA
© 2007 IBM Corporation
IBM – BCBS High Availability
Fast Track
Robert R. Rowntree
SOA Enterprise Architect
IBM Software Group
IBM SOA
IBM confidential2 IBM – BCBS High Availability Fast Track
 Introduction
- Availability, 9’s uptime, Work Patterns, Reference Architecture
 HA WebSphere Systems
- Products leverage WAS, SPOF’s, WAS Internals
- Clustering, HA Topologies, HAManager
 Managing HA WebSphere Systems
- ITCAM, SOA Security, SOA Management
- SLAs
 HA Failure Scenarios
- Http Servers, JVMs,
- Containers - Portlet, Web, EJB
- LDAP, Databases, WMQ, ME’s
Agenda
IBM SOA
IBM confidential3 IBM – BCBS High Availability Fast Track
HA – What is Availability? Biggest Impact –App’s/People
Availability = MTBF/(MTBF + MTTR)
 Downtime – Planned/Unplanned – Engineer for Unplanned
- Engineer for unplanned downtime during business hours
- BIGGEST source of unplanned downtime – Software (Applications) and People Errors
 Human Errors – Expertise level, training and tooling. Some companies don’t have any of the 3 – SOL
from the start
 Software Errors – Network, Server, middleware and applications – the biggest culprit - APPLICATIONS
 7X24X365, 6X20 or 5X12 – When will it go down? Don’t know.
- Most applications still typically need minimal disruption during business hours.
- It’s the amount of planned downtime that is different.
- Can’t usually predicate when unplanned downtime will occur – exceptions? – Loads on pays
at banks.
- On a daily basis most businesses can tolerate similar downtimes during biz hours.
 Focus usually is on MTTR
- Strategy is to engineer in the use of fault detection and Auto failover where possible.
- Aircraft have multiple engine systems, Nuclear Plant electronics has triplicated logic.
IBM SOA
IBM confidential4 IBM – BCBS High Availability Fast Track
Levels of Availability
IBM SOA
IBM confidential5 IBM – BCBS High Availability Fast Track
Availability Matrix
IBM SOA
IBM confidential6 IBM – BCBS High Availability Fast Track
Latency for various Workload Patterns
IBM SOA
IBM confidential7 IBM – BCBS High Availability Fast Track
Reference Architecture
IBM SOA
IBM confidential8 IBM – BCBS High Availability Fast Track
Reference Architecture: Product View
IBM SOA
IBM confidential9 IBM – BCBS High Availability Fast Track
 Introduction
- Availability, 9’s uptime, Work Patterns, Reference Architecture
 HA WebSphere Systems
- Products leverage WAS, SPOF’s, WAS Internals
- Clustering, HA Topologies, HAManager
 Managing HA WebSphere Systems
- ITCAM, SOA Security, SOA Management
- SLAs
 HA Failure Scenarios
- Http Servers, JVMs,
- Containers - Portlet, Web, EJB
- LDAP, Databases, WMQ, ME’s
Agenda
IBM SOA
IBM confidential10 IBM – BCBS High Availability Fast Track
WebSphere Systems – Team of Products Leverage WAS
 Several products leverage WebSphere Application Servers HA
Capabilities
1. WebSphere Portal Server
2. WebSphere Process Server
3. WebSphere ESB
4. WebSphere Partner Gateway
IBM SOA
IBM confidential11 IBM – BCBS High Availability Fast Track
WebSphere System – Single Points of Failure
Failure Points Possible Solutions
Firewalls Firewall clustering, firewall
Sprayers
Caching Proxy Backing CP
Http Sprayer Backup load balancer
Web Server Multiple WS
WAS master repository, log files HA share file system, NFS, HW
based clustering
WAS Horizontal, Vertical or both
Node Agent Multiple NA in the cluster, NA as a
OS Service
Deployment Manager OS Service, not a SPOF
IBM SOA
IBM confidential12 IBM – BCBS High Availability Fast Track
WebSphere System – Single Points of Failure
Failure Points Possible Solutions
Entity EJB, application DB HA DB’s
Default Message Provider HAManager configured
Default Message Provider data
store
Clustering, data replication and
parallel databases.
Application database Clustering, data replication,
parallel databases
Session database Memory to Memory replication,
DB clustering
Transaction Logs Shared file system, HAManager
provides failover.
WMQ WMQ cluster
LDAP Master replica, HA LDAP
IBM SOA
IBM confidential13 IBM – BCBS High Availability Fast Track
WebSphere System – Single Points of Failure
Failure Points Possible Solutions
Hubs Multiple interconnected network
paths
OS and other software crashes Clustering, switching to a healthy
node.
Software and Hardware Upgrades Rolling upgrades with clustering or
WLM for 7X24X365, planned
maintenance.
IBM SOA
IBM confidential14 IBM – BCBS High Availability Fast Track
Deployment Manager Failure – Not a SPOF
 Not a Single Point of Failure
1. WAS V6 does not keep routing tables for clustered resource
such as applications or message engines.
1. WAS V6 it is an elected member of cluster. Routing table is now fault tolerant.
2. Responsible less run-time critical tasks such as:
1. Configuration changes
2. Performance Monitoring
3. JMX routing through DM to other components in the cell.
IBM SOA
IBM confidential15 IBM – BCBS High Availability Fast Track
Data – Key point of Failure
 WAS or other components can run without Data
 Can design in redundancy at most tiers but if data is not
available WAS systems can’t run.
 Key Data components required by WAS
- Application Data
- Administrative Repositories
- Persistent Session datastore
- Message Engine datastores
- Transaction Log
- WebSphere System + Application binaries
- HTML Images and files
IBM SOA
IBM confidential16 IBM – BCBS High Availability Fast Track
Address of where to deliver returning request has 4 components.
1. Cache ID
2. Session ID – ID of session once back in the JVM.
3. Clone ID – Used by plug-in to HTTP server to determine both
application server. On failover the failover clone ID is appended and if
fallback is set and once the original JVM is back up, the session will
FAILBACK. This is done to rebalance loads which is important for a
small number of nodes in a cluster.
1. Multiple app servers providing vertical scaling.
2. Need both IP and port – the http plug on the HTTP servers decode this from
the cookie.
Typically a load balancer can determine the destination IP, but it
can’t determine both the IP and port address from the WAS
generated session id. The Http plug translates the SESSION ID
using a XML file generated by the app server to determine the IP
and port end point that precisely ID’s the originating app server.
Session Management –Tracking Down a Session
IBM SOA
IBM confidential17 IBM – BCBS High Availability Fast Track
WebSphere Application Server – Internal Architecture
IBM SOA
IBM confidential18 IBM – BCBS High Availability Fast Track
Clustering – Vertical and Horizontal
IBM SOA
IBM confidential19 IBM – BCBS High Availability Fast Track
Scope of Isolation with System z
Clustering
Possible
Nodes LPAR CEC Geo
Dispersion
Isolation Cost
Vertical Only 1 Same Same No Minimal Lowest
Both 1 or More Same Same No
Both 1 or More 1 or More Same No
Both 1 or More 1 or more 1 or More No
Both 1 or More 1 or more 1 or more Yes Highest Very High
IBM SOA
IBM confidential20 IBM – BCBS High Availability Fast Track
Topology HA Level 1 – Single Node
Best Use – Low Cost, Application with low availability needs, Test Environment
SPOF’s – HTTP Server, Admin Servers, Database
Advantage – Lowest effort to maintain, out of the box install
Disadvantage – Almost everything is a SPOF
IBM SOA
IBM confidential21 IBM – BCBS High Availability Fast Track
Topology - HA Level 2 – Vertical Scaling
Best Use – Low Cost, Some Degree of Fail Over required
SPOF’s – HTTP Server, Database, FW, LDAP
Advantage – F-O if one App Server (JVM) crashes or out of threads temporarily
Disadvantage – OOS/SOL if node level problem, lower level SW or HW has
problems.
IBM SOA
IBM confidential22 IBM – BCBS High Availability Fast Track
Topology - HA Level 3 – Vertical and Horizontal Clustering
Best Use – 1st level providing continuous operation at WAS level
SPOF’s – DB, FW, LDAP
Advantages – Nodal Isolation, On-line Maintenance, Mixed versions possible
Disadvantage – More effort in maintaining the system, HAManager needs NAS, LL
IBM SOA
IBM confidential23 IBM – BCBS High Availability Fast Track
Topology - HA Level 4 – Database Clustering Failover
Best Use – 1st level providing continuous operation at WAS level
SPOF’s – Admin Servers (DMGR and Node Agent), LDAP
Advantages – Option use ARM for auto restart DMGR, NA
Disadvantage – Downed AS, No TPV, No Reconfiguration
IBM SOA
IBM confidential24 IBM – BCBS High Availability Fast Track
Topology - HA Level 5
IBM SOA
IBM confidential25 IBM – BCBS High Availability Fast Track
Failover Clustering Capacity – 2 Types
IP Based Cluster Failover – Slow 1 – 5 minutes
- Tivoli Systems Automation
- IHACMP – AIX
Non IP Cluster – 1 sec to Minutes depending on Configuration
WAS WLM – Http plugin to Http Server, EJB (Corba distributed communication)
Clustering Database Failover
Slow IP based failover
Fast – Parallel Database Partitioning for DB2 UDB EE, Oracle Real App Clusters(RAC), OPS
IBM SOA
IBM confidential26 IBM – BCBS High Availability Fast Track
HAManager
Benefit – Enhance Availability – 2 Area’s
- Transaction Services – Transaction Log Recovery
- Messaging Services
Why – Crashed/Zombie JVM may leave in-flight
transaction with locked resources.
- Block peers from locked records – Snowball effect
- Transaction are not completed
- Frequency – low – but cost can be very high coz s#$% happens just when you
don’t want it to.
Options
- Restart server (Booters) – This is a slow process - WAS V5
- Give Access to another application server – WAS V6 HA Manager or IP based
cluster failover
IBM SOA
IBM confidential27 IBM – BCBS High Availability Fast Track
Key Scenario for Transaction Services
1. JVM crashes with transactions in progress – Tx in
doubt.
2. 2PC may have several resource manager (WMQ, DB2,
SQL Server) involved with objects locked.
3. Without failover to another JVM’s transaction services,
resources will be locked until time-outs are reached.
4. Worse problem – other tx may fail because they cannot
obtain locks.
5. Cascading/snowball effect.
IBM SOA
IBM confidential28 IBM – BCBS High Availability Fast Track
HAManager – Core Group
IBM SOA
IBM confidential29 IBM – BCBS High Availability Fast Track
HAManager – Core Group
-Core Group has coordinator elected
-Tracks info and state names, members, policies,
active/inactive
IBM SOA
IBM confidential30 IBM – BCBS High Availability Fast Track
HAManager – Group Coordination
IBM SOA
IBM confidential31 IBM – BCBS High Availability Fast Track
HAManager – Transaction Managers for Core Group
IBM SOA
IBM confidential32 IBM – BCBS High Availability Fast Track
HAManager – HOW – WAS V6
Recovery process started in peer member of cluster.
1. Waits for lock time-out to expire set by Crashed JVM.
2. Complete’s in-doubt Tx’s.
3. Releases locks in backend resource managers.
4. Releases Tx logs.
5. No new work is performed.
IBM SOA
IBM confidential33 IBM – BCBS High Availability Fast Track
HA Manager – Scenario – 2 PC with Resource Manager
Locking (Database)
IBM SOA
IBM confidential34 IBM – BCBS High Availability Fast Track
HA Manager – Scenario – 2 PC with Resource
Manager Locking (Database)
IBM SOA
IBM confidential35 IBM – BCBS High Availability Fast Track
HAManager – Tx Services (Locking) Policies
-One of N Policy Requirements – Most typical Policy
-Shared file system must support automatic lock recovery.
-Locking critical to prevent corruption of Tx logs.
-Lock recovery is necessary to ensure peer cluster member
access.
-Lock lease time (LLT) default – 45 seconds
- HAManager fails over in 10 second but LLT=45
- HAManager must wait 35 seconds
- Starting point LLT=10
- HAManager = 12 seconds
IBM SOA
IBM confidential36 IBM – BCBS High Availability Fast Track
HAManager – How - Old Way – WAS V5
 Only way in WAS V5 – possible in V6 but complex
-Required IBM HACMP or Tivoli System Automation
-Shared Drive – Config Reporting, log files, Tx logs, WAS
binaries.
-IP Address – Each has its own IP, virtual IP for client access.
-HA Software (HACMP) manages group of IPs, disk, file
systems, start/stop scripts for WAS.
-On Failure – move IPs, disk, starting WAS
-Disadvantage
- Recovery slow, virtual IP on same subnet (local only)
- Complex
IBM SOA
IBM confidential37 IBM – BCBS High Availability Fast Track
HAManager – How- Old Way – WAS V5
IBM SOA
IBM confidential38 IBM – BCBS High Availability Fast Track
HAManager – Configuration Requirements
 Enablers – HAManager, HA File System, Lease-based locking Protocol
 1. Visible Translog – must be accessible to all members in core group.
 2. Platform – Highly Available File System – IBM SAN FS, NAS
 - Needs Lease based exclusive locking protocol
 - CIFS - Common internal file system
 - NFS V4
 3. R/W Access Rights - All App Servers must be able to read/write to the
logs before recovery can occur.
 4. Consequences - if not, locks held by processes on failed node will not be
automatically released.
 - Tx will not be completed, database potential impaired.
 - Peer servers can only recover in-flight tx if database locks are
released
IBM SOA
IBM confidential39 IBM – BCBS High Availability Fast Track
 Introduction
- Availability, 9’s uptime, Work Patterns, Reference Architecture
 HA WebSphere Systems
- Products leverage WAS, SPOF’s, WAS Internals
- Clustering, HA Topologies, HAManager
 Managing HA WebSphere Systems
- ITCAM, SOA Security, SOA Management
- SLAs
 HA Failure Scenarios
- Http Servers, JVMs,
- Containers - Portlet, Web, EJB
- LDAP, Databases, WMQ, ME’s
Agenda
IBM SOA
IBM confidential40 IBM – BCBS High Availability Fast Track
Services
atomic and composite
Operational Systems
Service Components
Consumers
Business Processes
process choreography
ServiceProviderServiceConsumer
SAP Custom
Application
OO
ApplicationISV
Custom Apps
Platform Supporting Middleware
MQ DB2Unix OS/390
Outlook
SCA Portlet WSRP B2B Other
Integrated Console
• Allow for seamless views
across different layers of
abstraction.
Service
Management
Application
Monitoring
Resource
Monitoring
Resource
Monitoring
Transaction
Tracking
Integrated Reporting
• Generate enterprise-
wide service level
reporting
SOA Management: Solution View
Business
Process
Management
IBM SOA
IBM confidential41 IBM – BCBS High Availability Fast Track
SOA Management: Example 2 – Digging out the CICS Data
IBM SOA
IBM confidential42 IBM – BCBS High Availability Fast Track
Example 1: Reuse: Service Creation: Digging Out the CICS Data
IBM SOA
IBM confidential43 IBM – BCBS High Availability Fast Track
Example 1: Reuse: Service Creation: Digging Out the CICS Data
IBM SOA
IBM confidential44 IBM – BCBS High Availability Fast Track
SOA Management: Example 2 – Logical Architecture
IBM SOA
IBM confidential45 IBM – BCBS High Availability Fast Track
 “Contracts” are established between
service requestors and providers, also
known as Service Level Agreements
 Management focus turns to monitoring for
compliance to agreed upon service levels
 “Active” management optimizes systems
to avoid service violations
SOA Management: Service Levels
Service Provider
Service
Service
Requestor
Quality of
Service
Capacity
Security
Performance
XML
WSDL
SOAP
Service Level
Agreement
 # of requests
allowed
 Acceptable response
time
 Charge per request
IBM SOA
IBM confidential46 IBM – BCBS High Availability Fast Track
SOA Security: IBM Security Approach - MASS
IBM SOA
IBM confidential47 IBM – BCBS High Availability Fast Track
Custom
Application
Packaged
Application
Packaged
Application
Custom
Application
consumers
business processes
process choreography
services
atomic and composite
ServiceConsumerServiceProvider
11
22
33
44
55
OO
ApplicationCustom
ApplicationOutlook
SAP Custom
Application
business processes
process choreography
Services (Definitions)
atomic and composite
Service
components
ServiceConsumerServiceProvider
11
22
33
44
55
OO
Application
ISV
Custom Apps
Platform
Operational
systems Supporting Middleware
MQ DB2Unix OS/390
SOA Security: Encompass all Aspects of Security
SOA Security
 Identity
 Authentication
 Authorization
 Auditing
 Confidentiality,
Integrity and
Availability
 Auditing &
Compliance
 Administration and
Policy Management
SCA Portlet WSRP B2B Other
IBM SOA
IBM confidential48 IBM – BCBS High Availability Fast Track
 Introduction
- Availability, 9’s uptime, Work Patterns, Reference Architecture
 HA WebSphere Systems
- Products leverage WAS, SPOF’s, WAS Internals
- Clustering, HA Topologies, HAManager
 Managing HA WebSphere Systems
- ITCAM, SOA Security, SOA Management
- SLAs
 HA Failure Scenarios
- Http Servers, JVMs,
- Containers - Portlet, Web, EJB
- LDAP, Databases, WMQ, ME’s
Agenda
IBM SOA
IBM confidential49 IBM – BCBS High Availability Fast Track

More Related Content

What's hot

Auto scaling and dynamic routing for was liberty collectives
Auto scaling and dynamic routing for was liberty collectivesAuto scaling and dynamic routing for was liberty collectives
Auto scaling and dynamic routing for was liberty collectives
sflynn073
 
Web Logic Jboss Final
Web Logic Jboss FinalWeb Logic Jboss Final
Web Logic Jboss Final
Mohamed Atef
 
Lotus Connections Schaalbaarheid En Performance
Lotus Connections   Schaalbaarheid En PerformanceLotus Connections   Schaalbaarheid En Performance
Lotus Connections Schaalbaarheid En Performance
Social Software Blog
 
AAI-3281 Smarter Production with WebSphere Application Server ND Intelligent ...
AAI-3281 Smarter Production with WebSphere Application Server ND Intelligent ...AAI-3281 Smarter Production with WebSphere Application Server ND Intelligent ...
AAI-3281 Smarter Production with WebSphere Application Server ND Intelligent ...
WASdev Community
 
WebSphere App Server vs JBoss vs WebLogic vs Tomcat
WebSphere App Server vs JBoss vs WebLogic vs TomcatWebSphere App Server vs JBoss vs WebLogic vs Tomcat
WebSphere App Server vs JBoss vs WebLogic vs Tomcat
WASdev Community
 
Planning For Catastrophe with IBM WAS and IBM BPM
Planning For Catastrophe with IBM WAS and IBM BPMPlanning For Catastrophe with IBM WAS and IBM BPM
Planning For Catastrophe with IBM WAS and IBM BPM
WASdev Community
 
IBM WebSphere Application Server Update - Technical University (March 2015)
IBM WebSphere Application Server Update - Technical University (March 2015)IBM WebSphere Application Server Update - Technical University (March 2015)
IBM WebSphere Application Server Update - Technical University (March 2015)
Ian Robinson
 
Was liberty in deployments
Was liberty in deploymentsWas liberty in deployments
Was liberty in deployments
sflynn073
 
New Approaches to Faster Oracle Forms System Performance
New Approaches to Faster Oracle Forms System PerformanceNew Approaches to Faster Oracle Forms System Performance
New Approaches to Faster Oracle Forms System Performance
Correlsense
 
AAI-1305 Choosing WebSphere Liberty for Java EE Deployments
AAI-1305 Choosing WebSphere Liberty for Java EE DeploymentsAAI-1305 Choosing WebSphere Liberty for Java EE Deployments
AAI-1305 Choosing WebSphere Liberty for Java EE Deployments
WASdev Community
 
Liberty management
Liberty managementLiberty management
Liberty management
WASdev Community
 
AAI-2075 Evolving an IBM WebSphere Topology to Manage a Changing Workloa
AAI-2075 Evolving an IBM WebSphere Topology to Manage a Changing WorkloaAAI-2075 Evolving an IBM WebSphere Topology to Manage a Changing Workloa
AAI-2075 Evolving an IBM WebSphere Topology to Manage a Changing Workloa
WASdev Community
 
WAS Support & Monitoring Tools
WAS Support & Monitoring ToolsWAS Support & Monitoring Tools
WAS Support & Monitoring Tools
Royal Cyber Inc.
 
vFabric - Ideal Platform for SaaS Apps
vFabric - Ideal Platform for SaaS AppsvFabric - Ideal Platform for SaaS Apps
vFabric - Ideal Platform for SaaS Apps
VMware vFabric
 
WAS vs JBoss, WebLogic, Tomcat (year 2015)
WAS vs JBoss, WebLogic, Tomcat (year 2015)WAS vs JBoss, WebLogic, Tomcat (year 2015)
WAS vs JBoss, WebLogic, Tomcat (year 2015)
Roman Kharkovski
 
WebSphere 6.1 Admin Course 1
WebSphere 6.1 Admin Course 1WebSphere 6.1 Admin Course 1
WebSphere 6.1 Admin Course 1
odedns
 
AAI-4847 Full Disclosure on the Performance Characteristics of WebSphere Appl...
AAI-4847 Full Disclosure on the Performance Characteristics of WebSphere Appl...AAI-4847 Full Disclosure on the Performance Characteristics of WebSphere Appl...
AAI-4847 Full Disclosure on the Performance Characteristics of WebSphere Appl...
WASdev Community
 
Open Mic on Sametime9 Install -Best Practices
Open Mic on Sametime9 Install  -Best PracticesOpen Mic on Sametime9 Install  -Best Practices
Open Mic on Sametime9 Install -Best Practices
Vinayak Tavargeri
 
WebSphere Application Server JBoss TCO analysis
WebSphere Application Server JBoss TCO analysisWebSphere Application Server JBoss TCO analysis
WebSphere Application Server JBoss TCO analysis
Shetal Patel
 
Websphere Application Server V8.5
Websphere Application Server V8.5Websphere Application Server V8.5
Websphere Application Server V8.5
IBM WebSphereIndia
 

What's hot (20)

Auto scaling and dynamic routing for was liberty collectives
Auto scaling and dynamic routing for was liberty collectivesAuto scaling and dynamic routing for was liberty collectives
Auto scaling and dynamic routing for was liberty collectives
 
Web Logic Jboss Final
Web Logic Jboss FinalWeb Logic Jboss Final
Web Logic Jboss Final
 
Lotus Connections Schaalbaarheid En Performance
Lotus Connections   Schaalbaarheid En PerformanceLotus Connections   Schaalbaarheid En Performance
Lotus Connections Schaalbaarheid En Performance
 
AAI-3281 Smarter Production with WebSphere Application Server ND Intelligent ...
AAI-3281 Smarter Production with WebSphere Application Server ND Intelligent ...AAI-3281 Smarter Production with WebSphere Application Server ND Intelligent ...
AAI-3281 Smarter Production with WebSphere Application Server ND Intelligent ...
 
WebSphere App Server vs JBoss vs WebLogic vs Tomcat
WebSphere App Server vs JBoss vs WebLogic vs TomcatWebSphere App Server vs JBoss vs WebLogic vs Tomcat
WebSphere App Server vs JBoss vs WebLogic vs Tomcat
 
Planning For Catastrophe with IBM WAS and IBM BPM
Planning For Catastrophe with IBM WAS and IBM BPMPlanning For Catastrophe with IBM WAS and IBM BPM
Planning For Catastrophe with IBM WAS and IBM BPM
 
IBM WebSphere Application Server Update - Technical University (March 2015)
IBM WebSphere Application Server Update - Technical University (March 2015)IBM WebSphere Application Server Update - Technical University (March 2015)
IBM WebSphere Application Server Update - Technical University (March 2015)
 
Was liberty in deployments
Was liberty in deploymentsWas liberty in deployments
Was liberty in deployments
 
New Approaches to Faster Oracle Forms System Performance
New Approaches to Faster Oracle Forms System PerformanceNew Approaches to Faster Oracle Forms System Performance
New Approaches to Faster Oracle Forms System Performance
 
AAI-1305 Choosing WebSphere Liberty for Java EE Deployments
AAI-1305 Choosing WebSphere Liberty for Java EE DeploymentsAAI-1305 Choosing WebSphere Liberty for Java EE Deployments
AAI-1305 Choosing WebSphere Liberty for Java EE Deployments
 
Liberty management
Liberty managementLiberty management
Liberty management
 
AAI-2075 Evolving an IBM WebSphere Topology to Manage a Changing Workloa
AAI-2075 Evolving an IBM WebSphere Topology to Manage a Changing WorkloaAAI-2075 Evolving an IBM WebSphere Topology to Manage a Changing Workloa
AAI-2075 Evolving an IBM WebSphere Topology to Manage a Changing Workloa
 
WAS Support & Monitoring Tools
WAS Support & Monitoring ToolsWAS Support & Monitoring Tools
WAS Support & Monitoring Tools
 
vFabric - Ideal Platform for SaaS Apps
vFabric - Ideal Platform for SaaS AppsvFabric - Ideal Platform for SaaS Apps
vFabric - Ideal Platform for SaaS Apps
 
WAS vs JBoss, WebLogic, Tomcat (year 2015)
WAS vs JBoss, WebLogic, Tomcat (year 2015)WAS vs JBoss, WebLogic, Tomcat (year 2015)
WAS vs JBoss, WebLogic, Tomcat (year 2015)
 
WebSphere 6.1 Admin Course 1
WebSphere 6.1 Admin Course 1WebSphere 6.1 Admin Course 1
WebSphere 6.1 Admin Course 1
 
AAI-4847 Full Disclosure on the Performance Characteristics of WebSphere Appl...
AAI-4847 Full Disclosure on the Performance Characteristics of WebSphere Appl...AAI-4847 Full Disclosure on the Performance Characteristics of WebSphere Appl...
AAI-4847 Full Disclosure on the Performance Characteristics of WebSphere Appl...
 
Open Mic on Sametime9 Install -Best Practices
Open Mic on Sametime9 Install  -Best PracticesOpen Mic on Sametime9 Install  -Best Practices
Open Mic on Sametime9 Install -Best Practices
 
WebSphere Application Server JBoss TCO analysis
WebSphere Application Server JBoss TCO analysisWebSphere Application Server JBoss TCO analysis
WebSphere Application Server JBoss TCO analysis
 
Websphere Application Server V8.5
Websphere Application Server V8.5Websphere Application Server V8.5
Websphere Application Server V8.5
 

Similar to Toronix - IBM WebSphere HA & High Availability Concepts

Membase Meetup 2010
Membase Meetup 2010Membase Meetup 2010
Membase Meetup 2010
Membase
 
High Availability Infrastructure for Cloud Computing
High Availability Infrastructure for Cloud ComputingHigh Availability Infrastructure for Cloud Computing
High Availability Infrastructure for Cloud Computing
Bob Rhubart
 
Websphere-corporate-training-in-mumbai
Websphere-corporate-training-in-mumbai Websphere-corporate-training-in-mumbai
Websphere-corporate-training-in-mumbai
vibrantuser
 
Websphere-corporate-training-in-mumbai
Websphere-corporate-training-in-mumbai Websphere-corporate-training-in-mumbai
Websphere-corporate-training-in-mumbai
vibrantuser
 
Membase Meetup - Silicon Valley
Membase Meetup - Silicon ValleyMembase Meetup - Silicon Valley
Membase Meetup - Silicon Valley
Membase
 
5 Ways to Avoid Server and Application Downtime
5 Ways to Avoid Server and Application Downtime5 Ways to Avoid Server and Application Downtime
5 Ways to Avoid Server and Application Downtime
Neverfail Group
 
Impact 2009 1783 Achieving Availability With W A Sz User Experience
Impact 2009 1783  Achieving  Availability With  W A Sz   User ExperienceImpact 2009 1783  Achieving  Availability With  W A Sz   User Experience
Impact 2009 1783 Achieving Availability With W A Sz User Experience
Elena Nanos
 
W23 - Advanced Performance Tactics for WebSphere Performance
W23 - Advanced Performance Tactics for WebSphere PerformanceW23 - Advanced Performance Tactics for WebSphere Performance
W23 - Advanced Performance Tactics for WebSphere Performance
Hendrik van Run
 
A26 MariaDB : The New&Implemented MySQL Branch by Colin Charles
A26 MariaDB : The New&Implemented MySQL Branch by Colin CharlesA26 MariaDB : The New&Implemented MySQL Branch by Colin Charles
A26 MariaDB : The New&Implemented MySQL Branch by Colin Charles
Insight Technology, Inc.
 
system automation, integration and recovery
system automation, integration and recoverysystem automation, integration and recovery
system automation, integration and recovery
Derek Chang
 
IBM MQ vs Apache ActiveMQ
IBM MQ vs Apache ActiveMQIBM MQ vs Apache ActiveMQ
IBM MQ vs Apache ActiveMQ
Roman Kharkovski
 
JUDCon2014-ScalableMessagingWithJBossA-MQ and Apache Camel
JUDCon2014-ScalableMessagingWithJBossA-MQ and Apache CamelJUDCon2014-ScalableMessagingWithJBossA-MQ and Apache Camel
JUDCon2014-ScalableMessagingWithJBossA-MQ and Apache Camel
Naveen Raj Balasubramaniam
 
AMF Flash and .NET
AMF Flash and .NETAMF Flash and .NET
AMF Flash and .NET
Yaniv Uriel
 
WSI33 - Advanced Performance Tactics for IBM WebSphere Application Server
WSI33 - Advanced Performance Tactics for IBM WebSphere Application ServerWSI33 - Advanced Performance Tactics for IBM WebSphere Application Server
WSI33 - Advanced Performance Tactics for IBM WebSphere Application Server
Hendrik van Run
 
Meetup callback
Meetup callbackMeetup callback
Meetup callback
Wayne Scarano
 
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
Edward Burns
 
Enterprise service bus part 2
Enterprise service bus part 2Enterprise service bus part 2
Enterprise service bus part 2
Return on Intelligence
 
Magento scalability from the trenches (Meet Magento Sweden 2016)
Magento scalability from the trenches (Meet Magento Sweden 2016)Magento scalability from the trenches (Meet Magento Sweden 2016)
Magento scalability from the trenches (Meet Magento Sweden 2016)
Divante
 
MariaDB 5.5 and what comes next - Percona Live NYC 2012
MariaDB 5.5 and what comes next - Percona Live NYC 2012MariaDB 5.5 and what comes next - Percona Live NYC 2012
MariaDB 5.5 and what comes next - Percona Live NYC 2012
Colin Charles
 
Netherlands Tech Tour 02 - MySQL Fabric
Netherlands Tech Tour 02 -   MySQL FabricNetherlands Tech Tour 02 -   MySQL Fabric
Netherlands Tech Tour 02 - MySQL Fabric
Mark Swarbrick
 

Similar to Toronix - IBM WebSphere HA & High Availability Concepts (20)

Membase Meetup 2010
Membase Meetup 2010Membase Meetup 2010
Membase Meetup 2010
 
High Availability Infrastructure for Cloud Computing
High Availability Infrastructure for Cloud ComputingHigh Availability Infrastructure for Cloud Computing
High Availability Infrastructure for Cloud Computing
 
Websphere-corporate-training-in-mumbai
Websphere-corporate-training-in-mumbai Websphere-corporate-training-in-mumbai
Websphere-corporate-training-in-mumbai
 
Websphere-corporate-training-in-mumbai
Websphere-corporate-training-in-mumbai Websphere-corporate-training-in-mumbai
Websphere-corporate-training-in-mumbai
 
Membase Meetup - Silicon Valley
Membase Meetup - Silicon ValleyMembase Meetup - Silicon Valley
Membase Meetup - Silicon Valley
 
5 Ways to Avoid Server and Application Downtime
5 Ways to Avoid Server and Application Downtime5 Ways to Avoid Server and Application Downtime
5 Ways to Avoid Server and Application Downtime
 
Impact 2009 1783 Achieving Availability With W A Sz User Experience
Impact 2009 1783  Achieving  Availability With  W A Sz   User ExperienceImpact 2009 1783  Achieving  Availability With  W A Sz   User Experience
Impact 2009 1783 Achieving Availability With W A Sz User Experience
 
W23 - Advanced Performance Tactics for WebSphere Performance
W23 - Advanced Performance Tactics for WebSphere PerformanceW23 - Advanced Performance Tactics for WebSphere Performance
W23 - Advanced Performance Tactics for WebSphere Performance
 
A26 MariaDB : The New&Implemented MySQL Branch by Colin Charles
A26 MariaDB : The New&Implemented MySQL Branch by Colin CharlesA26 MariaDB : The New&Implemented MySQL Branch by Colin Charles
A26 MariaDB : The New&Implemented MySQL Branch by Colin Charles
 
system automation, integration and recovery
system automation, integration and recoverysystem automation, integration and recovery
system automation, integration and recovery
 
IBM MQ vs Apache ActiveMQ
IBM MQ vs Apache ActiveMQIBM MQ vs Apache ActiveMQ
IBM MQ vs Apache ActiveMQ
 
JUDCon2014-ScalableMessagingWithJBossA-MQ and Apache Camel
JUDCon2014-ScalableMessagingWithJBossA-MQ and Apache CamelJUDCon2014-ScalableMessagingWithJBossA-MQ and Apache Camel
JUDCon2014-ScalableMessagingWithJBossA-MQ and Apache Camel
 
AMF Flash and .NET
AMF Flash and .NETAMF Flash and .NET
AMF Flash and .NET
 
WSI33 - Advanced Performance Tactics for IBM WebSphere Application Server
WSI33 - Advanced Performance Tactics for IBM WebSphere Application ServerWSI33 - Advanced Performance Tactics for IBM WebSphere Application Server
WSI33 - Advanced Performance Tactics for IBM WebSphere Application Server
 
Meetup callback
Meetup callbackMeetup callback
Meetup callback
 
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
 
Enterprise service bus part 2
Enterprise service bus part 2Enterprise service bus part 2
Enterprise service bus part 2
 
Magento scalability from the trenches (Meet Magento Sweden 2016)
Magento scalability from the trenches (Meet Magento Sweden 2016)Magento scalability from the trenches (Meet Magento Sweden 2016)
Magento scalability from the trenches (Meet Magento Sweden 2016)
 
MariaDB 5.5 and what comes next - Percona Live NYC 2012
MariaDB 5.5 and what comes next - Percona Live NYC 2012MariaDB 5.5 and what comes next - Percona Live NYC 2012
MariaDB 5.5 and what comes next - Percona Live NYC 2012
 
Netherlands Tech Tour 02 - MySQL Fabric
Netherlands Tech Tour 02 -   MySQL FabricNetherlands Tech Tour 02 -   MySQL Fabric
Netherlands Tech Tour 02 - MySQL Fabric
 

Recently uploaded

みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 

Recently uploaded (20)

みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 

Toronix - IBM WebSphere HA & High Availability Concepts

  • 1. IBM SOA © 2007 IBM Corporation IBM – BCBS High Availability Fast Track Robert R. Rowntree SOA Enterprise Architect IBM Software Group
  • 2. IBM SOA IBM confidential2 IBM – BCBS High Availability Fast Track  Introduction - Availability, 9’s uptime, Work Patterns, Reference Architecture  HA WebSphere Systems - Products leverage WAS, SPOF’s, WAS Internals - Clustering, HA Topologies, HAManager  Managing HA WebSphere Systems - ITCAM, SOA Security, SOA Management - SLAs  HA Failure Scenarios - Http Servers, JVMs, - Containers - Portlet, Web, EJB - LDAP, Databases, WMQ, ME’s Agenda
  • 3. IBM SOA IBM confidential3 IBM – BCBS High Availability Fast Track HA – What is Availability? Biggest Impact –App’s/People Availability = MTBF/(MTBF + MTTR)  Downtime – Planned/Unplanned – Engineer for Unplanned - Engineer for unplanned downtime during business hours - BIGGEST source of unplanned downtime – Software (Applications) and People Errors  Human Errors – Expertise level, training and tooling. Some companies don’t have any of the 3 – SOL from the start  Software Errors – Network, Server, middleware and applications – the biggest culprit - APPLICATIONS  7X24X365, 6X20 or 5X12 – When will it go down? Don’t know. - Most applications still typically need minimal disruption during business hours. - It’s the amount of planned downtime that is different. - Can’t usually predicate when unplanned downtime will occur – exceptions? – Loads on pays at banks. - On a daily basis most businesses can tolerate similar downtimes during biz hours.  Focus usually is on MTTR - Strategy is to engineer in the use of fault detection and Auto failover where possible. - Aircraft have multiple engine systems, Nuclear Plant electronics has triplicated logic.
  • 4. IBM SOA IBM confidential4 IBM – BCBS High Availability Fast Track Levels of Availability
  • 5. IBM SOA IBM confidential5 IBM – BCBS High Availability Fast Track Availability Matrix
  • 6. IBM SOA IBM confidential6 IBM – BCBS High Availability Fast Track Latency for various Workload Patterns
  • 7. IBM SOA IBM confidential7 IBM – BCBS High Availability Fast Track Reference Architecture
  • 8. IBM SOA IBM confidential8 IBM – BCBS High Availability Fast Track Reference Architecture: Product View
  • 9. IBM SOA IBM confidential9 IBM – BCBS High Availability Fast Track  Introduction - Availability, 9’s uptime, Work Patterns, Reference Architecture  HA WebSphere Systems - Products leverage WAS, SPOF’s, WAS Internals - Clustering, HA Topologies, HAManager  Managing HA WebSphere Systems - ITCAM, SOA Security, SOA Management - SLAs  HA Failure Scenarios - Http Servers, JVMs, - Containers - Portlet, Web, EJB - LDAP, Databases, WMQ, ME’s Agenda
  • 10. IBM SOA IBM confidential10 IBM – BCBS High Availability Fast Track WebSphere Systems – Team of Products Leverage WAS  Several products leverage WebSphere Application Servers HA Capabilities 1. WebSphere Portal Server 2. WebSphere Process Server 3. WebSphere ESB 4. WebSphere Partner Gateway
  • 11. IBM SOA IBM confidential11 IBM – BCBS High Availability Fast Track WebSphere System – Single Points of Failure Failure Points Possible Solutions Firewalls Firewall clustering, firewall Sprayers Caching Proxy Backing CP Http Sprayer Backup load balancer Web Server Multiple WS WAS master repository, log files HA share file system, NFS, HW based clustering WAS Horizontal, Vertical or both Node Agent Multiple NA in the cluster, NA as a OS Service Deployment Manager OS Service, not a SPOF
  • 12. IBM SOA IBM confidential12 IBM – BCBS High Availability Fast Track WebSphere System – Single Points of Failure Failure Points Possible Solutions Entity EJB, application DB HA DB’s Default Message Provider HAManager configured Default Message Provider data store Clustering, data replication and parallel databases. Application database Clustering, data replication, parallel databases Session database Memory to Memory replication, DB clustering Transaction Logs Shared file system, HAManager provides failover. WMQ WMQ cluster LDAP Master replica, HA LDAP
  • 13. IBM SOA IBM confidential13 IBM – BCBS High Availability Fast Track WebSphere System – Single Points of Failure Failure Points Possible Solutions Hubs Multiple interconnected network paths OS and other software crashes Clustering, switching to a healthy node. Software and Hardware Upgrades Rolling upgrades with clustering or WLM for 7X24X365, planned maintenance.
  • 14. IBM SOA IBM confidential14 IBM – BCBS High Availability Fast Track Deployment Manager Failure – Not a SPOF  Not a Single Point of Failure 1. WAS V6 does not keep routing tables for clustered resource such as applications or message engines. 1. WAS V6 it is an elected member of cluster. Routing table is now fault tolerant. 2. Responsible less run-time critical tasks such as: 1. Configuration changes 2. Performance Monitoring 3. JMX routing through DM to other components in the cell.
  • 15. IBM SOA IBM confidential15 IBM – BCBS High Availability Fast Track Data – Key point of Failure  WAS or other components can run without Data  Can design in redundancy at most tiers but if data is not available WAS systems can’t run.  Key Data components required by WAS - Application Data - Administrative Repositories - Persistent Session datastore - Message Engine datastores - Transaction Log - WebSphere System + Application binaries - HTML Images and files
  • 16. IBM SOA IBM confidential16 IBM – BCBS High Availability Fast Track Address of where to deliver returning request has 4 components. 1. Cache ID 2. Session ID – ID of session once back in the JVM. 3. Clone ID – Used by plug-in to HTTP server to determine both application server. On failover the failover clone ID is appended and if fallback is set and once the original JVM is back up, the session will FAILBACK. This is done to rebalance loads which is important for a small number of nodes in a cluster. 1. Multiple app servers providing vertical scaling. 2. Need both IP and port – the http plug on the HTTP servers decode this from the cookie. Typically a load balancer can determine the destination IP, but it can’t determine both the IP and port address from the WAS generated session id. The Http plug translates the SESSION ID using a XML file generated by the app server to determine the IP and port end point that precisely ID’s the originating app server. Session Management –Tracking Down a Session
  • 17. IBM SOA IBM confidential17 IBM – BCBS High Availability Fast Track WebSphere Application Server – Internal Architecture
  • 18. IBM SOA IBM confidential18 IBM – BCBS High Availability Fast Track Clustering – Vertical and Horizontal
  • 19. IBM SOA IBM confidential19 IBM – BCBS High Availability Fast Track Scope of Isolation with System z Clustering Possible Nodes LPAR CEC Geo Dispersion Isolation Cost Vertical Only 1 Same Same No Minimal Lowest Both 1 or More Same Same No Both 1 or More 1 or More Same No Both 1 or More 1 or more 1 or More No Both 1 or More 1 or more 1 or more Yes Highest Very High
  • 20. IBM SOA IBM confidential20 IBM – BCBS High Availability Fast Track Topology HA Level 1 – Single Node Best Use – Low Cost, Application with low availability needs, Test Environment SPOF’s – HTTP Server, Admin Servers, Database Advantage – Lowest effort to maintain, out of the box install Disadvantage – Almost everything is a SPOF
  • 21. IBM SOA IBM confidential21 IBM – BCBS High Availability Fast Track Topology - HA Level 2 – Vertical Scaling Best Use – Low Cost, Some Degree of Fail Over required SPOF’s – HTTP Server, Database, FW, LDAP Advantage – F-O if one App Server (JVM) crashes or out of threads temporarily Disadvantage – OOS/SOL if node level problem, lower level SW or HW has problems.
  • 22. IBM SOA IBM confidential22 IBM – BCBS High Availability Fast Track Topology - HA Level 3 – Vertical and Horizontal Clustering Best Use – 1st level providing continuous operation at WAS level SPOF’s – DB, FW, LDAP Advantages – Nodal Isolation, On-line Maintenance, Mixed versions possible Disadvantage – More effort in maintaining the system, HAManager needs NAS, LL
  • 23. IBM SOA IBM confidential23 IBM – BCBS High Availability Fast Track Topology - HA Level 4 – Database Clustering Failover Best Use – 1st level providing continuous operation at WAS level SPOF’s – Admin Servers (DMGR and Node Agent), LDAP Advantages – Option use ARM for auto restart DMGR, NA Disadvantage – Downed AS, No TPV, No Reconfiguration
  • 24. IBM SOA IBM confidential24 IBM – BCBS High Availability Fast Track Topology - HA Level 5
  • 25. IBM SOA IBM confidential25 IBM – BCBS High Availability Fast Track Failover Clustering Capacity – 2 Types IP Based Cluster Failover – Slow 1 – 5 minutes - Tivoli Systems Automation - IHACMP – AIX Non IP Cluster – 1 sec to Minutes depending on Configuration WAS WLM – Http plugin to Http Server, EJB (Corba distributed communication) Clustering Database Failover Slow IP based failover Fast – Parallel Database Partitioning for DB2 UDB EE, Oracle Real App Clusters(RAC), OPS
  • 26. IBM SOA IBM confidential26 IBM – BCBS High Availability Fast Track HAManager Benefit – Enhance Availability – 2 Area’s - Transaction Services – Transaction Log Recovery - Messaging Services Why – Crashed/Zombie JVM may leave in-flight transaction with locked resources. - Block peers from locked records – Snowball effect - Transaction are not completed - Frequency – low – but cost can be very high coz s#$% happens just when you don’t want it to. Options - Restart server (Booters) – This is a slow process - WAS V5 - Give Access to another application server – WAS V6 HA Manager or IP based cluster failover
  • 27. IBM SOA IBM confidential27 IBM – BCBS High Availability Fast Track Key Scenario for Transaction Services 1. JVM crashes with transactions in progress – Tx in doubt. 2. 2PC may have several resource manager (WMQ, DB2, SQL Server) involved with objects locked. 3. Without failover to another JVM’s transaction services, resources will be locked until time-outs are reached. 4. Worse problem – other tx may fail because they cannot obtain locks. 5. Cascading/snowball effect.
  • 28. IBM SOA IBM confidential28 IBM – BCBS High Availability Fast Track HAManager – Core Group
  • 29. IBM SOA IBM confidential29 IBM – BCBS High Availability Fast Track HAManager – Core Group -Core Group has coordinator elected -Tracks info and state names, members, policies, active/inactive
  • 30. IBM SOA IBM confidential30 IBM – BCBS High Availability Fast Track HAManager – Group Coordination
  • 31. IBM SOA IBM confidential31 IBM – BCBS High Availability Fast Track HAManager – Transaction Managers for Core Group
  • 32. IBM SOA IBM confidential32 IBM – BCBS High Availability Fast Track HAManager – HOW – WAS V6 Recovery process started in peer member of cluster. 1. Waits for lock time-out to expire set by Crashed JVM. 2. Complete’s in-doubt Tx’s. 3. Releases locks in backend resource managers. 4. Releases Tx logs. 5. No new work is performed.
  • 33. IBM SOA IBM confidential33 IBM – BCBS High Availability Fast Track HA Manager – Scenario – 2 PC with Resource Manager Locking (Database)
  • 34. IBM SOA IBM confidential34 IBM – BCBS High Availability Fast Track HA Manager – Scenario – 2 PC with Resource Manager Locking (Database)
  • 35. IBM SOA IBM confidential35 IBM – BCBS High Availability Fast Track HAManager – Tx Services (Locking) Policies -One of N Policy Requirements – Most typical Policy -Shared file system must support automatic lock recovery. -Locking critical to prevent corruption of Tx logs. -Lock recovery is necessary to ensure peer cluster member access. -Lock lease time (LLT) default – 45 seconds - HAManager fails over in 10 second but LLT=45 - HAManager must wait 35 seconds - Starting point LLT=10 - HAManager = 12 seconds
  • 36. IBM SOA IBM confidential36 IBM – BCBS High Availability Fast Track HAManager – How - Old Way – WAS V5  Only way in WAS V5 – possible in V6 but complex -Required IBM HACMP or Tivoli System Automation -Shared Drive – Config Reporting, log files, Tx logs, WAS binaries. -IP Address – Each has its own IP, virtual IP for client access. -HA Software (HACMP) manages group of IPs, disk, file systems, start/stop scripts for WAS. -On Failure – move IPs, disk, starting WAS -Disadvantage - Recovery slow, virtual IP on same subnet (local only) - Complex
  • 37. IBM SOA IBM confidential37 IBM – BCBS High Availability Fast Track HAManager – How- Old Way – WAS V5
  • 38. IBM SOA IBM confidential38 IBM – BCBS High Availability Fast Track HAManager – Configuration Requirements  Enablers – HAManager, HA File System, Lease-based locking Protocol  1. Visible Translog – must be accessible to all members in core group.  2. Platform – Highly Available File System – IBM SAN FS, NAS  - Needs Lease based exclusive locking protocol  - CIFS - Common internal file system  - NFS V4  3. R/W Access Rights - All App Servers must be able to read/write to the logs before recovery can occur.  4. Consequences - if not, locks held by processes on failed node will not be automatically released.  - Tx will not be completed, database potential impaired.  - Peer servers can only recover in-flight tx if database locks are released
  • 39. IBM SOA IBM confidential39 IBM – BCBS High Availability Fast Track  Introduction - Availability, 9’s uptime, Work Patterns, Reference Architecture  HA WebSphere Systems - Products leverage WAS, SPOF’s, WAS Internals - Clustering, HA Topologies, HAManager  Managing HA WebSphere Systems - ITCAM, SOA Security, SOA Management - SLAs  HA Failure Scenarios - Http Servers, JVMs, - Containers - Portlet, Web, EJB - LDAP, Databases, WMQ, ME’s Agenda
  • 40. IBM SOA IBM confidential40 IBM – BCBS High Availability Fast Track Services atomic and composite Operational Systems Service Components Consumers Business Processes process choreography ServiceProviderServiceConsumer SAP Custom Application OO ApplicationISV Custom Apps Platform Supporting Middleware MQ DB2Unix OS/390 Outlook SCA Portlet WSRP B2B Other Integrated Console • Allow for seamless views across different layers of abstraction. Service Management Application Monitoring Resource Monitoring Resource Monitoring Transaction Tracking Integrated Reporting • Generate enterprise- wide service level reporting SOA Management: Solution View Business Process Management
  • 41. IBM SOA IBM confidential41 IBM – BCBS High Availability Fast Track SOA Management: Example 2 – Digging out the CICS Data
  • 42. IBM SOA IBM confidential42 IBM – BCBS High Availability Fast Track Example 1: Reuse: Service Creation: Digging Out the CICS Data
  • 43. IBM SOA IBM confidential43 IBM – BCBS High Availability Fast Track Example 1: Reuse: Service Creation: Digging Out the CICS Data
  • 44. IBM SOA IBM confidential44 IBM – BCBS High Availability Fast Track SOA Management: Example 2 – Logical Architecture
  • 45. IBM SOA IBM confidential45 IBM – BCBS High Availability Fast Track  “Contracts” are established between service requestors and providers, also known as Service Level Agreements  Management focus turns to monitoring for compliance to agreed upon service levels  “Active” management optimizes systems to avoid service violations SOA Management: Service Levels Service Provider Service Service Requestor Quality of Service Capacity Security Performance XML WSDL SOAP Service Level Agreement  # of requests allowed  Acceptable response time  Charge per request
  • 46. IBM SOA IBM confidential46 IBM – BCBS High Availability Fast Track SOA Security: IBM Security Approach - MASS
  • 47. IBM SOA IBM confidential47 IBM – BCBS High Availability Fast Track Custom Application Packaged Application Packaged Application Custom Application consumers business processes process choreography services atomic and composite ServiceConsumerServiceProvider 11 22 33 44 55 OO ApplicationCustom ApplicationOutlook SAP Custom Application business processes process choreography Services (Definitions) atomic and composite Service components ServiceConsumerServiceProvider 11 22 33 44 55 OO Application ISV Custom Apps Platform Operational systems Supporting Middleware MQ DB2Unix OS/390 SOA Security: Encompass all Aspects of Security SOA Security  Identity  Authentication  Authorization  Auditing  Confidentiality, Integrity and Availability  Auditing & Compliance  Administration and Policy Management SCA Portlet WSRP B2B Other
  • 48. IBM SOA IBM confidential48 IBM – BCBS High Availability Fast Track  Introduction - Availability, 9’s uptime, Work Patterns, Reference Architecture  HA WebSphere Systems - Products leverage WAS, SPOF’s, WAS Internals - Clustering, HA Topologies, HAManager  Managing HA WebSphere Systems - ITCAM, SOA Security, SOA Management - SLAs  HA Failure Scenarios - Http Servers, JVMs, - Containers - Portlet, Web, EJB - LDAP, Databases, WMQ, ME’s Agenda
  • 49. IBM SOA IBM confidential49 IBM – BCBS High Availability Fast Track