TSA provides automatic monitoring and availability management of resources configured for high availability in a cluster domain. It monitors DB2 HADR resources and DB2 instance resources, and can start, stop, and fail over these resources between nodes when failures occur. The document provides examples of how DB2 HADR and instance resources are defined and monitored by TSA using the IBM.Application resource type.
z16 zOS Support - March 2023 - SHARE in Atlanta.pdfMarna Walle
This presentation shows how to position z/OS in preparing for the new IBM z16 server. This presentation contains the "must know" information to successfully IPL z/OS on IBM z16.
"Dear Students,
Greetings from www.etraining.guru
We provide BEST online training for IBM DB2 LUW/UDB DBA by a database architect. Our DB2 Trainer comes with a working experience of 11+ years, 9+ years in DB2 and a DB2 certified professional.
DB2 LUW DBA Course Content: http://www.etraining.guru/course/dba/online-training-db2-luw-udb-dba
Course Cost: USD 350 (or) INR 21000
Number of Hours: 30-35 hours
Regards,
Karthik
www.etraining.guru"
DB2 for z/OS - Starter's guide to memory monitoring and controlFlorence Dubois
DB2 for z/OS makes more and more use of REAL memory to improve performance and reduce cost. But if you don't carefully budget and monitor the use of REAL memory on your system, you could be putting your applications at risk. This presentation will go back to the basics and answer the most common questions about REAL memory management including: how does DB2 uses virtual and REAL memory? how to build a budget based on system settings and buffer pool sizes? how to size the LFAREA? what are the key performance indicators and how do I know I am running 'safely'? what can be done to protect the system?
Oracle RAC Virtualized - In VMs, in Containers, On-premises, and in the CloudMarkus Michalewicz
This presentation discusses the support guidelines for using Oracle Real Application Clusters (RAC) in virtualized environments, for which general Oracle Database support guidelines are discussed shortly first.
First presented during DOAG 2021 User Conference, this presentation replaces its predecessor from 2016: https://www.slideshare.net/MarkusMichalewicz/how-to-use-oracle-rac-in-a-cloud-a-support-question
The Top 5 Reasons to Deploy Your Applications on Oracle RACMarkus Michalewicz
A presentation for developers, DBAs, and managers. This presentation was first presented in course of the AIOUG Maximum Availability Architecture (MAA)-focus month August 2021. The first reason might surprise you!
Dear Student,
Greetings from Etraining.guru.
We provide THE BEST DB2 LUW/UDB DBA training in the Industry by THE real time expert. Our Trainer comes with 10+ years of IT working experience on open systems of project team sizes ranging from 2 to 70+.
Course Highlights:
****************
(1) 32+ hours of course content:
http://www.etraining.guru/online-training-db2-luw-udb-dba.aspx
(2) 5 free classes on Unix & Shell scripting of topics of student's interest
(3) 10 hours of Free job support. Beyond these hours, job support is provided for a nominal charge of $10/hour
(4) Course fees: $400 for working professionals; $300 for students/job seekers
(5) Class Timing: Flexible, depending upon student's requirement
(6) Email: info@etraining.guru
Please feel free to let us know if any further details needed.
Thanks,
Bhavana
www.etraining.guru
Email:info@etraining.guru
This presentation discusses the principles of IP Routing as they apply to z/OS, the z/OS implementation of static routing through the TCP/IP profile, and dynamic routing with OMPROUTE.
z16 zOS Support - March 2023 - SHARE in Atlanta.pdfMarna Walle
This presentation shows how to position z/OS in preparing for the new IBM z16 server. This presentation contains the "must know" information to successfully IPL z/OS on IBM z16.
"Dear Students,
Greetings from www.etraining.guru
We provide BEST online training for IBM DB2 LUW/UDB DBA by a database architect. Our DB2 Trainer comes with a working experience of 11+ years, 9+ years in DB2 and a DB2 certified professional.
DB2 LUW DBA Course Content: http://www.etraining.guru/course/dba/online-training-db2-luw-udb-dba
Course Cost: USD 350 (or) INR 21000
Number of Hours: 30-35 hours
Regards,
Karthik
www.etraining.guru"
DB2 for z/OS - Starter's guide to memory monitoring and controlFlorence Dubois
DB2 for z/OS makes more and more use of REAL memory to improve performance and reduce cost. But if you don't carefully budget and monitor the use of REAL memory on your system, you could be putting your applications at risk. This presentation will go back to the basics and answer the most common questions about REAL memory management including: how does DB2 uses virtual and REAL memory? how to build a budget based on system settings and buffer pool sizes? how to size the LFAREA? what are the key performance indicators and how do I know I am running 'safely'? what can be done to protect the system?
Oracle RAC Virtualized - In VMs, in Containers, On-premises, and in the CloudMarkus Michalewicz
This presentation discusses the support guidelines for using Oracle Real Application Clusters (RAC) in virtualized environments, for which general Oracle Database support guidelines are discussed shortly first.
First presented during DOAG 2021 User Conference, this presentation replaces its predecessor from 2016: https://www.slideshare.net/MarkusMichalewicz/how-to-use-oracle-rac-in-a-cloud-a-support-question
The Top 5 Reasons to Deploy Your Applications on Oracle RACMarkus Michalewicz
A presentation for developers, DBAs, and managers. This presentation was first presented in course of the AIOUG Maximum Availability Architecture (MAA)-focus month August 2021. The first reason might surprise you!
Dear Student,
Greetings from Etraining.guru.
We provide THE BEST DB2 LUW/UDB DBA training in the Industry by THE real time expert. Our Trainer comes with 10+ years of IT working experience on open systems of project team sizes ranging from 2 to 70+.
Course Highlights:
****************
(1) 32+ hours of course content:
http://www.etraining.guru/online-training-db2-luw-udb-dba.aspx
(2) 5 free classes on Unix & Shell scripting of topics of student's interest
(3) 10 hours of Free job support. Beyond these hours, job support is provided for a nominal charge of $10/hour
(4) Course fees: $400 for working professionals; $300 for students/job seekers
(5) Class Timing: Flexible, depending upon student's requirement
(6) Email: info@etraining.guru
Please feel free to let us know if any further details needed.
Thanks,
Bhavana
www.etraining.guru
Email:info@etraining.guru
This presentation discusses the principles of IP Routing as they apply to z/OS, the z/OS implementation of static routing through the TCP/IP profile, and dynamic routing with OMPROUTE.
SQL Server Reporting Services Disaster Recovery webinarDenny Lee
This is the PASS DW|BI virtual chapter webinar on SQL Server Reporting Services Disaster Recovery with Ayad Shammout and myself - hosted by Julie Koesmarno (@mssqlgirl)
UKOUG Tech15 - Deploying Oracle 12c Cloud Control in Maximum Availability Arc...Zahid Anwar (OCM)
Common Cloud Control deployments can sometimes be exposed to single points of failure. In this presentation we will be discussing these pitfalls and how, through deploying Cloud Control within the Maximum Availability Architecture can provide a robust system. Aimed at a technical audience - we will dive into giving High Availability and Disaster Recovery for the OMS repository and OMS Web Tier through the use of RAC, Web Tier Clustering, Data Guard and Storage Replication. We will take our audience through the simple but effective steps required for this type of deployment in addition to the license implications of using Maximum Availability Architecture including what Oracle give you for free under a restricted-use license. This presentation is based on a recent project completed by our speaker Zahid Anwar. This project saw Zahid provide Maximum Availability Architecture for Cloud Control which was monitoring 6, critical X4-2 Eighth Exadata Machines.
Dear Students,
Greetings from www.etraining.guru
We provide BEST online training for IBM DB2 LUW/UDB DBA in Hyderabad by a database architect. Our DB2 Trainer comes with a working experience of 11+ years, 9+ years in DB2 and a DB2 certified professional.
DB2 LUW DBA Course Content: http://www.etraining.guru/course/dba/online-training-db2-luw-udb-dba
Course Cost: USD 300 (or) INR 18000
Number of Hours: 30 hours
If there is any opportunity, we will be very happy to serve you. Appreciate if you can explore other training opportunities in our website as well.
We can be reachable at info@etraining.guru (or) 91-996-669-2446 for any further info/details.
Regards,
Karthik
www.etraining.guru"
"Title: Online Training in IBM DB2 LUW/UDB DBA in Hyderabad (www.etraining.guru)
Dear Students,
Greetings from www.etraining.guru
We provide BEST online training for IBM DB2 LUW/UDB DBA in Hyderabad by a database architect. Our DB2 Trainer comes with a working experience of 11+ years, 9+ years in DB2 and a DB2 certified professional.
DB2 LUW DBA Course Content: http://www.etraining.guru/course/dba/online-training-db2-luw-udb-dba
Course Cost: USD 300 (or) INR 18000
Number of Hours: 30 hours
If there is any opportunity, we will be very happy to serve you. Appreciate if you can explore other training opportunities in our website as well.
We can be reachable at info@etraining.guru (or) 91-996-669-2446 for any further info/details.
Regards,
Karthik
www.etraining.guru"
Dear Students,
Greetings from www.etraining.guru
We provide BEST online training for IBM DB2 LUW/UDB DBA in Hyderabad by a database architect. Our DB2 Trainer comes with a working experience of 11+ years, 9+ years in DB2 and a DB2 certified professional.
DB2 LUW DBA Course Content: http://www.etraining.guru/course/dba/online-training-db2-luw-udb-dba
Course Cost: USD 300 (or) INR 18000
Number of Hours: 30 hours
If there is any opportunity, we will be very happy to serve you. Appreciate if you can explore other training opportunities in our website as well.
We can be reachable at info@etraining.guru (or) 91-996-669-2446 for any further info/details.
Regards,
Karthik
www.etraining.guru"
1049: Best and Worst Practices for Deploying IBM Connections - IBM Connect 2016panagenda
Depending on deployment size, operating system and security considerations you have different options to configure IBM Connections. This session show good and bad examples on how to do it from multiple customer deployments. Christoph Stoettner describes things he found and how you can optimize your systems. Main topics include simple (documented) tasks that should be applied, missing documentation, automated user synchronization, TDI solutions and user synchronization, performance tuning, security optimizing and planning Single Sign On for mail, IBM Sametime and SPNEGO. This is valuable information that will help you to be successful in your next IBM Connections deployment project.
A presentation from Christoph Stoettner (panagenda).
Logging is important for troubleshooting a DNS service. Conveniently with BIND 9, almost all problems will show up somewhere in the log output, but only if the logging is enabled and configured correctly.
In this webinar, we’ll discuss the BIND 9 logging configuration and best practices in searching through large log-files to find the entries of interest. In addition, we’ll release log-management tools used by Men & Mice Services.
Setup oracle golden gate 11g replicationKanwar Batra
How to setup Oracle Goldengate Replication between 11gR2 RAC or Single node instances. For RAC setup the GoldenGate custom cluster service . Not part of this document
SQL Server Reporting Services Disaster Recovery WebinarDenny Lee
This is the PASS DW/BI Webinar for SQL Server Reporting Services (SSRS) Disaster Recovery webinar. You can find the video at: http://www.youtube.com/watch?v=gfT9ETyLRlA
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
1. A DBA’s guide to using TSA
Frederik Engelen
RealDolmen
Session Code: C03
16 November 2011 | Platform: LUW
2. Click to edit Master title style
Session Goal
• Quick overview of HADR functionality and advantages
• Description of Tivoli System Automation for Multiplatforms
(TSAMP) software and integration with DB2
• Operational tasks and lessons learned
2
3. Click to edit Master title style
Why HADR?
• Failover solution available from DB2 v8.2
• Design choices:
• Ultra-fast failover capability
• Negligible impact on performance
• Configurable degree of consistency
• Low administrative costs
• Avoid service interruption for fixpack installs and changes
requiring instance restart
• Transparent failover and failback for applications
• Spread over different geographic locations
• Built in clustering software in DB2 9.5
• Cheap
3
4. Click to edit Master title style
Basic Principle HADR
• Database scope
• Allows flexible configurations
• Primary database
• Processes transactions
• Send logs to standby
• Standby database
• Applies transactions recieved from primary
• Available for read-only workload (DB2 v9.7 FP1)
• In case of primary failure, standby database can take over
primary role
4
Primary Standby
Log shipping
Primary connection
5. Click to edit Master title style
HADR Synchronisation Modes
• Configurable degree of consistency
• SYNC, NEARSYNC, ASYNC, SUPERASYNC (NEW)
5
New logs
send() recieve()
Logs
ASYNC NEARSYNC
SYNC
TCP/IP
SUPER
ASYNC
6. Click to edit Master title style
Ready to implement?
• Network layout
• High speed network
• Private or public HADR communication?
• Reserve HADR communication port
• Use identical OS & database version and hardware family
• For automation with TSA, make sure DB2 > v9.5 FP5 or v9.7 FP1
• Use identical file system layout
• Keep clock synchronised
• Recommended:
• Same amount of memory & hardware
6
7. Click to edit Master title style
Implementing
• Configure primary database and backup
• Restore on standby server
• Adjust db cfg on standby database
• Start HADR
• Configure Automatic Client Reroute
7
8. Click to edit Master title style
Configure primary database
• Database configuration
db2 update db cfg for <DBNAME> using LOGARCHMETH1 DISK:/share/logarchs
db2 update db cfg for <DBNAME> using INDEXREC RESTART
db2 update db cfg for <DBNAME> using LOGINDEXBUILD ON
db2 update db cfg for <DBNAME> using HADR_TIMEOUT 30
db2 update db cfg for <DBNAME> using HADR_PEER_WINDOW 120
db2 update db cfg for <DBNAME> using HADR_LOCAL_HOST <PRIMARY_HOST>
db2 update db cfg for <DBNAME> using HADR_LOCAL_SVC <PRIMARY_HADR_PORT>
db2 update db cfg for <DBNAME> using HADR_REMOTE_HOST <STANDBY_HOST>
db2 update db cfg for <DBNAME> using HADR_REMOTE_SVC <STANDBY_HADR_PORT>
db2 update db cfg for <DBNAME> using HADR_REMOTE_INST <STANDBY_INST>
db2 update db cfg for <DBNAME> using HADR_SYNCMODE NEARSYNC
• Take a backup
8
9. Click to edit Master title style
Configure standby server
• Restore backup
• Keep name
• Database configuration
db2 update db cfg for <DBNAME> using HADR_LOCAL_HOST <STANDBY_HOST>
db2 update db cfg for <DBNAME> using HADR_LOCAL_SVC <STANDBY_HADR_PORT>
db2 update db cfg for <DBNAME> using HADR_REMOTE_HOST <PRIMARY_HOST>
db2 update db cfg for <DBNAME> using HADR_REMOTE_SVC <PRIMARY_HADR_PORT>
db2 update db cfg for <DBNAME> using HADR_REMOTE_INST <PRIMARY_INST>
9
10. Click to edit Master title style
Start HADR
• Standby
db2 update hadr on db <DBNAME> as standby
• Replay local log files
• Primary
db2 start hadr on db <DBNAME> as primary
• Replay primary archived logs
• Replay primary active logs
• Replay in-memory log buffer
-> Peer state
10
HADR start
Local Catchup
Remote Catch-
up Pending
Catch-up
Pending
Peer
Disconnected
Peer
Connectionlost
Connect
Disconnected
Reconnected
Peerwindowexpires
11. Click to edit Master title style
HADR result
• db2pd –hadr –db SAMPLE
Database Partition 0 -- Database SAMPLE -- Active -- Up 0 days 00:16:25 -- Date
08/21/2011 08:28:39
HADR Information:
Role State SyncMode HeartBeatsMissed LogGapRunAvg (bytes)
Primary Peer Nearsync 0 0
ConnectStatus ConnectTime Timeout
Connected Sun Aug 21 08:13:29 2011 (1313928809) 120
PeerWindowEnd PeerWindow
Sun Aug 21 08:30:39 2011 (1313929839) 120
LocalHost LocalService
batman 60000
RemoteHost RemoteService RemoteInstance
robin 60000 db2insts
PrimaryFile PrimaryPg PrimaryLSN
S0000005.LOG 0 0x00000000032C8010
StandByFile StandByPg StandByLSN
S0000005.LOG 0 0x00000000032C8010
11
12. Click to edit Master title style
Basic Operations
• Takeover
db2 takeover hadr on db <dbname> [by force [peer window
only]]
• Stop database
db2 deactivate db <dbname> && db2stop
“stop hadr“ will turn db to standalone db
• Actually, that’s about it…
12
13. Click to edit Master title style
Automatic Client Reroute
• DB2 client reroutes connection to alternative server in
case of communication error
db2 update alternate server for <dbname> using hostname <host>
port <svcename>
• On connection failure:
SQL30108N A connection failed but has been reestablished
• HADR is not required
• JDBC URL
jdbc:db2://<dbhost>:<port>/<db>:clientRerouteAlternateServerNam
e=<althost>;clientRerouteAlternatePortNumber=<altport>;
• Configure on client and server
13
14. Click to edit Master title style
Things to keep in mind
• What gets replicated? Everything that’s logged.
• What not?
• DB CFG
• History file
• Not Logged Initially
• LOB columns that are not logged (>1G)
• Common operations
• LOAD: COPY YES
• REORG: consider reorg online or deactivate standby to avoid
blocking primary
• Tuning
• HADR simulator
14
15. Click to edit Master title style
Archival Logs
• Only primary server archives logs
• Store on shared file system
• Faster catch-up process
• Avoid manually copying logs when restore on standby
• Attention when using AUTO_DEL_REC_OBJ
15
16. Click to edit Master title style
Blocknonlogged
• Introduced in DB2 v9.5 FP4 / v9.7
• Blocks non-logged operations
• Not logged initially
• Not logged colums (BLOB and CLOB > 1G)
• Simple measure to protect from human error or an
unsuspecting application
16
17. Click to edit Master title style
Monitoring
• db2pd -hadr
• Database snapshot
• Health Monitor
• db2diag.log – check for messages from d2hadrp and
db2hadrs
db2insts@robin:~> db2diag -g "eduname:=db2hadr" -fmt "%eduname @msg
@changeevent“
db2hadrs Initiating HADR takeover request, forcing applications.
db2hadrs Info: Standby has initiated a takeover.
db2hadrs Info: Standby switching roles to primary.
db2hadrs HADR role set to Primary (was Standby)
db2hadrp HADR state set to P-Peer (was S-Peer)
db2hadrp Info: Standby has completed takeover (now primary).
db2hadrp Info: Standby Finished.
db2hadrp Info: Primary Started.
db2hadrp Info: HADR Socket send buffer size, SO_SNDBUF: 16384 bytes
db2hadrp Info: HADR Socket receive buffer size, SO_RCVBUF: 87380 bytes
17
18. Click to edit Master title style
Where does TSA fit in?
• TSA provides automatic monitoring and availability
management of “resources”
• TSA helps you when one of the cluster components fails
• But…TSA will also be there when you are
• Performing maintenance activities
• Troubleshooting
-> Basic knowledge of TSA and how it interacts with DB2 is
essential to succesfully perform your tasks as a DBA
18
19. Click to edit Master title style
Goal
• Components and terminology
• TSA domain configuration
• Operations
• Monitoring & troubleshooting
19
20. Click to edit Master title style
RSCT/TSA Terminology
• Cluster domain
• Node
• Resource
• Resource group
• Quorum
• Tiebreaker
• Equivalency
• Let’s see that in more detail…
20
21. Click to edit Master title style
Domain/Node
• Group of server nodes hosting resources configured for
high availability with RSCT
• Each node runs RSCT subsystems/daemons
• preprpnode, lsrpnode, addrpnode, rmrpnode,
stoprpnode, startrpnode
• mkrpdomain, lsrpdomain, rmrpdomain,
startrpdomain, stoprpdomain
21
22. Click to edit Master title style
Domain – RSCT components
# lssrc -a
Subsystem Group PID Status
IBM.ConfigRM rsct_rm 3610 active
ctcas rsct 3891 active
cthats cthats 5457 active
cthags cthags 5505 active
ctrmc rsct 5625 active
IBM.GblResRM rsct_rm 5792 active
IBM.RecoveryRM rsct_rm 5793 active
IBM.StorageRM rsct_rm 5794 active
IBM.TestRM rsct_rm 5795 active
22
23. Click to edit Master title style
Domain – RSCT components
• CTHATS: Topology Services
• Node connectivity (heartbeat)
• Messaging
• CTHAGS: Groups Services
• Synchronised configuration
• Messagging
• CTRMC: Resource Monitoring & Control
• Backbone of RSCT
• Monitoring & control infrastructure across cluster
• CTCAS: Cluster Authentication Services
• IBM.ConfigRM: Configuration Resource Manager
• Management application
• Create and administer a domain
23
24. Click to edit Master title style
Resource
• Any piece of hard- or software (configuration) that
provides services
• HADR role, DB2 instance, virtual IP, file system, tiebreaker,…
• Instance of Resource Class
• Most common IBM.Application (start/stop/monitor scriptsà)
• Provided by resource managers
• GlbResRM, TestRM, StorageRM, RecoveryRM
• Fixed/floating/concurrent
• Not directly controllable
• lsrsrc, rmrsrc, chrsrc, resetrsrc,
lsrsrcdef
24
25. Click to edit Master title style
IBM.Application class
Example of a DB2 HADR Resource
# lsrsrc –s “Name = ‘db2_db2instp_db2insts_SAMPLE-rs’ ” -Ab IBM.Application
resource 1:
Name = "db2_db2instp_db2insts_SAMPLE-rs"
ResourceType = 0 (Constituent)
AggregateResource = "0x2028 0xffff 0x5e5ca445 0xb9d31ef8 0x9239e701
0x23ba7500"
StartCommand = "/usr/sbin/rsct/sapolicies/db2/hadrV97_start.ksh
db2instp db2insts SAMPLE"
StopCommand = "/usr/sbin/rsct/sapolicies/db2/hadrV97_stop.ksh
db2instp db2insts SAMPLE"
MonitorCommand = "/usr/sbin/rsct/sapolicies/db2/hadrV97_monitor.ksh
db2instp db2insts SAMPLE"
MonitorCommandPeriod = 21
MonitorCommandTimeout = 29
StartCommandTimeout = 330
StopCommandTimeout = 140
ActivePeerDomain = "gotham"
NodeNameList = {"robin"}
OpState = 2 (Offline on this node)
25
26. Click to edit Master title style
IBM.Application class
Example of a DB2 HADR Resource
# lsrsrc –s “Name = ‘db2_db2instp_db2insts_SAMPLE-rs’ ” -Ab IBM.Application
resource 2:
Name = "db2_db2instp_db2insts_SAMPLE-rs"
ResourceType = 0 (Constituent)
AggregateResource = "0x2028 0xffff 0x5e5ca445 0xb9d31ef8 0x9239e701
0x23ba7500"
StartCommand = "/usr/sbin/rsct/sapolicies/db2/hadrV97_start.ksh
db2instp db2insts SAMPLE"
StopCommand = "/usr/sbin/rsct/sapolicies/db2/hadrV97_stop.ksh
db2instp db2insts SAMPLE"
MonitorCommand = "/usr/sbin/rsct/sapolicies/db2/hadrV97_monitor.ksh
db2instp db2insts SAMPLE"
MonitorCommandPeriod = 21
MonitorCommandTimeout = 29
StartCommandTimeout = 330
StopCommandTimeout = 140
ActivePeerDomain = "gotham"
NodeNameList = {"batman"}
OpState = 1 (Online on this node)
26
30. Click to edit Master title style
IBM.ServiceIP class
Example of a HADR Virtual IP
# lsrsrc -Ab IBM.ServiceIP
resource 1:
Name = "db2ip_192_168_13_132-rs"
ResourceType = 0 (Constituent)
AggregateResource = "0x2029 0xffff 0x5e5ca445 0xb9d31ef8 0x9239e864
0x5d11b368"
IPAddress = "192.168.13.132"
NetMask = "255.255.255.0"
ProtectionMode = 1
NetPrefix = 0
ActivePeerDomain = "gotham"
NodeNameList = {"robin"}
OpState = 2 (Offline on this node)
ConfigChanged = 0
ChangedAttributes = {}
30
31. Click to edit Master title style
IBM.ServiceIP class
Example of a HADR Virtual IP
# lsrsrc -Ab IBM.ServiceIP
resource 2:
Name = "db2ip_192_168_13_132-rs"
ResourceType = 0 (Constituent)
AggregateResource = "0x2029 0xffff 0x5e5ca445 0xb9d31ef8 0x9239e864
0x5d11b368"
IPAddress = "192.168.13.132"
NetMask = "255.255.255.0"
ProtectionMode = 1
NetPrefix = 0
ActivePeerDomain = "gotham"
NodeNameList = {"batman"}
OpState = 1 (Online on this node)
ConfigChanged = 0
ChangedAttributes = {}
31
32. Click to edit Master title style
IBM.ServiceIP class
Example of a HADR Virtual IP
# lsrsrc -Ab IBM.ServiceIP
resource 3:
Name = "db2ip_192_168_13_132-rs"
ResourceType = 1 (Aggregate)
AggregateResource = "0x3fff 0xffff 0x00000000 0x00000000 0x00000000
0x00000000"
IPAddress = "192.168.13.132"
NetMask = "255.255.255.0"
ProtectionMode = 1
NetPrefix = 0
ActivePeerDomain = "gotham"
NodeNameList = {"batman","robin"}
OpState = 1 (Online globally)
ConfigChanged = 0
ChangedAttributes = {}
32
33. Click to edit Master title style
Resource Managers
• IBM.GlbResRM
• Provides the IBM.Application and IBM.ServiceIP resource class
• IBM.RecoveryRM
• Brain of TSA, decision engine
• Gathers monitoring info and decides on action to take
• IBM.StorageRM
• Montoring and control of storage resources
• IBM.TestRM
• Provides IBM.Test resource class
• Simulates automation scenarios
• RM logfiles: /var/ct/${domain}/log/mc/IBM.${daemon}RM/
33
34. Click to edit Master title style
Resource Managers
# lssrc -ls IBM.RecoveryRM
Subsystem : IBM.RecoveryRM
Cluster Name : gotham
Node Number : 2
Daemon start time : 09/04/11 06:55:01
Daemon State:
My Node Name : robin
Master Node Name : batman (node number = 1)
Our IVN : 3.2.1.1
Our AVN : 3.2.1.1
Total Node Count : 2
Joined Member Count : 2
Config Quorum Count : 2
Startup Quorum Count : 1
Operational Quorum State: HAS_QUORUM
In Config Quorum : TRUE
In Config State : TRUE
In Jeopardy : FALSE
34
35. Click to edit Master title style
Equivalency
• Set of resources that provide the same functionality
• Main resource classes:
• IBM.PeerNode
• IBM.NetworkInterface
• lsequ, mkequ, rmequ, chequ
35
36. Click to edit Master title style
Equivalency example
HADR nodes
# lsequ -s "Name='db2_db2instp_db2insts_SAMPLE-rg_group-equ'"
Equivalency 1:
Name = db2_db2instp_db2insts_SAMPLE-rg_group-equ
MemberClass = IBM.PeerNode
Resource:Node[Membership] = {batman:batman,robin:robin}
SelectString = ""
SelectFromPolicy = ANY
MinimumNecessary = 1
Subscription = {}
Color = 0
ActivePeerDomain = gotham
ConfigValidity =
36
37. Click to edit Master title style
Equivalency example
Virtual IP
# lsequ -s "Name='db2_public_network_0'"
Equivalency 1:
Name = db2_public_network_0
MemberClass = IBM.NetworkInterface
Resource:Node[Membership] = {eth0:batman,eth0:robin}
SelectString = ""
SelectFromPolicy = ANY
MinimumNecessary = 1
Subscription = {}
Color = 0
ActivePeerDomain = gotham
ConfigValidity =
37
38. Click to edit Master title style
Resource Group
• Groups resources (surprise…)
• All included resources share the same Nominal State
• MemberLocation attribute controls placement on nodes
• Controllable operational state
• Resides on node/equivalency
38
40. Click to edit Master title style
Resource Group example
HADR DB
# lsrg -s "Name='db2_db2instp_db2insts_SAMPLE-rg'"
Resource Group 1:
Name = db2_db2instp_db2insts_SAMPLE-rg
MemberLocation = Collocated
Priority = 0
AllowedNode = db2_db2instp_db2insts_SAMPLE-rg_group-equ
NominalState = Online
ActivePeerDomain = gotham
OpState = Online
TopGroup = db2_db2instp_db2insts_SAMPLE-rg
TopGroupNominalState = Online
40
41. Click to edit Master title style
Resource Group example
HADR DB
# lsrg -m -g db2_db2instp_db2insts_SAMPLE-rg
Member Resource 1:
Class:Resource:Node[ManagedResource] =
IBM.Application:db2_db2instp_db2insts_SAMPLE-rs
Mandatory = True
MemberOf = db2_db2instp_db2insts_SAMPLE-rg
SelectFromPolicy = ORDERED
ActivePeerDomain = gotham
OpState = Online
Member Resource 2:
Class:Resource:Node[ManagedResource] = IBM.ServiceIP:db2ip_192_168_13_132-rs
Mandatory = True
MemberOf = db2_db2instp_db2insts_SAMPLE-rg
SelectFromPolicy = ORDERED
ActivePeerDomain = gotham
ConfigValidity =
OpState = Online
41
42. Click to edit Master title style
Relationship
• Configured between resources, resource groups and
equivalencies (1-N)
• Mainly DependsOn type
• StartAfter, ForcedDownBy, …
• lsrel, mkrel, chrel, rmrel
42
43. Click to edit Master title style
Relationship example
DB2 instance -> public network
# lsrel -s "Name = 'db2_db2instp_batman_0-rs_DependsOn_db2_public_network_0-
rel'"
Name = db2_db2instp_batman_0-
rs_DependsOn_db2_public_network_0-rel
Class:Resource:Node[Source] = IBM.Application:db2_db2instp_batman_0-rs
Class:Resource:Node[Target] = {IBM.Equivalency:db2_public_network_0}
Relationship = DependsOn
Conditional = NoCondition
ConfigValidity =
ResourceGroup[Source] = db2_db2instp_batman_0-rg
43
44. Click to edit Master title style
Quorum
• Quorum: the number of nodes in a cluster that are
required to control the resources (majority)
• Example log entry
ConfigRM[3610]: (Recorded using libct_ffdc.a cv 2):::Error ID: :::Reference
ID: :::Template ID: 0:::Details File: :::Location:
RSCT,PeerDomain.C,1.99.22.19,18004 :::CONFIGRM_PENDINGQUORUM_ER
The operational quorum state of the active peer domain has changed to
PENDING_QUORUM. This state usually indicates that exactly half of the nodes
that are defined in the peer domain are online. In this state cluster
resources cannot be recovered although none will be stopped explicitly.
• Majority with only 2 HADR nodes? Tiebreaker
44
45. Click to edit Master title style
Tiebreaker
• Determine quorum when half of the nodes survives
• Network Tiebreaker
• Only supported type out of the box
• A pingable IP address (default gateway is a good choice)
• Disk Tiebreaker
• Possible to add manually
• SCSI Reservation, faster
• More secure
• Without quorum (+ tiebreaker), no failover will occur !
45
46. Click to edit Master title style
Communication TSA/DB2
• DB2->TSA
• API implemented by cluster manager
• CLUSTER_MGR = TSA
• TSA->DB2
• HA Policy scripts in /usr/sbin/rsct/sapolicies
# ls /usr/sbin/rsct/sapolicies/db2/
db2V97_monitor|start|stop.ksh
hadrV97_monitor|start|stop.ksh
mountV97_monitor|start|stop.ksh
nfsserverctrl-server
forceAllApps
rovingV97_failover.ksh
46
47. Click to edit Master title style
Policy script example
batman:~ # head -n 17 /usr/sbin/rsct/sapolicies/db2/hadrV97_monitor.ksh
#!/bin/ksh -p
#-----------------------------------------------------------------------
# (C) COPYRIGHT International Business Machines Corp. 2001-2009
# All Rights Reserved
#
# US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
#
# VERSION: db2_v97fp4 s110330
# INPUT: hadrV97_monitor.ksh db2instp db2insts hadrdb [verbose]
#
# OUTPUT: 1 if online, 2 if offline, 0 if not known
#
# NOTES: Can only be used in the context of a TSA environment configured
# with the db2 HA Integrated Configuration Utility (db2haicu)
#
#-----------------------------------------------------------------------
47
48. Click to edit Master title style
Configuring for HA
• Install TSA + DB2 (lots of fixes -> latest fixpack)
• Who would’ve guessed…
• Configure HADR (Peer State)
• We already know how to do that
• Disable db2fm
• Prepare nodes
• #preprpnode batman robin
• Run DB2 High Availability Instance Configuration Utility
(db2haicu)
• Interactively
• XML
48
50. Click to edit Master title style
DB2HAICU XML File example (continued)
<PhysicalNetwork physicalNetworkName="db2_private_network_0“
physicalNetworkProtocol="ip">
<Interface interfaceName="eth1" clusterNodeName="batman">
<IPAddress baseAddress="192.168.102.2"
subnetMask="255.255.255.0"
networkName="db2_private_network_0"/>
</Interface>
<Interface interfaceName="eth1" clusterNodeName="robin">
<IPAddress baseAddress="192.168.102.3"
subnetMask="255.255.255.0”
networkName="db2_private_network_0"/>
</Interface>
</PhysicalNetwork>
<ClusterNode clusterNodeName="batman"/>
<ClusterNode clusterNodeName="robin“/>
</ClusterDomain>
50
51. Click to edit Master title style
DB2HAICU XML File example (continued)
<FailoverPolicy>
<HADRFailover></HADRFailover>
</FailoverPolicy>
<DB2PartitionSet>
<DB2Partition dbpartitionnum="0" instanceName="db2instp"/>
</DB2PartitionSet>
<HADRDBSet>
<HADRDB databaseName="SAMPLE" localInstance="db2instp"
remoteInstance="db2insts" localHost ="batman“ remoteHost="robin" />
<VirtualIPAddress baseAddress="192.168.13.132“
subnetMask="255.255.255.0"
networkName="db2_public_network_0"/>
</HADRDBSet>
</DB2Cluster>
51
52. Click to edit Master title style
DB2HAICU
• Start on standby instance
• Don’t mix primary/standby databases in the same instance
$db2haicu –f ha_cfg.xml
Repeat for primary instance
Done
52
53. Click to edit Master title style
DB2HAICU standby output
db2haicu determined the current DB2 database manager instance is db2insts. The cluster
configuration that follows will apply to this instance.
Creating domain gotham in the cluster ...
Creating domain gotham in the cluster was successful.
Configuring quorum device for domain gotham ...
Configuring quorum device for domain gotham was successful.
Adding network interface card eth0 on cluster node batman to the network db2_public_network_0 ...
Adding network interface card eth0 on cluster node batman to the network db2_public_network_0 was
successful.
Adding network interface card eth0 on cluster node robin to the network db2_public_network_0 ...
Adding network interface card eth0 on cluster node robin to the network db2_public_network_0 was
successful.
Adding network interface card eth1 on cluster node batman to the network db2_private_network_0 ...
Adding network interface card eth1 on cluster node batman to the network db2_private_network_0 was
successful.
Adding network interface card eth1 on cluster node robin to the network db2_private_network_0 ...
Adding network interface card eth1 on cluster node robin to the network db2_private_network_0 was
successful.
Adding DB2 database partition 0 to the cluster ...
Adding DB2 database partition 0 to the cluster was successful.
The HADR database SAMPLE has been determined to be valid for high availability. However, the
database cannot be added to the cluster from this node because db2haicu detected this node is the
standby for the HADR database SAMPLE. Run db2haicu on the primary for the HADR database SAMPLE to
configure the database for automated failover.
All cluster configurations have been completed successfully. db2haicu exiting ...
53
54. Click to edit Master title style
DB2HAICU primary output
db2haicu determined the current DB2 database manager instance is db2instp. The cluster
configuration that follows will apply to this instance.
Configuring quorum device for domain gotham ...
Configuring quorum device for domain gotham was successful.
The network adapter eth0 on node batman is already defined in network db2_public_network_0 and
cannot be added to another network until it is removed from its current network.
The network adapter eth0 on node robin is already defined in network db2_public_network_0 and
cannot be added to another network until it is removed from its current network.
The network adapter eth1 on node batman is already defined in network db2_private_network_0 and
cannot be added to another network until it is removed from its current network.
The network adapter eth1 on node robin is already defined in network db2_private_network_0 and
cannot be added to another network until it is removed from its current network.
Adding DB2 database partition 0 to the cluster ...
Adding DB2 database partition 0 to the cluster was successful.
Adding HADR database SAMPLE to the domain ...
Adding HADR database SAMPLE to the domain was successful.
All cluster configurations have been completed successfully. db2haicu exiting ...
54
55. Click to edit Master title style
Cluster domain overview
55
PeerNode: BATMAN PeerNode: ROBIN
Equivalency: db2_db2instp_db2insts_SAMPLE-rg_group-equ
Resource Group: db2_db2instp_db2insts_SAMPLE-rg
Floating Resource:
db2_db2instp_db2insts_SAMPLE-rs
HADR Primary HADR Secondary
Floating Resource:
db2ip_192_168_13_132-rs
Virtual IP Virtual IP
db2_db2instp_batman_0-
rg_group-equ
db2_db2instp_
batman-rg
db2_db2insts_robin_0-
rg_group-equ
db2_db2insts_
robin-rg
Resource:
db2_db2instp_
batman-rs
Resource:
db2_db2insts_
robin-rs
Private Network
Equivalency: db2_public_network_0 eth0eth0
Equivalency: db2_private_network_0 eth1eth1
Public Network
Relation:
db2_db2instp_batman_0-rs_
DependsOn_
db2_public_network_0-rel
Relation:
db2_db2insts_robin_0-rs_
DependsOn_
db2_public_network_0-rel
58. Click to edit Master title style
Operations
• Use regular commands whenever possible:
• Takeover, db2stop, db2start,…
• Check the result on the cluster domain
• TSA commands still work
• rgreq –o move <HADR RG>
• chrg -o offline <DB2 Instance RG>
• Takeover example
• $ db2 takeover hadr on db <DBNAME>
58
72. Click to edit Master title style
Samdiag output
DB2 instance
# samdiag -g db2_db2instp_batman_0-rg
Displaying information for the following:
Resource Group "db2_db2instp_batman_0-rg":
Diagnosis::Resource: db2_db2instp_batman_0-rg/ResGroup/IBM.ResourceGroup
type: CHARM Resource Group
Status -
Observed: Offline - SoftDown
Desired: Offline - Requested Offline
(Nominal: Offline - Nominal State: Offline)
Automation: Idle - CharmBase trigger linked
Startable: Yes - Resource is startable
Binding: Unbound - Unbound
Compound: Satisfactory - Satisfactory
Move: Not_Supported - Resource Move State is Not Supported
72
73. Click to edit Master title style
Samdiag output
DB2 instance (continued)
Resource Based Quorum: In Quorum - Resource has Quorum
Members and Memberships:
+---HasMember ---> db2_db2instp_batman_0-
rs/Float/IBM.Application
Group Constraint: Collocated
Binding Constraints:
Flags:
None
Orders:
Outstanding Order: None - Resource is Unavailable
Dependencies:
Start: Satisfied
+---InCluster ---> Cluster
Stop: Satisfied
Binding exceptions:
There are unbound members.
Static Relationships:
+---InCluster ---> Cluster
Dynamic Relationships:"
73
74. Click to edit Master title style
Tips &tricks
• Use TSA to improve availability of standalone databases
• Replace db2fm
• Use HADR failover without specifying HADR database
• Instance start-script will also activate databases
• Keep the # of databases / instance low
• Rebuild the domain quickly with sampolicy
74
75. Click to edit Master title style
Library & References
• DB2 Information Center
• TSAMP documentation
• http://publib.boulder.ibm.com/tividd/td/IBMTivoliSystemAutomationforMultiplatforms3.1.html
• Automated Cluster Controlled HADR Configuration Setup
using the IBM DB2 High Availability Instance Utility
• ftp://ftp.software.ibm.com/software/data/pubs/papers/HADR_db2haicu.pdf
• Automating DB2 HADR Failover on Linux using Tivoli System
Automation for Multiplatforms
• ftp://ftp.software.ibm.com/software/data/pubs/papers/hadr_tsa.pdf
• DB2 system topology and configuration for automated multi-site HA
and DR
• http://public.dhe.ibm.com/software/dw/data/dm-1002automateddb2hadr/Automated_long-
distance_HADR_configurations.pdf
75
76. Click to edit Master title style
Library & References
• Integrating Tivoli System Automation for Multiplatforms” (TSAMP)
with DB2 High Availability & Disaster Recovery (HADR) v9.5 (Garreth
Holl)
• http://www-01.ibm.com/support/docview.wss?uid=swg27016774&aid=1
• High Availability and Disaster Recovery Options for DB2 on Linux,
UNIX, and Windows
• www.redbooks.ibm.com/redbooks/pdfs/sg247363.pdf
• DB2 Best Practices – HADR
• http://www.ibm.com/developerworks/data/bestpractices/
76
77. Click to edit Master title style 77
Thank you for attending.
Any questions?
Editor's Notes
Resetrsrc will force resource offline; used in primary hadr failure
Lsrsrcdef shows the persisten atttributes for a specific resource class (ex. IBM.Application)
A network equivalency is required even if there is only one NIC per node, or a bonded definition exists to appear as a single NIC per node. Each equivalency represents the IP address(es) usable by DB2 on that node for communications. The equivalency is mapped to a virtual IP address used for external communications to DB2.
MemberLocation: Collocated, None
Rgmbrreq allows actions on single resources
Unplugging network cable will trigger database shutdown
SuspendedPropageted: lock request is propagated to resource
Local instance failure
Shows hadr and instance resource are not linked