Looking at RAC, GI/Clusterware Diagnostic Tools

Leighton L. Nelson
Oracle DBA Team Lead (10 yrs experience, 6 years with RAC)
RAC SIG US Events Chair and IOUG Liaison



Session# 373
Clusterware & RAC is Complex!
Where do I begin?
Clusterware, ASM & RAC Diagnostics

•   Diagcollection

•   Cluster Verification Utility (cluvfy)

•   Cluster Health Monitor (CHM)

•   Remote Diagnostic Agent (RDA)

•   ADRCI/Support Workbench

•   OS Utilities
Diagcollection
•   Gathers and packages Clusterware logs, traces plus OS logs and core files*

•   $ORA_CRS_HOME/bin/diagcollection.pl --collect --crshome
    $ORA_CRS_HOME (10gR2)

•   $GRID_HOME/bin/diagcollection.pl --collect [--crs] [--core] [--all] (11gR2)

•   Logs can be filtered by date/time with --adr --beforetime --aftertime

•   Allocate enough space in current directory for diagnostic files
•   Needs to be run on every node in the cluster
•   Only limited information is collected if not run as root
•   In 11.2, diagcollection was enhanced to collect ADR and CHM data
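
The points above can be turned into a small pre-flight wrapper. This is a sketch only: the function names, the 1 GB free-space threshold and the dry-run approach (it prints the command rather than running it) are assumptions for illustration, and the only diagcollection options used are the ones named on this slide.

```shell
#!/bin/sh
# Sketch: pre-flight checks before running diagcollection on one node.
# Function names and the free-space threshold are illustrative assumptions.

# Print free space (in KB) of the filesystem holding directory $1.
free_kb() {
    df -Pk "$1" | awk 'NR == 2 { print $4 }'
}

# Print the 11gR2-style command line for a given Grid home and scope.
# Scope is one of the options listed above: --crs, --core or --all.
diag_cmd() {
    grid_home=$1
    scope=$2
    case "$scope" in
        --crs|--core|--all) ;;
        *) echo "unsupported scope: $scope" >&2; return 1 ;;
    esac
    echo "$grid_home/bin/diagcollection.pl --collect $scope"
}

# Warn when not root, since less is collected in that case.
[ "$(id -u)" -eq 0 ] || echo "warning: not root, collection will be limited" >&2

# Archives are written to the current directory, so check it has room.
need_kb=1048576   # assumed threshold (1 GB); size this for your own logs
if [ "$(free_kb .)" -lt "$need_kb" ]; then
    echo "warning: less than ${need_kb} KB free in $(pwd)" >&2
fi

# Dry run: show the command to execute on every node of the cluster.
diag_cmd "${GRID_HOME:-/u01/app/11.2.0/grid}" --all
```

Printing the command instead of executing it keeps the sketch safe to try on a machine without Grid Infrastructure; replace the final line with a real invocation once the checks pass.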
diagcollection example
[root@oelgrid02 u02]# /u01/app/11.2.0/grid/bin/diagcollection.sh --collect

Production Copyright 2004, 2010, Oracle.   All rights reserved

Cluster Ready Services (CRS) diagnostic collection tool

The following CRS diagnostic archives will be created in the local directory:

crsData_oelgrid02_20120225_1723.tar.gz -> logs, traces and cores from CRS home.
    Note: core files will be packaged only with the --core option.

ocrData_oelgrid02_20120225_1723.tar.gz -> ocrdump, ocrcheck etc

coreData_oelgrid02_20120225_1723.tar.gz -> contents of CRS core files in text format

osData_oelgrid02_20120225_1723.tar.gz -> logs from Operating System

Collecting crs data
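
When archives from several nodes end up in one upload directory, it can help to split the file names back into their parts. The sketch below assumes the `<kind>_<host>_<YYYYMMDD>_<HHMM>.tar.gz` pattern visible in the output above; `parse_archive` is a hypothetical helper, not part of diagcollection.

```shell
#!/bin/sh
# Sketch: split a diagcollection archive name into its parts.
# Assumes the <kind>_<host>_<YYYYMMDD>_<HHMM>.tar.gz pattern seen above.

parse_archive() {
    base=${1##*/}            # drop any leading directory
    base=${base%.tar.gz}     # drop the extension
    hhmm=${base##*_}         # last field: HHMM
    base=${base%_*}
    ymd=${base##*_}          # next-to-last field: YYYYMMDD
    base=${base%_*}
    kind=${base%%_*}         # first field, e.g. crsData
    host=${base#*_}          # whatever is left is the node name
    echo "$kind $host $ymd $hhmm"
}

parse_archive crsData_oelgrid02_20120225_1723.tar.gz
parse_archive /u02/osData_oelgrid02_20120225_1723.tar.gz
```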
Cluster Verification Utility

•   Cluvfy runs in stage mode or component mode

•   Can be executed from the Grid Infrastructure Home in 11gR2 or from
    installation media

•   New resource in 11.2.0.2.0 - ora.cvu

•   “cluvfy comp -list” displays components that can be checked

•   For standalone cluvfy, set CV_HOME, CV_JDKHOME and CV_DESTLOC
Cluster Verification Utility
•   Use stage mode during installation/upgrade
•   Use component mode to diagnose components after
    Clusterware installation
•   Doesn’t diagnose all components e.g. HAIP
•   $GRID_HOME/bin/cluvfy
•   $INSTALL_DISK/runcluvfy.sh

•   New in 11.2.0.3.0: cluvfy comp healthcheck
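
A minimal sketch of choosing between the two cluvfy locations listed above. `pick_cluvfy` is a hypothetical helper, `INSTALL_DISK` is the placeholder name used on this slide, and the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch: choose which cluvfy to call, following the two locations above.
# pick_cluvfy is an illustrative helper, not part of CVU.

pick_cluvfy() {
    grid_home=$1      # Grid Infrastructure home (may be empty before install)
    install_disk=$2   # unpacked installation media
    if [ -n "$grid_home" ] && [ -x "$grid_home/bin/cluvfy" ]; then
        echo "$grid_home/bin/cluvfy"
    elif [ -n "$install_disk" ] && [ -x "$install_disk/runcluvfy.sh" ]; then
        echo "$install_disk/runcluvfy.sh"
    else
        echo "cluvfy not found" >&2
        return 1
    fi
}

# Dry run: print the checks instead of executing them.
if cv=$(pick_cluvfy "${GRID_HOME:-}" "${INSTALL_DISK:-}" 2>/dev/null); then
    echo "$cv comp -list"          # list checkable components
    echo "$cv comp healthcheck"    # 11.2.0.3 and later
else
    echo "set GRID_HOME or INSTALL_DISK first" >&2
fi
```

Preferring the installed home over the media means post-installation checks use the same CVU version as the running Clusterware.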
Cluster Verification Utility

[Screenshot: cluvfy comp -list output]
Cluster Health Monitor (CHM)

•   Cluster Health Monitor (CHM) monitors and collects OS and
    Clusterware metrics in real time

•   Installed by default in 11.2.0.2+

•   Collects metrics at 1 sec interval in 11.2.0.2 and 5 sec interval in
    11.2.0.3

•   Command Line Interface $GRID_HOME/bin/oclumon

•   Collect CHM data using diagcollection.pl --collect --chmos
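
To put the sampling intervals above in perspective, the sketch below works out how many samples a time window holds and builds an oclumon query for it. The helper names are hypothetical, the script only prints the command, and the `dumpnodeview -allnodes -last` syntax is as I understand the 11.2 CLI, so confirm it against the oclumon help on your release.

```shell
#!/bin/sh
# Sketch: how much data a CHM time window holds, and how to ask for it.
# Helper names are illustrative; nothing here contacts a cluster.

# Number of samples per node in a window, given the sampling interval.
chm_samples() {
    interval_s=$1
    window_s=$2
    echo $(( window_s / interval_s ))
}

# Build an oclumon query for the last N minutes (N under 60) on all nodes.
# dumpnodeview, -allnodes and -last are assumed from the 11.2 CLI;
# verify them with the oclumon help before relying on this.
oclumon_last() {
    minutes=$1
    printf '%s/bin/oclumon dumpnodeview -allnodes -last "00:%02d:00"\n' \
        "${GRID_HOME:-/u01/app/11.2.0/grid}" "$minutes"
}

echo "15 min at 1 s (11.2.0.2): $(chm_samples 1 900) samples per node"
echo "15 min at 5 s (11.2.0.3): $(chm_samples 5 900) samples per node"
oclumon_last 15
```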
Cluster Health Monitor (CHM)

•   Useful for root cause analysis of node reboots/hangs, instance
    evictions, performance degradation, etc.
•   The OTN version of CHM and the 11.2.0.2 version are incompatible. If
    you have 11.2.0.2, you cannot install the OTN version.
•   Uses OS API to collect metrics reducing overhead
•   Clusterware resource called ora.crf
•   CHM doesn’t require RAC or Clusterware
OS Watcher Black Box
•   OS Watcher v4.0 has been renamed to OS Watcher Black Box (OSWbb)

•   UNIX shell scripts for monitoring the OS (ps, top, mpstat, iostat, netstat, vmstat)

•   Useful for diagnosing OS resource and performance problems, node reboots

•   Should run on all nodes in a cluster

•   Set up private interconnect monitoring

•   Execute startOSWbb.sh arg1 arg2, where arg1 = collection frequency in seconds
    and arg2 = retention time in hours:
    nohup ./startOSWbb.sh 60 48 &
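
A quick way to sanity-check the two arguments before starting the collector. This is a sketch under assumptions: `oswbb_snapshots` and `oswbb_start_cmd` are hypothetical helpers, the snapshot count is a rough estimate per collector, and the start command is printed rather than executed.

```shell
#!/bin/sh
# Sketch: sanity-check OSWbb arguments before starting the collector.
# Helper names are illustrative; the start command itself is only printed.

# Approximate snapshots kept per collector:
# retention (hours) * 3600 / interval (seconds).
oswbb_snapshots() {
    interval_s=$1
    retention_h=$2
    echo $(( retention_h * 3600 / interval_s ))
}

# Print the start command after checking both arguments are positive integers.
oswbb_start_cmd() {
    for arg in "$1" "$2"; do
        case "$arg" in
            ''|*[!0-9]*|0) echo "bad argument: '$arg'" >&2; return 1 ;;
        esac
    done
    echo "nohup ./startOSWbb.sh $1 $2 &"
}

oswbb_start_cmd 60 48
echo "snapshots retained per collector: $(oswbb_snapshots 60 48)"
```

With the values from the slide (60-second interval, 48 hours retained) this works out to 2880 snapshots per collector, which gives a feel for how much disk the archive directory will need.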
OS Watcher Black Box

•   Bundled with OS Watcher Black Box Analyzer
    (OSWbba)

•   Requires Java 1.4.2 or greater

•   Correlate OS statistics using the analyzer profile

•   Generates graphs and reports for memory, cpu, disk

•   Use CLI option to script profile generation for
    troubleshooting
OS Watcher Black Box
[Figure: OSWbb Free Memory Graph]
RACcheck - RAC Configuration Audit Tool

[Screenshot: RACcheck output]
RACcheck - RAC Configuration Audit Tool


•   Assesses the configuration of RAC, Clusterware and ASM

•   Useful for pre-upgrade and post-upgrade system verification

•   Uses “Best Practices” to report configuration problems –
    PASS/WARNING/FAIL/INFO

•   Generates detailed and summary reports with scorecard
Remote Diagnostic Agent (RDA)

•   The diagnostics tool recommended by MOS

•   Collects a wealth of information based on configuration –
    OS/Clusterware/Database logs

•   Runs AWR/Statspack reports for performance problems

•   Generates reports in HTML format
Procwatcher
•   Debug Oracle & Clusterware processes using
    oradebug short_stack or OS debugger (e.g. gdb,
    pstack)

•   Run as Oracle process owner to debug database or as
    root for clusterware processes

•   Can be deployed as a Clusterware resource

•   Useful for troubleshooting session hangs, severe
    performance problems, instance evictions
Procwatcher
grid@node1[+ASM1]-/u02 >./prw.sh start all

Wed Feb 25 02:30:26 CDT 2012: Starting Procwatcher
Wed Feb 25 02:30:26 CDT 2012: Thank you for using Procwatcher. :-)
Wed Feb 25 02:30:26 CDT 2012: Please add a comment to Oracle Support Note 459694.1
Wed Feb 25 02:30:26 CDT 2012: if you have any comments, suggestions, or issues with this tool.
Wed Feb 25 02:30:26 CDT 2012: Started Procwatcher
ADRCI/Support Workbench

•   Automatic Diagnostic Repository (ADR) stores database
    diagnostic information

•   Package diagnostics files using ADRCI or Support Workbench

•   Manages incidents and problems from alert logs

•   Enterprise Manager provides a GUI to ADR called Support
    Workbench
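
A minimal sketch of the ADRCI side of this workflow: list the ADR homes, show incidents for one home, then package an incident for upload. The helper names, the sample ADR home `diag/rdbms/orcl/orcl1` and the incident number are hypothetical; the script prints the adrci one-liners instead of running them.

```shell
#!/bin/sh
# Sketch: build ADRCI one-liners for a typical incident-packaging flow.
# Helper names and the sample ADR home are illustrative assumptions.

# Show incidents recorded in one ADR home.
adrci_show_incidents() {
    printf 'adrci exec="set homepath %s; show incident"\n' "$1"
}

# Package one incident into a zip file under out_dir.
adrci_pack_cmd() {
    adr_home=$1
    incident=$2
    out_dir=$3
    case "$incident" in
        ''|*[!0-9]*) echo "incident id must be numeric" >&2; return 1 ;;
    esac
    printf 'adrci exec="set homepath %s; ips pack incident %s in %s"\n' \
        "$adr_home" "$incident" "$out_dir"
}

echo 'adrci exec="show homes"'
adrci_show_incidents diag/rdbms/orcl/orcl1
adrci_pack_cmd diag/rdbms/orcl/orcl1 12345 /tmp
```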
RACDIAG.SQL



•   Gathers debug information for RAC Session Hangs

•   One-time data capture

•   Performs hanganalyze dumps

•   Certain types of hangs will prevent it from running
OS Utilities


•   truss/strace – trace system calls and signals

•   pstack – dump stack trace for process

•   pmap/procmap – maps process memory

•   nmon/nmon analyzer – collects and analyzes OS stats

•   collectl/collectl-utils – collects and analyzes OS stats
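
The paired names above (truss/strace, pmap/procmap) are platform alternatives. The sketch below picks the right one from `uname -s`; the platform mapping covers only Linux, Solaris and AIX and is my assumption, the helper names are hypothetical, and the commands are printed rather than run.

```shell
#!/bin/sh
# Sketch: pick the platform's variant of the tools listed above.
# The mapping covers Linux, Solaris (SunOS) and AIX only, as an assumption.

# System call tracer for a platform name as printed by "uname -s".
syscall_tracer() {
    case "$1" in
        Linux)     echo strace ;;
        SunOS|AIX) echo truss ;;
        *)         return 1 ;;
    esac
}

# Process memory mapper for a platform name.
memory_mapper() {
    case "$1" in
        Linux|SunOS) echo pmap ;;
        AIX)         echo procmap ;;
        *)           return 1 ;;
    esac
}

os=$(uname -s)
pid=$$    # stand-in PID; use the PID of the process under investigation

# Dry run: print the commands for this platform.
if tracer=$(syscall_tracer "$os") && mapper=$(memory_mapper "$os"); then
    echo "$tracer -p $pid"
    echo "$mapper $pid"
else
    echo "no mapping for platform $os" >&2
fi
```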
Summary
Tool/Utility     Instance Evictions   Node Reboots   Clusterware Problems   RAC Performance
diagcollection   ✓                    ✓              ✓                      ✗
cluvfy           ✗                    ✗              ✓                      ✗
CHM              ✓                    ✓              ✓                      ✓
OSWbb/OSWbba     ✓                    ✓              ✓                      ✓
RDA              ✓                    ✓              ✓                      ✓
RACcheck         ✓                    ✓              ✓                      ✗
Procwatcher      ✓                    ✗              ✓                      ✓
ADRCI/SW         ✗                    ✗              ✗                      ✓
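
The summary table can also be used as a lookup: given a problem type, which tools apply? The sketch below encodes the table rows as data (y for ✓, n for ✗); the function names and the short problem-type keywords are hypothetical.

```shell
#!/bin/sh
# Sketch: the summary table as a lookup from problem type to tools.
# Columns: tool, instance evictions, node reboots, clusterware, performance.

summary_table() {
    cat <<'EOF'
diagcollection y y y n
cluvfy n n y n
CHM y y y y
OSWbb/OSWbba y y y y
RDA y y y y
RACcheck y y y n
Procwatcher y n y y
ADRCI/SW n n n y
EOF
}

# Print the tools marked as applicable for one problem type.
tools_for() {
    case "$1" in
        evictions)   col=2 ;;
        reboots)     col=3 ;;
        clusterware) col=4 ;;
        performance) col=5 ;;
        *) echo "unknown problem type: $1" >&2; return 1 ;;
    esac
    summary_table | awk -v c="$col" '$c == "y" { print $1 }'
}

tools_for reboots
```

For example, `tools_for performance` lists CHM, OSWbb/OSWbba, RDA, Procwatcher and ADRCI/SW, matching the last column of the table.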
MOS Notes
•   OS Watcher Black Box User Guide [ID 301137.1]

•   OS Watcher Black Box Analyzer User Guide [ID 461053.1]

•   Data Gathering for Troubleshooting Oracle Clusterware (CRS or GI) Issues [ID 289690.1]

•   CRS 10gR2/ 11gR1/ 11gR2 Diagnostic Collection Guide [ID 330358.1]

•   Diagnosability for Oracle Clusterware (CRS or Grid Infrastructure) Component and Resource [ID 357808.1]

•   Data Gathering for Troubleshooting RAC Issues [ID 556679.1]

•   Cluster Health Monitor (CHM) FAQ [ID 1328466.1]

•   Introducing Cluster Health Monitor (IPD/OS) [ID 736752.1]

•   RACcheck - RAC Configuration Audit Tool [ID 1268927.1]

•   Procwatcher: Script to Monitor and Examine Oracle DB and Clusterware Processes [ID 459694.1]

•   Script to Collect RAC Diagnostic Information (racdiag.sql) [ID 135714.1]
Contact Information



•   Website - blogs.griddba.com

•   LinkedIn – Leighton Nelson

•   Twitter - @leight0nn

•   Email: leighton.nelson@mercy.net

More Related Content

What's hot

Understanding PostgreSQL LW Locks
Understanding PostgreSQL LW LocksUnderstanding PostgreSQL LW Locks
Understanding PostgreSQL LW Locks
Jignesh Shah
 

What's hot (20)

A Second Look at Oracle RAC 12c
A Second Look at Oracle RAC 12cA Second Look at Oracle RAC 12c
A Second Look at Oracle RAC 12c
 
End-to-end Troubleshooting Checklist for Microsoft SQL Server
End-to-end Troubleshooting Checklist for Microsoft SQL ServerEnd-to-end Troubleshooting Checklist for Microsoft SQL Server
End-to-end Troubleshooting Checklist for Microsoft SQL Server
 
[NYC Meetup] Docker at Nuxeo
[NYC Meetup] Docker at Nuxeo[NYC Meetup] Docker at Nuxeo
[NYC Meetup] Docker at Nuxeo
 
SQL Monitoring in Oracle Database 12c
SQL Monitoring in Oracle Database 12cSQL Monitoring in Oracle Database 12c
SQL Monitoring in Oracle Database 12c
 
12 Things About WebLogic 12.1.3 #oow2014 #otnla15
12 Things About WebLogic 12.1.3 #oow2014 #otnla1512 Things About WebLogic 12.1.3 #oow2014 #otnla15
12 Things About WebLogic 12.1.3 #oow2014 #otnla15
 
IaC MeetUp Active Directory Setup for Oracle Security LAB
IaC MeetUp Active Directory Setup for Oracle Security LABIaC MeetUp Active Directory Setup for Oracle Security LAB
IaC MeetUp Active Directory Setup for Oracle Security LAB
 
PostgreSQL and Linux Containers
PostgreSQL and Linux ContainersPostgreSQL and Linux Containers
PostgreSQL and Linux Containers
 
Create non-cdb (traditional) oracle database 12c on windows
Create non-cdb (traditional) oracle database 12c on windowsCreate non-cdb (traditional) oracle database 12c on windows
Create non-cdb (traditional) oracle database 12c on windows
 
Performance Tuning Oracle Weblogic Server 12c
Performance Tuning Oracle Weblogic Server 12cPerformance Tuning Oracle Weblogic Server 12c
Performance Tuning Oracle Weblogic Server 12c
 
Oracle Exadata Performance: Latest Improvements and Less Known Features
Oracle Exadata Performance: Latest Improvements and Less Known FeaturesOracle Exadata Performance: Latest Improvements and Less Known Features
Oracle Exadata Performance: Latest Improvements and Less Known Features
 
12 Things about Oracle WebLogic Server 12c
12 Things	 about Oracle WebLogic Server 12c12 Things	 about Oracle WebLogic Server 12c
12 Things about Oracle WebLogic Server 12c
 
Understanding PostgreSQL LW Locks
Understanding PostgreSQL LW LocksUnderstanding PostgreSQL LW Locks
Understanding PostgreSQL LW Locks
 
MySQL Webinar 2/4 Performance tuning, hardware, optimisation
MySQL Webinar 2/4 Performance tuning, hardware, optimisationMySQL Webinar 2/4 Performance tuning, hardware, optimisation
MySQL Webinar 2/4 Performance tuning, hardware, optimisation
 
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...
 
Oracle Unified Directory. Lessons learnt. Is it ready for a move from OID? (O...
Oracle Unified Directory. Lessons learnt. Is it ready for a move from OID? (O...Oracle Unified Directory. Lessons learnt. Is it ready for a move from OID? (O...
Oracle Unified Directory. Lessons learnt. Is it ready for a move from OID? (O...
 
An introduction into Oracle VM V3.x
An introduction into Oracle VM V3.xAn introduction into Oracle VM V3.x
An introduction into Oracle VM V3.x
 
Introduction to Stacki - World's fastest Linux server provisioning Tool
Introduction to Stacki - World's fastest Linux server provisioning ToolIntroduction to Stacki - World's fastest Linux server provisioning Tool
Introduction to Stacki - World's fastest Linux server provisioning Tool
 
WebLogic on ODA - Oracle Open World 2013
WebLogic on ODA - Oracle Open World 2013WebLogic on ODA - Oracle Open World 2013
WebLogic on ODA - Oracle Open World 2013
 
Simplifying EBS 12.2 ADOP - Collaborate 2019
Simplifying EBS 12.2 ADOP - Collaborate 2019   Simplifying EBS 12.2 ADOP - Collaborate 2019
Simplifying EBS 12.2 ADOP - Collaborate 2019
 
PDB Provisioning with Oracle Multitenant Self Service Application
PDB Provisioning with Oracle Multitenant Self Service ApplicationPDB Provisioning with Oracle Multitenant Self Service Application
PDB Provisioning with Oracle Multitenant Self Service Application
 

Viewers also liked

Proyecto dinosaurio
Proyecto dinosaurioProyecto dinosaurio
Proyecto dinosaurio
Arecha
 
Fresh & Easy Neighborhood Market vegetarian & vegan shopping list
Fresh & Easy Neighborhood Market vegetarian & vegan shopping listFresh & Easy Neighborhood Market vegetarian & vegan shopping list
Fresh & Easy Neighborhood Market vegetarian & vegan shopping list
Fresh & Easy Neighborhood Market
 
Saludo de fiestas patrias
Saludo de fiestas patriasSaludo de fiestas patrias
Saludo de fiestas patrias
Julio Mendoza
 
JSMS review
JSMS reviewJSMS review
JSMS review
town27
 
Kun agüero power point
Kun agüero power pointKun agüero power point
Kun agüero power point
Martinito17
 

Viewers also liked (20)

SQL Developer for DBAs
SQL Developer for DBAsSQL Developer for DBAs
SQL Developer for DBAs
 
Operativo
Operativo Operativo
Operativo
 
Proyecto dinosaurio
Proyecto dinosaurioProyecto dinosaurio
Proyecto dinosaurio
 
M. de transporte
M. de transporteM. de transporte
M. de transporte
 
DeltaX Social Facebook Ads Buying Platform
DeltaX Social Facebook Ads Buying PlatformDeltaX Social Facebook Ads Buying Platform
DeltaX Social Facebook Ads Buying Platform
 
Fresh & Easy Neighborhood Market vegetarian & vegan shopping list
Fresh & Easy Neighborhood Market vegetarian & vegan shopping listFresh & Easy Neighborhood Market vegetarian & vegan shopping list
Fresh & Easy Neighborhood Market vegetarian & vegan shopping list
 
dawanda
dawandadawanda
dawanda
 
Cv4356
Cv4356Cv4356
Cv4356
 
BW technologies certificate
BW technologies certificateBW technologies certificate
BW technologies certificate
 
Viroc catalogue
Viroc catalogueViroc catalogue
Viroc catalogue
 
FORMACIÓN DE DOCENTES TIC
FORMACIÓN DE DOCENTES TICFORMACIÓN DE DOCENTES TIC
FORMACIÓN DE DOCENTES TIC
 
Bed expansion formula
Bed expansion formulaBed expansion formula
Bed expansion formula
 
Saludo de fiestas patrias
Saludo de fiestas patriasSaludo de fiestas patrias
Saludo de fiestas patrias
 
1 video entrevista nino con aquamarina
1 video entrevista nino con aquamarina1 video entrevista nino con aquamarina
1 video entrevista nino con aquamarina
 
JSMS review
JSMS reviewJSMS review
JSMS review
 
20150107150528762
2015010715052876220150107150528762
20150107150528762
 
El estudio Nielsen analiza a los nuevos consumidores online | Estrategia Digital
El estudio Nielsen analiza a los nuevos consumidores online | Estrategia DigitalEl estudio Nielsen analiza a los nuevos consumidores online | Estrategia Digital
El estudio Nielsen analiza a los nuevos consumidores online | Estrategia Digital
 
Presentacion3
Presentacion3Presentacion3
Presentacion3
 
Kun agüero power point
Kun agüero power pointKun agüero power point
Kun agüero power point
 
Advanced Research II
Advanced Research IIAdvanced Research II
Advanced Research II
 

Similar to Looking at RAC, GI/Clusterware Diagnostic Tools

OGG Architecture Performance
OGG Architecture PerformanceOGG Architecture Performance
OGG Architecture Performance
Enkitec
 
Oracle GoldenGate Architecture Performance
Oracle GoldenGate Architecture PerformanceOracle GoldenGate Architecture Performance
Oracle GoldenGate Architecture Performance
Enkitec
 
COUG_AAbate_Oracle_Database_12c_New_Features
COUG_AAbate_Oracle_Database_12c_New_FeaturesCOUG_AAbate_Oracle_Database_12c_New_Features
COUG_AAbate_Oracle_Database_12c_New_Features
Alfredo Abate
 
Getting optimal performance from oracle e business suite(aioug aug2015)
Getting optimal performance from oracle e business suite(aioug aug2015)Getting optimal performance from oracle e business suite(aioug aug2015)
Getting optimal performance from oracle e business suite(aioug aug2015)
pasalapudi123
 

Similar to Looking at RAC, GI/Clusterware Diagnostic Tools (20)

Analysis of Database Issues using AHF and Machine Learning v2 - AOUG2022
Analysis of Database Issues using AHF and Machine Learning v2 -  AOUG2022Analysis of Database Issues using AHF and Machine Learning v2 -  AOUG2022
Analysis of Database Issues using AHF and Machine Learning v2 - AOUG2022
 
Maximizing Oracle RAC Uptime
Maximizing Oracle RAC UptimeMaximizing Oracle RAC Uptime
Maximizing Oracle RAC Uptime
 
OGG Architecture Performance
OGG Architecture PerformanceOGG Architecture Performance
OGG Architecture Performance
 
Analysis of Database Issues using AHF and Machine Learning v2 - SOUG
Analysis of Database Issues using AHF and Machine Learning v2 -  SOUGAnalysis of Database Issues using AHF and Machine Learning v2 -  SOUG
Analysis of Database Issues using AHF and Machine Learning v2 - SOUG
 
Oracle GoldenGate Presentation from OTN Virtual Technology Summit - 7/9/14 (PDF)
Oracle GoldenGate Presentation from OTN Virtual Technology Summit - 7/9/14 (PDF)Oracle GoldenGate Presentation from OTN Virtual Technology Summit - 7/9/14 (PDF)
Oracle GoldenGate Presentation from OTN Virtual Technology Summit - 7/9/14 (PDF)
 
Oracle GoldenGate Architecture Performance
Oracle GoldenGate Architecture PerformanceOracle GoldenGate Architecture Performance
Oracle GoldenGate Architecture Performance
 
les10.pdf
les10.pdfles10.pdf
les10.pdf
 
Swift at Scale: The IBM SoftLayer Story
Swift at Scale: The IBM SoftLayer StorySwift at Scale: The IBM SoftLayer Story
Swift at Scale: The IBM SoftLayer Story
 
RMAN – The Pocket Knife of a DBA
RMAN – The Pocket Knife of a DBA RMAN – The Pocket Knife of a DBA
RMAN – The Pocket Knife of a DBA
 
Expert performance tuning tips for Oracle RAC
Expert performance tuning tips for Oracle RACExpert performance tuning tips for Oracle RAC
Expert performance tuning tips for Oracle RAC
 
COUG_AAbate_Oracle_Database_12c_New_Features
COUG_AAbate_Oracle_Database_12c_New_FeaturesCOUG_AAbate_Oracle_Database_12c_New_Features
COUG_AAbate_Oracle_Database_12c_New_Features
 
Dynamics ax performance tuning
Dynamics ax performance tuningDynamics ax performance tuning
Dynamics ax performance tuning
 
Getting optimal performance from oracle e business suite
Getting optimal performance from oracle e business suiteGetting optimal performance from oracle e business suite
Getting optimal performance from oracle e business suite
 
Getting optimal performance from oracle e business suite(aioug aug2015)
Getting optimal performance from oracle e business suite(aioug aug2015)Getting optimal performance from oracle e business suite(aioug aug2015)
Getting optimal performance from oracle e business suite(aioug aug2015)
 
les12.pdf
les12.pdfles12.pdf
les12.pdf
 
Getting optimal performance from oracle e-business suite presentation
Getting optimal performance from oracle e-business suite presentationGetting optimal performance from oracle e-business suite presentation
Getting optimal performance from oracle e-business suite presentation
 
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
Prometheus - Intro, CNCF, TSDB,PromQL,GrafanaPrometheus - Intro, CNCF, TSDB,PromQL,Grafana
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
 
AppFabric Velocity
AppFabric VelocityAppFabric Velocity
AppFabric Velocity
 
Performance Whackamole (short version)
Performance Whackamole (short version)Performance Whackamole (short version)
Performance Whackamole (short version)
 
Boris Stoyanov - Troubleshooting the Virtual Router - Run and Get Diagnostics
Boris Stoyanov - Troubleshooting the Virtual Router - Run and Get DiagnosticsBoris Stoyanov - Troubleshooting the Virtual Router - Run and Get Diagnostics
Boris Stoyanov - Troubleshooting the Virtual Router - Run and Get Diagnostics
 

More from Leighton Nelson

Managing Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle ClusterwareManaging Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Leighton Nelson
 
Oracle Enterprise Manager Cloud Control 12c - Top 10 Features for DBAs
Oracle Enterprise Manager Cloud Control 12c - Top 10 Features for DBAsOracle Enterprise Manager Cloud Control 12c - Top 10 Features for DBAs
Oracle Enterprise Manager Cloud Control 12c - Top 10 Features for DBAs
Leighton Nelson
 
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle ClusterwareManaging Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Leighton Nelson
 
Configuring Oracle Enterprise Manager Cloud Control 12c for HA White Paper
Configuring Oracle Enterprise Manager Cloud Control 12c for HA White PaperConfiguring Oracle Enterprise Manager Cloud Control 12c for HA White Paper
Configuring Oracle Enterprise Manager Cloud Control 12c for HA White Paper
Leighton Nelson
 
Migrating from Oracle Enterprise Manager 10g to 12c Cloud Control
Migrating from Oracle Enterprise Manager 10g to 12c Cloud ControlMigrating from Oracle Enterprise Manager 10g to 12c Cloud Control
Migrating from Oracle Enterprise Manager 10g to 12c Cloud Control
Leighton Nelson
 

More from Leighton Nelson (10)

IOUG Collaborate 2015 - PDB Cloning Using SQL Commands
IOUG Collaborate 2015 - PDB Cloning Using SQL CommandsIOUG Collaborate 2015 - PDB Cloning Using SQL Commands
IOUG Collaborate 2015 - PDB Cloning Using SQL Commands
 
Database-as-a-Service with Oracle Enterprise Manager Cloud Control 12c and Or...
Database-as-a-Service with Oracle Enterprise Manager Cloud Control 12c and Or...Database-as-a-Service with Oracle Enterprise Manager Cloud Control 12c and Or...
Database-as-a-Service with Oracle Enterprise Manager Cloud Control 12c and Or...
 
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle ClusterwareManaging Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
 
Oracle Enterprise Manager Cloud Control 12c - Top 10 Features for DBAs
Oracle Enterprise Manager Cloud Control 12c - Top 10 Features for DBAsOracle Enterprise Manager Cloud Control 12c - Top 10 Features for DBAs
Oracle Enterprise Manager Cloud Control 12c - Top 10 Features for DBAs
 
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle ClusterwareManaging Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
Managing Oracle Enterprise Manager Cloud Control 12c with Oracle Clusterware
 
UPGRADING FROM ORACLE ENTERPRISE MANAGER 10G TO CLOUD CONTROL 12C WITH ZERO D...
UPGRADING FROM ORACLE ENTERPRISE MANAGER 10G TO CLOUD CONTROL 12C WITH ZERO D...UPGRADING FROM ORACLE ENTERPRISE MANAGER 10G TO CLOUD CONTROL 12C WITH ZERO D...
UPGRADING FROM ORACLE ENTERPRISE MANAGER 10G TO CLOUD CONTROL 12C WITH ZERO D...
 
Configuring Oracle Enterprise Manager Cloud Control 12c for HA White Paper
Configuring Oracle Enterprise Manager Cloud Control 12c for HA White PaperConfiguring Oracle Enterprise Manager Cloud Control 12c for HA White Paper
Configuring Oracle Enterprise Manager Cloud Control 12c for HA White Paper
 
Configuring Oracle Enterprise Manager Cloud Control 12c for High Availability
Configuring Oracle Enterprise Manager Cloud Control 12c for High AvailabilityConfiguring Oracle Enterprise Manager Cloud Control 12c for High Availability
Configuring Oracle Enterprise Manager Cloud Control 12c for High Availability
 
Migrating from Oracle Enterprise Manager 10g to 12c Cloud Control
Migrating from Oracle Enterprise Manager 10g to 12c Cloud ControlMigrating from Oracle Enterprise Manager 10g to 12c Cloud Control
Migrating from Oracle Enterprise Manager 10g to 12c Cloud Control
 
Oracle RAC - Standard Edition, Enterprise Edition & One Node
Oracle RAC - Standard Edition, Enterprise Edition & One NodeOracle RAC - Standard Edition, Enterprise Edition & One Node
Oracle RAC - Standard Edition, Enterprise Edition & One Node
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Looking at RAC, GI/Clusterware Diagnostic Tools

  • 1.
  • 2. Looking at RAC, GI/Clusterware Diagnostic Tools Leighton L. Nelson Oracle DBA Team Lead (10 yrs experience, 6 years with RAC) RAC SIG US Events Chair and IOUG Liaison Session# 373
  • 3. Clusterware & RAC is Complex!
  • 4. Where do I begin?
  • 5. Clusterware, ASM & RAC Diagnostics • Diagcollection • Cluster Verification Utility (cluvfy) • Cluster Health Monitor (CHM) • Remote Diagnostics Agent (RDA) • ADRCI/Support Workbench • OS Utilities
  • 6. Diagcollection • Gathers and packages Clusterware logs, traces plus OS logs and core files* • $ORA_CRS_HOME/bin/diagcollection.pl --collect --crshome $ORA_CRS_HOME (10gR2) • $GRID_HOME/bin/diagcollection.pl --collect --core|crs|all (11gR2) • Logs can be filtered by date/time with --adr --beforetime --aftertime • Allocate enough space in current directory for diagnostic files • Needs to be run on all nodes in the cluster. • Limited information collected if not run as root • In 11.2 diagcollection enhanced to collect ADR and CHM data
  • 7. diagcollection example [root@oelgrid02 u02]# /u01/app/11.2.0/grid/bin/diagcollection.sh --collect Production Copyright 2004, 2010, Oracle. All rights reserved Cluster Ready Services (CRS) diagnostic collection tool The following CRS diagnostic archives will be created in the local directory: crsData_oelgrid02_20120225_1723.tar.gz -> logs, traces and cores from CRS home. Note: core files will be packaged only with the --core option. ocrData_oelgrid02_20120225_1723.tar.gz -> ocrdump, ocrcheck etc coreData_oelgrid02_20120225_1723.tar.gz -> contents of CRS core files in text format osData_oelgrid02_20120225_1723.tar.gz -> logs from Operating System Collecting crs data
  • 8. Cluster Verification Utility • Cluvfy runs in stage mode or component mode • Can be executed from the Grid Infrastructure Home in 11gR2 or from installation media • New resource in 11.2.0.2.0 - ora.cvu • “cluvfy comp –list” displays components that can be checked • For standalone cluvfy set CV_HOME CV_JDKHOME and CV_DESTLOC
  • 9. Cluster Verification Utility • Use stage mode during installation/upgrade • Use component mode to diagnose components after Clusterware installation • Doesn’t diagnose all components e.g. HAIP • $GRID_HOME/bin/cluvfy • $INSTALL_DISK/runcluvfy.sh • New in 11.2.0.3.0 : cluvfy comp healthcheck
  • 10. Cluster Verification Utility cluvfy comp –list output
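The slides name two places cluvfy can be run from: the Grid home after installation, and runcluvfy.sh on the installation media before it. A small sketch of choosing between them follows. The function name and its two arguments are hypothetical, and the paths are whatever `$GRID_HOME` and `$INSTALL_DISK` point to on your system.

```shell
# Sketch: prefer cluvfy from the Grid home; fall back to runcluvfy.sh on
# the installation media when Grid Infrastructure is not installed yet.
pick_cluvfy() {
    local grid_home="$1" install_disk="$2"
    if [ -x "$grid_home/bin/cluvfy" ]; then
        echo "$grid_home/bin/cluvfy"
    elif [ -x "$install_disk/runcluvfy.sh" ]; then
        echo "$install_disk/runcluvfy.sh"
    else
        echo "cluvfy not found in $grid_home or $install_disk" >&2
        return 1
    fi
}
```

Usage might look like `"$(pick_cluvfy "$GRID_HOME" "$INSTALL_DISK")" comp -list`, which lists the components that can be checked.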
  • 11. Cluster Health Monitor (CHM) • Cluster Health Monitor (CHM) monitors and collects OS and Clusterware metrics in real time • Installed by default in 11.2.0.2+ • Collects metrics at a 1-second interval in 11.2.0.2 and a 5-second interval in 11.2.0.3 • Command-line interface: $GRID_HOME/bin/oclumon • Collect CHM data using diagcollection.pl --collect --chmos
  • 12. Cluster Health Monitor (CHM) • Useful for root cause analysis of node reboots/hangs, instance evictions, performance degradation, etc. • The OTN version of CHM and the 11.2.0.2 version are incompatible; if you have 11.2.0.2, you cannot install the OTN version • Uses OS APIs to collect metrics, reducing overhead • Clusterware resource called ora.crf • CHM doesn’t require RAC or Clusterware
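As an illustration of the oclumon command-line interface, the sketch below builds a command that dumps recent CHM metrics for all nodes. The `dumpnodeview -allnodes -last` form is not shown in the slides; it is given here on the assumption that your oclumon version supports it, so check `oclumon dumpnodeview -h` first. The function name is hypothetical and the command is only printed, not run.

```shell
# Sketch: print an oclumon command that dumps the last HH:MM:SS of
# CHM node metrics for every node in the cluster.
chm_dump_cmd() {
    local grid_home="$1" window="$2"
    case "$window" in
        [0-9][0-9]:[0-9][0-9]:[0-9][0-9]) ;;
        *) echo "window must be HH:MM:SS, got: $window" >&2; return 1 ;;
    esac
    printf '%s/bin/oclumon dumpnodeview -allnodes -last "%s"\n' \
        "$grid_home" "$window"
}
```

For instance, `chm_dump_cmd "$GRID_HOME" 00:15:00` prints the command for the most recent 15 minutes, which is a convenient window when looking at a node reboot that just happened.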
  • 13. OS Watcher Black Box • OS Watcher v4.0 has been renamed to OS Watcher Black Box (OSWbb) • UNIX shell scripts for monitoring the OS (ps, top, mpstat, iostat, netstat, vmstat) • Useful for diagnosing OS resource and performance problems, node reboots • Should run on all nodes in a cluster • Set up private interconnect monitoring • Execute startOSWbb.sh arg1 arg2, where arg1 = collection frequency in seconds and arg2 = retention time in hours, e.g. nohup ./startOSWbb.sh 60 48 &
  • 14. OS Watcher Black Box • Bundled with OS Watcher Black Box Analyzer (OSWbba) • Requires Java 1.4.2 or greater • Correlate OS statistics using the analyzer profile • Generates graphs and reports for memory, CPU, disk • Use CLI option to script profile generation for troubleshooting
  • 16. OS Watcher Black Box OSWbb Free Memory Graph
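The two startOSWbb.sh arguments trade disk space against detail, so it is worth estimating the archive size before starting the collectors. The sketch below assumes the first argument is an interval in seconds and the second a retention period in hours, and that each collector keeps about one snapshot per interval. The helper names are hypothetical.

```shell
# Sketch: validate OSWbb arguments and estimate retained snapshots.
is_uint() {
    case "$1" in
        ''|*[!0-9]*|0[0-9]*) return 1 ;;   # empty, non-digit, or leading zero
        *) return 0 ;;
    esac
}

# Approximate number of snapshots each collector retains.
oswbb_snapshots() {
    local interval_s="$1" retain_h="$2"
    if ! is_uint "$interval_s" || ! is_uint "$retain_h" || [ "$interval_s" -eq 0 ]; then
        echo "usage: oswbb_snapshots <interval_seconds> <retention_hours>" >&2
        return 1
    fi
    echo $(( retain_h * 3600 / interval_s ))
}

# Print the start command from the slide for the given arguments.
oswbb_start_cmd() {
    local interval_s="$1" retain_h="$2"
    oswbb_snapshots "$interval_s" "$retain_h" >/dev/null || return 1
    printf 'nohup ./startOSWbb.sh %s %s &\n' "$interval_s" "$retain_h"
}
```

With the values from the slide, `oswbb_snapshots 60 48` gives 2880 snapshots per collector; halving the interval to 30 seconds doubles that.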
  • 17. RACcheck – RAC Configuration Audit Tool • RACCHECK OUTPUT
  • 18. RACcheck – RAC Configuration Audit Tool • Assess the configuration of RAC, Clusterware and ASM • Useful for pre-upgrade and post-upgrade system verification • Uses “Best Practices” to report configuration problems – PASS/WARNING/FAIL/INFO • Generates detailed and summary reports with scorecard
  • 19. Remote Diagnostics Agent (RDA) • The diagnostics tool recommended by MOS • Collects a wealth of information based on configuration – OS/Clusterware/Database logs • Runs AWR/Statspack reports for performance problems • Generates reports in HTML format
  • 20. Procwatcher • Debug Oracle & Clusterware processes using oradebug short_stack or OS debugger (e.g. gdb, pstack) • Run as Oracle process owner to debug database or as root for clusterware processes • Can be deployed as a Clusterware resource • Useful for troubleshooting session hangs, severe performance problems, instance evictions
  • 21. Procwatcher grid@node1[+ASM1]-/u02 >./prw.sh start all Wed Feb 25 02:30:26 CDT 2012: Starting Procwatcher Wed Feb 25 02:30:26 CDT 2012: Thank you for using Procwatcher. :-) Wed Feb 25 02:30:26 CDT 2012: Please add a comment to Oracle Support Note 459694.1 Wed Feb 25 02:30:26 CDT 2012: if you have any comments, suggestions, or issues with this tool. Wed Feb 25 02:30:26 CDT 2012: Started Procwatcher
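The editor's notes for Procwatcher list which parameters to set for particular problem types. A small lookup such as the sketch below can make those pairings easier to apply consistently. The problem labels (`clusterware-cpu`, `ora-4031`, and so on) are hypothetical names chosen for this example; only the parameter names and values come from the notes.

```shell
# Sketch: print the Procwatcher parameters the notes recommend for a
# given problem type. Labels on the left are illustrative, not official.
prw_settings() {
    case "$1" in
        clusterware-cpu)   # also requires running Procwatcher as root
            printf '%s\n' "EXAMINE_CLUSTER=true" ;;
        ora-4031)          # sgastat=y is a default; heapdetails=y is not
            printf '%s\n' "USE_SQL=true" "sgastat=y" "heapdetails=y" ;;
        ora-4030)
            printf '%s\n' "USE_SQL=true" "process_memory=y" ;;
        rman)
            printf '%s\n' "USE_SQL=true" "rmanclient=y" ;;
        *)
            echo "no preset for: $1" >&2
            return 1 ;;
    esac
}
```

Running `prw_settings ora-4031` prints the three settings to check before starting Procwatcher with `./prw.sh start all`.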
  • 22. ADRCI/Support Workbench • Automatic Diagnostic Repository (ADR) stores database diagnostic information • Package diagnostic files using ADRCI or Support Workbench • Manages incidents and problems from alert logs • Enterprise Manager provides a GUI to ADR called Support Workbench
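To illustrate packaging diagnostics with ADRCI, the sketch below builds a one-line `adrci exec=` command that selects an ADR home and packs a single incident into a zip file. The home path, incident number and output directory used in the usage note are placeholders, and the function name is hypothetical. List the real values first with `adrci exec="show homes"` and `adrci exec="show incident"`.

```shell
# Sketch: print an adrci command that packages one incident for upload.
# Nothing is executed; review the printed command before running it.
adrci_pack_cmd() {
    local homepath="$1" incident="$2" outdir="$3"
    case "$incident" in
        ''|*[!0-9]*) echo "incident id must be numeric" >&2; return 1 ;;
    esac
    printf 'adrci exec="set homepath %s; ips pack incident %s in %s"\n' \
        "$homepath" "$incident" "$outdir"
}
```

For example, `adrci_pack_cmd diag/rdbms/orcl/orcl1 4521 /tmp` prints a command that would pack incident 4521 from that home into `/tmp`.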
  • 24. RACDIAG.SQL • Gathers debug information for RAC Session Hangs • One-time data capture • Performs hanganalyze dumps • Certain types of hangs will prevent it from running
  • 25. OS Utilities • truss/strace – trace system calls and signals • pstack – dump stack trace for process • pmap/procmap – maps process memory • nmon/nmon analyzer – collects and analyzes OS stats • collectl /collectl utils – collects and analyzes OS stats
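A single stack dump rarely shows whether a process is hung or just slow; several dumps a few seconds apart are more telling. The sketch below takes repeated snapshots of one process. It assumes `pstack` is installed and on the PATH (it is not available on every platform), and lets you pass a different dump command as the fourth argument. The function name is hypothetical.

```shell
# Sketch: take COUNT stack snapshots of PID, INTERVAL seconds apart.
# The dump command defaults to pstack and is called as: <dumper> <pid>.
snapshot_stacks() {
    local pid="$1" count="$2" interval="$3" dumper="${4:-pstack}"
    local i=1
    while [ "$i" -le "$count" ]; do
        echo "=== snapshot $i of $count for pid $pid ==="
        "$dumper" "$pid" || return 1
        if [ "$i" -lt "$count" ]; then
            sleep "$interval"
        fi
        i=$((i + 1))
    done
}
```

For example, `snapshot_stacks 12345 3 5 > stacks.txt` would capture three dumps five seconds apart; if the top frames are identical in all three, the process is probably stuck rather than busy.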
  • 26. Summary
      Tool/Utility   | Instance Evictions | Node Reboots | Clusterware | RAC Performance Problems
      diagcollection | ✓                  | ✓            | ✓           | ✗
      cluvfy         | ✗                  | ✗            | ✓           | ✗
      CHM            | ✓                  | ✓            | ✓           | ✓
      OSWbb/OSWbba   | ✓                  | ✓            | ✓           | ✓
      RDA            | ✓                  | ✓            | ✓           | ✓
      RACcheck       | ✓                  | ✓            | ✓           | ✗
      Procwatcher    | ✓                  | ✗            | ✓           | ✓
      ADRCI/SW       | ✗                  | ✗            | ✗           | ✓
  • 27. MOS Notes • OS Watcher Black Box User Guide [ID 301137.1] • OS Watcher Black Box Analyzer User Guide [ID 461053.1] • Data Gathering for Troubleshooting Oracle Clusterware (CRS or GI) Issues [ID 289690.1] • CRS 10gR2/ 11gR1/ 11gR2 Diagnostic Collection Guide [ID 330358.1] • Diagnosability for Oracle Clusterware (CRS or Grid Infrastructure) Component and Resource [ID 357808.1] • Data Gathering for Troubleshooting RAC Issues [ID 556679.1] • Cluster Health Monitor (CHM) FAQ [ID 1328466.1] • Introducing Cluster Health Monitor (IPD/OS) [ID 736752.1] • RACcheck - RAC Configuration Audit Tool [ID 1268927.1] • Procwatcher: Script to Monitor and Examine Oracle DB and Clusterware Processes [ID 459694.1] • Script to Collect RAC Diagnostic Information (racdiag.sql) [ID 135714.1]
  • 28. Contact Information • Website - blogs.griddba.com • LinkedIn – Leighton Nelson • Twitter - @leight0nn • Email: leighton.nelson@mercy.net

Editor's Notes

  1. RAC is complex. When something goes wrong, where do you start?
  2. Logs
  3. The diagcollection script needs to be run on all nodes in the cluster. Limited information is collected if it is not run as root. In 11.2, diagcollection was enhanced to collect ADR and CHM data. Core files are only packaged with the --core option.
  4. Use stage mode during installation. Use component mode to diagnose components after Clusterware installation. Doesn’t diagnose all components, e.g. HAIP. Locations: $GRID_HOME/bin/cluvfy and $INSTALL_DISK/runcluvfy.sh. Resource: ora.cvu. New option in 11.2.0.3.0: cluvfy comp healthcheck [-collect {cluster|database}] [-db db_unique_name] [-bestpractice|-mandatory] [-deviations] [-html] [-save [-savedir directory_path]]
  7. Useful for root cause analysis of node reboots/hangs, instance evictions, performance degradation, etc. The OTN version of CHM and the 11.2.0.2 version are incompatible; if you have 11.2.0.2, you cannot install the OTN version. Uses OS APIs to collect metrics, reducing overhead. Clusterware resource called ora.crf. CHM doesn’t require RAC or Clusterware.
  9. OSWatcher Black Box is certified to run on AIX, Solaris, HP-UX, and Linux. By default it collects data every 30 seconds and archives 48 hours' worth of data. Collectors: ps, top, mpstat, iostat, netstat, traceroute, vmstat.
  10. Requires Java 1.4.2 or greater. Parses OSWbb data. Menu driven or CLI. Disk graphs will only be generated if iostat is used with extended statistics. Correlate OS statistics using the analyzer profile. OS Watcher Black Box User Guide [301137.1]
  13. Supported on Linux, AIX (bash) and Solaris SPARC RACcheck - RAC Configuration Audit Tool [ID 1268927.1]
  15. RDA for RAC requires initial setup. Run RDA regularly to detect problems proactively
  16. Procwatcher: Script to Monitor and Examine Oracle DB and Clusterware Processes [ID 459694.1]. Calls pstack by default. Procwatcher is a tool to examine and monitor Oracle database and/or clusterware processes at an interval. The tool collects stack traces of these processes using Oracle tools like oradebug short_stack and/or OS debuggers like pstack, gdb, dbx, or ladebug, and collects SQL data if specified. Typical use cases: session-level hangs or severe contention in the database/instance; severe performance issues; instance evictions and/or DRM timeouts; clusterware or DB processes stuck or consuming high CPU (must set EXAMINE_CLUSTER=true and run as root for clusterware processes); ORA-4031 and SGA memory management issues (set USE_SQL=true and sgastat=y, which are the defaults, and also set heapdetails=y, which is not the default); ORA-4030 and DB process memory issues (set USE_SQL=true and process_memory=y); RMAN slowness/contention during a backup (set USE_SQL=true and rmanclient=y).
  18. ADRCI is a command-line tool that is part of the fault diagnosability infrastructure introduced in Oracle Database Release 11g. ADRCI enables you to: View diagnostic data within the Automatic Diagnostic Repository (ADR). View Health Monitor reports. Package incident and problem information into a zip file for transmission to Oracle Support.
  21. Data Gathering for Troubleshooting RAC Issues [ID 556679.1]