The document discusses analytical modeling techniques for assessing system performance. It describes choosing a representative baseline period for modeling, such as a peak usage time. Various modeling scenarios are presented, like increasing a workload by 5% or changing hardware, to determine how systems would be affected. Both mainframe and distributed modeling examples are provided that examine impacts of changes to CPU, storage, and workloads on system performance. The document emphasizes gathering accurate system data and validating model results against real systems.
Modeling and Forecasting – Effective Baselines for Capacity Management
1. ANALYTICAL MODELING AND
CREATING EFFECTIVE BASELINES
Charles Johnson – Senior Solutions Engineer
Bill Bostridge – VP Sales, Data Infrastructure Optimization
2. Leading the next technology [r]evolution:
Big Iron to Big Data
Big Iron to Big Data: a fast-growing market segment
composed of solutions that optimize traditional data
systems and deliver mission-critical data from these systems
to next-generation analytic environments.
Syncsort Confidential and Proprietary - do not copy or distribute 2
3. Syncsort - Trusted Industry Leadership
Syncsort Confidential and Proprietary - do not copy or distribute
500+
Experienced & Talented
Data Professionals
>7,000
Customers
1968
50 Years of Market Leadership
& Award-Winning Customer Support
84
of Fortune 100 are Customers
3x
Revenue Growth
In Last 12 Months
The global leader in Big Iron to Big Data
3
5. Differentiated Product Portfolio & Technical Expertise
Syncsort Confidential and Proprietary - do not copy or distribute
Data
Infrastructure Optimization
Data
Availability
Data
Integration
Data
Quality
Market-leading
data quality capability
Best-in-class resource utilization
and performance, on premise
or in the cloud
#1 in high availability for
IBM i and AIX Power Systems
Industry-leading mainframe
data access and highest
performing ETL
• Trillium Software System
• Trillium Quality for Big
Data
• Trillium Precise
• Trillium Cloud
• Trillium Global Locator
• DL/2
• Zen Suite
• MFX® for z/OS
• ZPSaver Suite
• EZ-DB2
• EZ-IDMS
• DMX & DMX-h
• DMX AppMod
• athene®
• athene
SaaS®
• MIMIX Availability & DR
• MIMIX Move
• MIMIX Share
• Quick-EDD/HA
• iTera Availability
• Enforcive, Cilasoft, CSI
• Ironstream®
• Ironstream® Transaction
Tracing
• DMX & DMX-h
• DMX Change Data Capture
Big Iron to Big Data
A fast-growing market segment composed of solutions that optimize traditional data systems and
deliver mission-critical data from these systems to next-generation analytic environments.
5
6. Agenda
Modeling Tools and Techniques
What Information do I need?
What questions can a model answer?
Determine the baseline for the modelling question
Modeling scenarios
Recap
6Syncsort Confidential and Proprietary - do not copy or distribute
7. “A model is a simplification of
reality, built for a specific purpose”
Models
Syncsort Confidential and Proprietary - do not copy or distribute 7
8. What Questions can a Model Answer?
Where are my current bottlenecks?
How will increasing workloads affect response time?
How much more work can my systems handle?
What is the degree of tolerance in the forecast?
How sensitive is it to workload growth estimates?
When do I need to upgrade, and what performance benefit will be gained?
Syncsort Confidential and Proprietary - do not copy or distribute 8
9. Quality of Service for Stakeholders
9Syncsort Confidential and Proprietary - do not copy or distribute
10. Analytical Models
Modeling Tools and Techniques
Trending
Consolidation
Vendor
Syncsort Confidential and Proprietary - do not copy or distribute 10
11. Analytical Models
Analytical modeling
– Multi-class queuing network theory; stable and well understood
– Operational research laws applied to computers
– Known as Queuing Network Models (QNM)
Modest time, effort and expertise to model a system
Attainable accuracy OK for business decisions
Few metrics, given automatic data collection
Few parameters for What-If changes
Quick scenario evaluations
Quick sensitivity analyses
Quality of Service to Stakeholder
Syncsort Confidential and Proprietary - do not copy or distribute 11
12. Analytical Models – Pros and Cons
Pros
– Accurate enough for most planning purposes
– Quick to build and get results
– Does not require detailed expertise
– Completely reproducible results
Cons
– Limited to specific workload mixes at a single point in time
Syncsort Confidential and Proprietary - do not copy or distribute 12
13. Simulation Models
Discrete Event Simulation (DES) engine
Detailed knowledge of system internals
Even more details to achieve high accuracy
Good instrumentation or expertise to collect data
Expertise to model a system and calibrate the model
Syncsort Confidential and Proprietary - do not copy or distribute 13
14. Simulation Models
Pros
– Potentially very accurate if set up by an expert
– Can model behavior over time
– Can model detailed system internals such as database buffer pools
Cons
– Potentially very inaccurate if not set up by an expert
– High degree of detailed knowledge and specialist expertise
– Time and effort to set up
– Time to produce a report
– More time to consider all potential scenarios
– Even more time to do sensitivity analysis
Syncsort Confidential and Proprietary - do not copy or distribute 14
15. Modeling vs. Trending
Trending
– Easy to do
– Scales well for multiple systems e.g. Windows , UNIX
– Easy to alert on
– No relationship to the performance (service) of a system or application
– Looks at just one metric
Modeling
– Relates service to utilization
– View whole system interactions
– ITIL recommends modeling
– Moderate effort
15Syncsort Confidential and Proprietary - do not copy or distribute
16. Spreadsheet Analysis
Limitations occur when too much of too many types of data must be analysed
and reported on repeatedly
16Syncsort Confidential and Proprietary - do not copy or distribute
http://optimityanz.com/wp-
content/uploads/2014/10/spreadsheet.png
17. Analytical Modeling Workflow
17Syncsort Confidential and Proprietary - do not copy or distribute
Modify Model and
Predict Performance
Build and Calibrate
the Model
Current Configuration
Measure Current
Environment
Technical & Business
Inputs
Describe the
expected system
Technical & Business
Inputs Expected
System
Expected Configuration
Measure and
Compare with
Predictions
Actual Inputs
Actual Configuration
Actual Inputs
Actual Configuration
18. What information do I need?
Service Level Agreements (SLA)
Business information, i.e. # of widgets made quota
New applications
New hardware from vendors
18Syncsort Confidential and Proprietary - do not copy or distribute
19. Determine the baseline for the modeling question
Time and Duration
– Identify peak times (when future problems may occur)
– Ensure that important workloads are present
Consistent behaviour
– Type of work
– Transaction arrival rate
– Total system loading
Not affected by external events
– Normal availability
– No media failures, looping tasks, etc.
Syncsort Confidential and Proprietary - do not copy or distribute 19
20. Choosing a modeling period - Peak vs. Average
What method do you want to base the modeling?
20Syncsort Confidential and Proprietary - do not copy or distribute
21. Choosing a modeling period
21Syncsort Confidential and Proprietary - do not copy or distribute
22. Choosing a modeling period
22Syncsort Confidential and Proprietary - do not copy or distribute
23. Choosing a modeling period - Time of Day
Does the time of day affect workloads
Is the environment transactional based
Is over-night batch running into online uptime
23Syncsort Confidential and Proprietary - do not copy or distribute
24. Business Transactions vs. CPU Usage
Syncsort Confidential and Proprietary - do not copy or distribute 24
25. Modeling Scenarios
Change in Workload characteristics
Change in hardware characteristics
Hardware / Merger & Acquisition consolidation
Moving to the Cloud
25Syncsort Confidential and Proprietary - do not copy or distribute
26. Syncsort Confidential and Proprietary - do not copy or distribute
26
MAINFRAME ANALYTICAL MODEL SCENARIO
27. Model Baseline - CPU/CEC Modeling – Hardware & I/O
27Syncsort Confidential and Proprietary - do not copy or distribute
28. Increase ProdOnl 5% - CPU/CEC Modeling – Hardware & I/O
28Syncsort Confidential and Proprietary - do not copy or distribute
29. Results CPU Change - CPU/CEC Modeling – Hardware & I/O
29Syncsort Confidential and Proprietary - do not copy or distribute
31. Model – oracleq000 10% growth – server change
31Syncsort Confidential and Proprietary - do not copy or distribute
32. Model – Linux server change & disk change
32Syncsort Confidential and Proprietary - do not copy or distribute
33. Baseline – Hardware & I/O
33Syncsort Confidential and Proprietary - do not copy or distribute
34. Model – Increase ProdOnl workload 5%
34Syncsort Confidential and Proprietary - do not copy or distribute
35. Results CPU Change based on workload increase
35Syncsort Confidential and Proprietary - do not copy or distribute
36. Recap
What is the question you are attempting to answer?
Gather as much data and information as possible
Ask questions and document
Ensure your baseline matches the real world
Compare your modeling results to the actual timeframe
36Syncsort Confidential and Proprietary - do not copy or distribute