SlideShare a Scribd company logo
1 of 22
Scalable Storage Configuration for the
Physics Database Services
Luca Canali, CERN IT
LCG Database Deployment and
Persistency Workshop
October, 2005
Database Workshop, October 2005 L.Canali, CERN 2
Outline
• In this talk I will discuss
– Storage configuration for scalable database services
• Main challenges
• Best practices
– An implementation of scalable storage
• Impacts on DB logical to physical mapping
• Performance and resource allocations
– How we can help you to size new database projects or
to scale up existing applications
• Performance testing
• Benchmark data
Database Workshop, October 2005 L.Canali, CERN 3
Oracle Scalable Architecture
Goal: A database infrastructure that provides the required system
resources to the end-users and applications.
How: A modular architecture that can scale up to a large number of
components
Database Workshop, October 2005 L.Canali, CERN 4
RAID 1: HA Storage
• Mirroring
– 2-Way mirroring (RAID 1) protects against single point
of failures
– Can be used to redistribute I/O load (performance)
Database Workshop, October 2005 L.Canali, CERN 5
RAID 0: Scalable Performances
• RAID 0 (Striping) automatically redistributes files across
multiple disks.
• Performance and scalability are increased
• Error resiliency is decreased
Unstriped Disks Striped Disks
Database Workshop, October 2005 L.Canali, CERN 6
Mechanical and Geometrical constraints
• The external part of the disk provides
– More throughput
– Less latency
Database Workshop, October 2005 L.Canali, CERN 7
S.A.M.E Strategy
• Goal: optimize storage I/O utilization
• S.A.M.E. (Stripe And Mirror Everything) Strategy
– Built on the concepts of RAID 1 + 0
– Proposed by J. Loaiza (Oracle) in 1999
– Replaces “old recipes”: manual balancing across
volumes
• Need a Software or Hardware Volume Manager
– ASM is Oracle’s solution with 10g “S.A.M.E. out of the
box”
– Other solutions available from different vendors
require configuration
Database Workshop, October 2005 L.Canali, CERN 8
Storage Configuration Guidelines
• Use all available disk drives
• Place frequently used data at outer half of disk
– Fastest transfer rate
– Minimize seek time
• Stripe data at 1MB extents
– Distribute the workload across disks
– Eliminate hot spots
– Optimum sequential bandwidth gained with1MB I/O
• Stripe redo logs across multiple drives
– Maximize write throughput for small writes
– Smaller stripe size (128KB) and/or dedicated disks
• Use cache on the controller
– ‘Write-back’ cache
– Battery-backed cache
Database Workshop, October 2005 L.Canali, CERN 9
Oracle’s ASM Main Features
• Mirror protection:
– 2-way and 3-way mirroring available.
– Mirror on a per-file basis
– Can mirror across storage arrays
• Data striping across the volume:
– 1MB and 128KB stripes available
• Supports clustering and single instance
• Dynamic data distribution
– A solution to avoid ‘hot spots’
– On-line add/drop disk with minimal data relocation
– Automatic database file management
• Database File System with performance of RAW I/O
Database Workshop, October 2005 L.Canali, CERN 10
ASM’s Configuration – Examples
• ASM is a volume manager, its output are disk groups (DG) that Oracle
databases can mount to allocate their files
RECOVERY-DG
DATA-DG
Config 1: Disk groups created
with dedicated disks
Config 2: Disk groups created
by ‘horizontal’ slicing
Database Workshop, October 2005 L.Canali, CERN 11
Proposed Storage Configuration
• Proposed storage configuration:
– High availability - Allows backups to disk
– High performance - Allows clusterware mirroring (10.2)
– DBs have dedicated resources
Storage
Arrays
Oracle RAC
Nodes
Data DG-2 Recovery DG-1
Data DG-1 Recovery DG-2 Disk Groups
(ASM)
DB N.1 DB N.2
Database Workshop, October 2005 L.Canali, CERN 12
FAQ 1: Datafiles
• Do I need to worry on the number and names of
the datafiles allocated for each tablespace?
• “Traditional” storage allocation across multiple volumes:
– Requires a careful allocation of multiple datafiles across logical
volumes and/or filesystems
– Datafile-to-filesystem and filesystem-to-physical storage mappings
have to be frequently tuned
• S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access
across disks
– There is NO NEED, for performance reasons, to allocate multiple
datafiles per tablespace.
– 10g new feature “bigfile tablespace” allows for tablespaces with a
single datafile that can grow up to 32 TB (db_block_size=8k)
Database Workshop, October 2005 L.Canali, CERN 13
FAQ 2: Data and Index Tablespaces
• Do I need dedicated tablespaces for indexes and
tables?
• Separation of indexes and tables has often been advised to:
– Distribute I/O
– Reduce fragmentation
– Allow separate backup of tables and indexes
• S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access
across disks
– No performance gains are expected by using dedicated
tablespaces for INDEXes and TABLEs.
• Additional Notes:
– Tablespaces fragmentation has little impact when using locally managed
TBSs and automatic segment space management (9i and 10g)
– Very large database can profit from using multiple tablespaces for admin
purposes and logical separation of objects
Database Workshop, October 2005 L.Canali, CERN 14
FAQ 3: Sizing Storage
• Storage sizing for database should not take size as the
first requirement
– Bandwidth and performance metrics are bound to the number of
disk spindles
– Magnetic HD technology has improved the GByte/$ ratio
– The rest of HD technology has not seen much improvements in
the last 5 years (since 15K rpms HDs)
• Sizing for storage requirements should
– Be based on stress test measurements
– Past performance measurements on comparable systems
– New projects can leverage benchmark data.
• Extra HD space is not wasted
– Can be used to strengthen the B&R policy with Disk Backups
Database Workshop, October 2005 L.Canali, CERN 15
IO Benchmark Measurements
• Benchmark data can be used for
– Sizing of new projects and upgrades
– Performance baseline, testing for new hardware
• The following metrics have been measured:
– Sequential throughput (full scans)
– Random access (indexed access)
– I/O per second (indexed access)
– Metrics are measured as a function of workload
• Other test details
– Benchmark tool: Oracle’s ORION
– Infortrend Storage array: 16 SATA x 400 GB disks, 1
controller and 1 GB cache
Database Workshop, October 2005 L.Canali, CERN 16
IO Benchmark Data
Sequential Workload Performance
(Read Only Workload)
0
20
40
60
80
100
120
140
160
1 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32
Load (N# Outstanding asynch I/Os)
Throughput
(MBps)
12 LUNs, noraid
16 LUNs, no raid,
external partition
Throughput:
+ 25%
Database Workshop, October 2005 L.Canali, CERN 17
IO Benchmark Data
Latency for Small Random I/O
(Read Only Workload)
0
20
40
60
80
100
120
140
160
1 2 3 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68 72 76 80
Load (N# Outstanding asynch I/Os)
Latency
(ms)
12 LUNs, noraid
16 LUNs, no raid,
external partition
Latency:
- 35%
Database Workshop, October 2005 L.Canali, CERN 18
IO Benchmark Data
Small Random I/O Performance
(Read Only Workload)
0
100
200
300
400
500
600
700
800
900
1000
1 4 16 28 40 52 64 76
Load (N# Outstanding asynch I/Os)
I/O
per
second
12 LUNs, noraid
16 LUNs, no raid,
external partition
I/O /sec:
+60%
Database Workshop, October 2005 L.Canali, CERN 19
Conclusions
• The Database Services for Physics can provide
– Scalable Database services
• Scalable on CPU and Memory resources
• Scalable on Storage resources
– Sizing for new projects or upgrades
• Stress/Performance testing
• Integration and Validation Testing
• Benchmark data for capacity planning
Database Workshop, October 2005 L.Canali, CERN 20
Additional Benchmark Data
Sequential Workload Performance
(Read Only Workload)
0
20
40
60
80
100
120
140
1 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32
Load (N# Outstanding asynch I/Os)
Throughput
(MBps)
7 LUNs, raid0
14 LUNs, no raid
14 LUNs, no raid, outer
partition used
14 LUN, no raid, inner
partition used
Database Workshop, October 2005 L.Canali, CERN 21
Additional Benchmark Data
Latency for Small Random I/O
(Read Only Workload)
0
20
40
60
80
100
120
140
1 2 3 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68 72 76 80
Load (N# Outstanding asynch I/Os)
Latency
(ms)
7 LUNs, raid0
14 LUNs, no raid
14 LUNs, no raid, outer
partition used
14 LUNS, no raid, inner
partition used
Database Workshop, October 2005 L.Canali, CERN 22
Additional Benchmark Data
Small Random I/O Performance
(Read Only Workload)
0
100
200
300
400
500
600
700
800
900
1 4 16 28 40 52 64 76
Load (N# Outstanding asynch I/Os)
I/O
per
second
7 LUNs, raid0
14 LUNs, no raid
14 LUNs, noraid, outer
partition used
14 LUNS, noraid, inner
partition used

More Related Content

Similar to Scalable Storage Configuration for the Physics Database Services

Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale Ceph Community
 
Oracle 11g data warehouse introdution
Oracle 11g data warehouse introdutionOracle 11g data warehouse introdution
Oracle 11g data warehouse introdutionAditya Trivedi
 
The Apache Spark config behind the indsutry's first 100TB Spark SQL benchmark
The Apache Spark config behind the indsutry's first 100TB Spark SQL benchmarkThe Apache Spark config behind the indsutry's first 100TB Spark SQL benchmark
The Apache Spark config behind the indsutry's first 100TB Spark SQL benchmarkLenovo Data Center
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics PlatformSantanu Dey
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureCeph Community
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitecturePatrick McGarry
 
Spark Summit EU talk by Berni Schiefer
Spark Summit EU talk by Berni SchieferSpark Summit EU talk by Berni Schiefer
Spark Summit EU talk by Berni SchieferSpark Summit
 
DBA 101 : Calling all New Database Administrators (PPT)
DBA 101 : Calling all New Database Administrators (PPT)DBA 101 : Calling all New Database Administrators (PPT)
DBA 101 : Calling all New Database Administrators (PPT)Gustavo Rene Antunez
 
Cost Effectively Run Multiple Oracle Database Copies at Scale
Cost Effectively Run Multiple Oracle Database Copies at Scale Cost Effectively Run Multiple Oracle Database Copies at Scale
Cost Effectively Run Multiple Oracle Database Copies at Scale NetApp
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community
 
Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Lov...
 Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Lov... Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Lov...
Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Lov...Databricks
 
Hadoop 3 @ Hadoop Summit San Jose 2017
Hadoop 3 @ Hadoop Summit San Jose 2017Hadoop 3 @ Hadoop Summit San Jose 2017
Hadoop 3 @ Hadoop Summit San Jose 2017Junping Du
 
Apache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateApache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateDataWorks Summit
 
Accelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cacheAccelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cacheDavid Grier
 
Severalnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IXSeveralnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IXSeveralnines
 

Similar to Scalable Storage Configuration for the Physics Database Services (20)

Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale
 
Oracle 11g data warehouse introdution
Oracle 11g data warehouse introdutionOracle 11g data warehouse introdution
Oracle 11g data warehouse introdution
 
Oracle
OracleOracle
Oracle
 
The Apache Spark config behind the indsutry's first 100TB Spark SQL benchmark
The Apache Spark config behind the indsutry's first 100TB Spark SQL benchmarkThe Apache Spark config behind the indsutry's first 100TB Spark SQL benchmark
The Apache Spark config behind the indsutry's first 100TB Spark SQL benchmark
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
 
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudThe state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 
NSCC Training Introductory Class
NSCC Training Introductory Class NSCC Training Introductory Class
NSCC Training Introductory Class
 
Spark Summit EU talk by Berni Schiefer
Spark Summit EU talk by Berni SchieferSpark Summit EU talk by Berni Schiefer
Spark Summit EU talk by Berni Schiefer
 
DBA 101 : Calling all New Database Administrators (PPT)
DBA 101 : Calling all New Database Administrators (PPT)DBA 101 : Calling all New Database Administrators (PPT)
DBA 101 : Calling all New Database Administrators (PPT)
 
OOW-IMC-final
OOW-IMC-finalOOW-IMC-final
OOW-IMC-final
 
Cost Effectively Run Multiple Oracle Database Copies at Scale
Cost Effectively Run Multiple Oracle Database Copies at Scale Cost Effectively Run Multiple Oracle Database Copies at Scale
Cost Effectively Run Multiple Oracle Database Copies at Scale
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
 
Frb Briefing Database
Frb Briefing DatabaseFrb Briefing Database
Frb Briefing Database
 
Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Lov...
 Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Lov... Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Lov...
Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Lov...
 
Hadoop 3 @ Hadoop Summit San Jose 2017
Hadoop 3 @ Hadoop Summit San Jose 2017Hadoop 3 @ Hadoop Summit San Jose 2017
Hadoop 3 @ Hadoop Summit San Jose 2017
 
Apache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateApache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community Update
 
Accelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cacheAccelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cache
 
Severalnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IXSeveralnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IX
 

Recently uploaded

Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
HONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsHONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsMichael W. Hawkins
 
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...lizamodels9
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Neil Kimberley
 
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...Any kyc Account
 
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyThe Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyEthan lee
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...anilsa9823
 
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurVIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurSuhani Kapoor
 
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 DelhiCall Girls in Delhi
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsApsara Of India
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear RegressionRavindra Nath Shukla
 
It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayNZSG
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLSeo
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesDipal Arora
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Dave Litwiller
 
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Tina Ji
 

Recently uploaded (20)

Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
HONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsHONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael Hawkins
 
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023
 
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
 
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyThe Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
 
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurVIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
 
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
 
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear Regression
 
It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 May
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
 
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
 

Scalable Storage Configuration for the Physics Database Services

  • 1. Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005
  • 2. Database Workshop, October 2005 L.Canali, CERN 2 Outline • In this talk I will discuss – Storage configuration for scalable database services • Main challenges • Best practices – An implementation of scalable storage • Impacts on DB logical to physical mapping • Performance and resource allocations – How we can help you to size new database projects or to scale up existing applications • Performance testing • Benchmark data
  • 3. Database Workshop, October 2005 L.Canali, CERN 3 Oracle Scalable Architecture Goal: A database infrastructure that provides the required system resources to the end-users and applications. How: A modular architecture that can scale up to a large number of components
  • 4. Database Workshop, October 2005 L.Canali, CERN 4 RAID 1: HA Storage • Mirroring – 2-Way mirroring (RAID 1) protects against single point of failures – Can be used to redistribute I/O load (performance)
  • 5. Database Workshop, October 2005 L.Canali, CERN 5 RAID 0: Scalable Performances • RAID 0 (Striping) automatically redistributes files across multiple disks. • Performance and scalability are increased • Error resiliency is decreased Unstriped Disks Striped Disks
  • 6. Database Workshop, October 2005 L.Canali, CERN 6 Mechanical and Geometrical constraints • The external part of the disk provides – More throughput – Less latency
  • 7. Database Workshop, October 2005 L.Canali, CERN 7 S.A.M.E Strategy • Goal: optimize storage I/O utilization • S.A.M.E. (Stripe And Mirror Everything) Strategy – Built on the concepts of RAID 1 + 0 – Proposed by J. Loaiza (Oracle) in 1999 – Replaces “old recipes”: manual balancing across volumes • Need a Software or Hardware Volume Manager – ASM is Oracle’s solution with 10g “S.A.M.E. out of the box” – Other solutions available from different vendors require configuration
  • 8. Database Workshop, October 2005 L.Canali, CERN 8 Storage Configuration Guidelines • Use all available disk drives • Place frequently used data at outer half of disk – Fastest transfer rate – Minimize seek time • Stripe data at 1MB extents – Distribute the workload across disks – Eliminate hot spots – Optimum sequential bandwidth gained with1MB I/O • Stripe redo logs across multiple drives – Maximize write throughput for small writes – Smaller stripe size (128KB) and/or dedicated disks • Use cache on the controller – ‘Write-back’ cache – Battery-backed cache
  • 9. Database Workshop, October 2005 L.Canali, CERN 9 Oracle’s ASM Main Features • Mirror protection: – 2-way and 3-way mirroring available. – Mirror on a per-file basis – Can mirror across storage arrays • Data striping across the volume: – 1MB and 128KB stripes available • Supports clustering and single instance • Dynamic data distribution – A solution to avoid ‘hot spots’ – On-line add/drop disk with minimal data relocation – Automatic database file management • Database File System with performance of RAW I/O
  • 10. Database Workshop, October 2005 L.Canali, CERN 10 ASM’s Configuration – Examples • ASM is a volume manager, its output are disk groups (DG) that Oracle databases can mount to allocate their files RECOVERY-DG DATA-DG Config 1: Disk groups created with dedicated disks Config 2: Disk groups created by ‘horizontal’ slicing
  • 11. Database Workshop, October 2005 L.Canali, CERN 11 Proposed Storage Configuration • Proposed storage configuration: – High availability - Allows backups to disk – High performance - Allows clusterware mirroring (10.2) – DBs have dedicated resources Storage Arrays Oracle RAC Nodes Data DG-2 Recovery DG-1 Data DG-1 Recovery DG-2 Disk Groups (ASM) DB N.1 DB N.2
  • 12. Database Workshop, October 2005 L.Canali, CERN 12 FAQ 1: Datafiles • Do I need to worry on the number and names of the datafiles allocated for each tablespace? • “Traditional” storage allocation across multiple volumes: – Requires a careful allocation of multiple datafiles across logical volumes and/or filesystems – Datafile-to-filesystem and filesystem-to-physical storage mappings have to be frequently tuned • S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access across disks – There is NO NEED, for performance reasons, to allocate multiple datafiles per tablespace. – 10g new feature “bigfile tablespace” allows for tablespaces with a single datafile that can grow up to 32 TB (db_block_size=8k)
  • 13. Database Workshop, October 2005 L.Canali, CERN 13 FAQ 2: Data and Index Tablespaces • Do I need dedicated tablespaces for indexes and tables? • Separation of indexes and tables has often been advised to: – Distribute I/O – Reduce fragmentation – Allow separate backup of tables and indexes • S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access across disks – No performance gains are expected by using dedicated tablespaces for INDEXes and TABLEs. • Additional Notes: – Tablespaces fragmentation has little impact when using locally managed TBSs and automatic segment space management (9i and 10g) – Very large database can profit from using multiple tablespaces for admin purposes and logical separation of objects
  • 14. Database Workshop, October 2005 L.Canali, CERN 14 FAQ 3: Sizing Storage • Storage sizing for database should not take size as the first requirement – Bandwidth and performance metrics are bound to the number of disk spindles – Magnetic HD technology has improved the GByte/$ ratio – The rest of HD technology has not seen much improvements in the last 5 years (since 15K rpms HDs) • Sizing for storage requirements should – Be based on stress test measurements – Past performance measurements on comparable systems – New projects can leverage benchmark data. • Extra HD space is not wasted – Can be used to strengthen the B&R policy with Disk Backups
  • 15. Database Workshop, October 2005 L.Canali, CERN 15 IO Benchmark Measurements • Benchmark data can be used for – Sizing of new projects and upgrades – Performance baseline, testing for new hardware • The following metrics have been measured: – Sequential throughput (full scans) – Random access (indexed access) – I/O per second (indexed access) – Metrics are measured as a function of workload • Other test details – Benchmark tool: Oracle’s ORION – Infortrend Storage array: 16 SATA x 400 GB disks, 1 controller and 1 GB cache
  • 16. Database Workshop, October 2005 L.Canali, CERN 16 IO Benchmark Data Sequential Workload Performance (Read Only Workload) 0 20 40 60 80 100 120 140 160 1 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 Load (N# Outstanding asynch I/Os) Throughput (MBps) 12 LUNs, noraid 16 LUNs, no raid, external partition Throughput: + 25%
  • 17. Database Workshop, October 2005 L.Canali, CERN 17 IO Benchmark Data Latency for Small Random I/O (Read Only Workload) 0 20 40 60 80 100 120 140 160 1 2 3 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68 72 76 80 Load (N# Outstanding asynch I/Os) Latency (ms) 12 LUNs, noraid 16 LUNs, no raid, external partition Latency: - 35%
  • 18. Database Workshop, October 2005 L.Canali, CERN 18 IO Benchmark Data Small Random I/O Performance (Read Only Workload) 0 100 200 300 400 500 600 700 800 900 1000 1 4 16 28 40 52 64 76 Load (N# Outstanding asynch I/Os) I/O per second 12 LUNs, noraid 16 LUNs, no raid, external partition I/O /sec: +60%
  • 19. Database Workshop, October 2005 L.Canali, CERN 19 Conclusions • The Database Services for Physics can provide – Scalable Database services • Scalable on CPU and Memory resources • Scalable on Storage resources – Sizing for new projects or upgrades • Stress/Performance testing • Integration and Validation Testing • Benchmark data for capacity planning
  • 20. Database Workshop, October 2005 L.Canali, CERN 20 Additional Benchmark Data Sequential Workload Performance (Read Only Workload) 0 20 40 60 80 100 120 140 1 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 Load (N# Outstanding asynch I/Os) Throughput (MBps) 7 LUNs, raid0 14 LUNs, no raid 14 LUNs, no raid, outer partition used 14 LUN, no raid, inner partition used
  • 21. Database Workshop, October 2005 L.Canali, CERN 21 Additional Benchmark Data Latency for Small Random I/O (Read Only Workload) 0 20 40 60 80 100 120 140 1 2 3 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68 72 76 80 Load (N# Outstanding asynch I/Os) Latency (ms) 7 LUNs, raid0 14 LUNs, no raid 14 LUNs, no raid, outer partition used 14 LUNS, no raid, inner partition used
  • 22. Database Workshop, October 2005 L.Canali, CERN 22 Additional Benchmark Data Small Random I/O Performance (Read Only Workload) 0 100 200 300 400 500 600 700 800 900 1 4 16 28 40 52 64 76 Load (N# Outstanding asynch I/Os) I/O per second 7 LUNs, raid0 14 LUNs, no raid 14 LUNs, noraid, outer partition used 14 LUNS, noraid, inner partition used