SlideShare a Scribd company logo
1 of 18
Download to read offline
Gluster for Geeks:
Performance Tips &
Tricks

Jacob Shucart
August 25th, 2011
Some Housekeeping Items…

 Ask a question at any time              How To Ask a Question?

 Questions will be answered at
the end of the webinar


 Slides will be available after
the webinar


 The webinar is being
recorded


            A Better Way To Do Storage                       2
Gluster for Geeks

  The Gluster for Geeks webinar series is designed
 for technical audiences who are familiar with
 GlusterFS


  In this edition, “Performance tuning tips and tricks”
 we will discuss in detail the performance related
 considerations for a running a GlusterFS
 deployment




        A Better Way To Do Storage                    3
Topics

 Planning
 Configuration
 Implementing
 Tuning
 Benchmarking
 Top 5 Issues




         A Better Way To Do Storage   4
Planning – Key Considerations

 Performance requirements
  – What performance do you need to hit & how do you plan to get to it?
       •   Read
       •   Write
       •   Throughput
       •   Availability
 For a given performance level what type is required?
  – E.g. for a throughput of X and capacity of Y what is needed?
 Workloads
  –   What is the workload in the environment?
  –   Small files?
  –   Large files?
  –   Is throughput your only consideration?
  –   What is the application?



             A Better Way To Do Storage                              5
Planning - Sizing and Architecture

 Gluster performance relies on hardware/underlying infrastructure
  –   CPU, memory, disks, network
  –   Virtual machine & cloud infrastructure
  –   Number of systems in the cluster depends on performance and capacity requirements
  –   There are many ways to meet organizational needs
  –   For on-prem 2U & 4U DAS systems and JBODS are great building blocks
 Examples: 3 common deployment scenarios
  – Capacity-centric environments
       • 2U & 4U DAS systems with multiple JBODS
       • Lower RAM and CPU requirements
       • Lower network requirements
  – Mixed capacity and performance environments
       • 2U & 4U DAS systems with 1-2 JBODS max
       • Higher RAM and CPU requirements
       • Low to high network requirements
  – High performance environments
       • 1U or 2U systems with no JBODS
       • Highest RAM and CPU requirements
       • Fast disks and fast network




            A Better Way To Do Storage                                                    6
Configuration

 Choosing the correct volume type for a workload
 Volume options include
  – Distribute – higher performance, no redundancy
  – Replicate(or distribute+replicate) – general purpose, HA, faster
    reads
  – Stripe(or distribute+stripe) – high concurrent reads, low writes, no
    redundancy
 Protocols & performance
  – GlusterFS gives the best overall performance (pNFS like
    functionality)
  – NFS gives excellent performance given right workload
  – CIFS should only be used for Windows systems
 Data flow
  – How do supported protocols differ?



         A Better Way To Do Storage                                        7
Implementing – Cluster Hardware Configuration

 Node and cluster configurations
  – More CPU means greater parallel threads on servers
  – More RAM means more cached operations
  – More network means more throughput
  Dedicated backend network for node
 communication
  – Dedicated back end network should be used for NFS and
    CIFS
  – Recommend 10GBe minimum
  GlusterFS native only uses inter-node
 communication for management calls


        A Better Way To Do Storage                          8
Implementing Gluster - Fundamentals

 Distribute only
  • Non-redundant at the brick level
    •   Cuts hardware, software costs in half.
    •   Failure of a brick or node results in loss of access to the data on those bricks.
    •   Writes destined to the failed brick will fail.
    •   Redundant RAID, hardware is strongly recommended.




           A Better Way To Do Storage                                                       9
Implementing Gluster - Fundamentals

 Distribute with replica
  • Redundant at the brick level
    •   Failure of a brick or node does not affect I/O.
    •   Writes are written simultaneously to each replica.
    •   Any number of replicas are supported.
    •   Gluster Native, CIFS, and NFS support stateful failover. (Gluster Native only in AWS)
    •   Redundant RAID, hardware is strongly recommended.




          A Better Way To Do Storage                                                            10
Implementing Gluster - Fundamentals

 Gluster Native client data flow




       A Better Way To Do Storage     11
Implementing Gluster - Fundamentals
 NFS, CIFS dataflow




       A Better Way To Do Storage     12
Tuning

 Key tuning parameters
  –   performance.write-behind-window-size 65535 (in bytes)
  –   performance.cache-refresh-timeout 1 (in seconds)
  –   performance.cache-size 1073741824 (in bytes)
  –   performance.read-ahead off (only for 1GbE)
  –   Default settings are suitable for mixed workloads
 Tuning for different environments
  – For Amazon, m1.xlarge or greater
  – Understand hardware/firmware settings and their impact on
    performance(for example, CPU frequency scaling and IB,
    10GbE and the TCP Offload Engine)



          A Better Way To Do Storage                            13
Benchmarking

 From the Gluster Performance white paper
  – iozone –R –l 3 –u 5 –r 512k –s 256m –F /mnt/1 /mnt/2 /mnt/3
    /mnt/4 /mnt/5
  – dd if=/dev/zero of=/mnt/test bs=1M count=1

 Performance expectations
  – Get a baseline benchmark of disks on systems
  – What can you expect from your network?

 IOPS vs. throughput
  – Is your workload better measured in throughput
  – Certain operations have different impact(dir creation)
  – If IOPS is your measurement remember latency



         A Better Way To Do Storage                               14
Top 5 Causes for Performance Issues

  Straight from our professional services
 performance team
1.   Underpowered/mis-configured disks
2.   Underpowered/mis-configured network
3.   Faulty hardware(broken/bad blocks/etc)
4.   Too few servers
5.   Wrong protocol for the job




         A Better Way To Do Storage           15
Conclusion

 GlusterFS performance depends heavily on the underlying
hardware
 You should understand your workloads to guide your
hardware configuration
The default parameters work well for general workloads
Several tuning parameters are available
 When experiencing performance issues check the disks
and network first




          A Better Way To Do Storage                       16
Polling Question

What should we talk about in next months Gluster
             Geeks Only webinar?
      A.    Setting up a basic Gluster cluster
                B.    Gluster Geo-Replication
           C.    Frequently Asked Questions
                     D.    Gluster Translators
                 E.       Other technical topics




           A Better Way To Do Storage              17
Questions & Resources

What are your performance questions?
 – Ask now using the Go-to-webinar questions panel

Helpful resources
 – Performance white paper posted here:
   http://www.gluster.com/products/resources/
 – Documentation: http://gluster.com/community/documentation
 – Questions?: http://community.gluster.org/




            A Better Way To Do Storage                         18

More Related Content

What's hot

Hive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep DiveHive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep Dive
DataWorks Summit
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBase
enissoz
 

What's hot (20)

Ceph scale testing with 10 Billion Objects
Ceph scale testing with 10 Billion ObjectsCeph scale testing with 10 Billion Objects
Ceph scale testing with 10 Billion Objects
 
Crimson: Ceph for the Age of NVMe and Persistent Memory
Crimson: Ceph for the Age of NVMe and Persistent MemoryCrimson: Ceph for the Age of NVMe and Persistent Memory
Crimson: Ceph for the Age of NVMe and Persistent Memory
 
A crash course in CRUSH
A crash course in CRUSHA crash course in CRUSH
A crash course in CRUSH
 
When is MyRocks good?
When is MyRocks good? When is MyRocks good?
When is MyRocks good?
 
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
 
OpenStack Backup, Restore, DR (Freezer)
OpenStack Backup, Restore, DR (Freezer)OpenStack Backup, Restore, DR (Freezer)
OpenStack Backup, Restore, DR (Freezer)
 
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
Apache Ignite vs Alluxio: Memory Speed Big Data AnalyticsApache Ignite vs Alluxio: Memory Speed Big Data Analytics
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
 
Understanding and Measuring I/O Performance
Understanding and Measuring I/O PerformanceUnderstanding and Measuring I/O Performance
Understanding and Measuring I/O Performance
 
Red Hat Gluster Storage Performance
Red Hat Gluster Storage PerformanceRed Hat Gluster Storage Performance
Red Hat Gluster Storage Performance
 
Hive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep DiveHive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep Dive
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBase
 
Ceph as software define storage
Ceph as software define storageCeph as software define storage
Ceph as software define storage
 
5 Steps to PostgreSQL Performance
5 Steps to PostgreSQL Performance5 Steps to PostgreSQL Performance
5 Steps to PostgreSQL Performance
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
RocksDB compaction
RocksDB compactionRocksDB compaction
RocksDB compaction
 
Lisa 2015-gluster fs-hands-on
Lisa 2015-gluster fs-hands-onLisa 2015-gluster fs-hands-on
Lisa 2015-gluster fs-hands-on
 
Lisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introductionLisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introduction
 
Percona xtrabackup - MySQL Meetup @ Mumbai
Percona xtrabackup - MySQL Meetup @ MumbaiPercona xtrabackup - MySQL Meetup @ Mumbai
Percona xtrabackup - MySQL Meetup @ Mumbai
 
Big table
Big tableBig table
Big table
 
Ceph and RocksDB
Ceph and RocksDBCeph and RocksDB
Ceph and RocksDB
 

Viewers also liked

Filesystem Comparison: NFS vs GFS2 vs OCFS2
Filesystem Comparison: NFS vs GFS2 vs OCFS2Filesystem Comparison: NFS vs GFS2 vs OCFS2
Filesystem Comparison: NFS vs GFS2 vs OCFS2
Giuseppe Paterno'
 
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Gluster.org
 

Viewers also liked (20)

Performance comparison of Distributed File Systems on 1Gbit networks
Performance comparison of Distributed File Systems on 1Gbit networksPerformance comparison of Distributed File Systems on 1Gbit networks
Performance comparison of Distributed File Systems on 1Gbit networks
 
Erasure codes and storage tiers on gluster
Erasure codes and storage tiers on glusterErasure codes and storage tiers on gluster
Erasure codes and storage tiers on gluster
 
Filesystem Comparison: NFS vs GFS2 vs OCFS2
Filesystem Comparison: NFS vs GFS2 vs OCFS2Filesystem Comparison: NFS vs GFS2 vs OCFS2
Filesystem Comparison: NFS vs GFS2 vs OCFS2
 
Gluster Data Tiering
Gluster Data TieringGluster Data Tiering
Gluster Data Tiering
 
State of Gluster Performance
State of Gluster PerformanceState of Gluster Performance
State of Gluster Performance
 
Gluster.community.day.2013
Gluster.community.day.2013Gluster.community.day.2013
Gluster.community.day.2013
 
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
Performance bottlenecks for metadata workload in Gluster with Poornima Gurusi...
 
Gfs vs hdfs
Gfs vs hdfsGfs vs hdfs
Gfs vs hdfs
 
GlusterFS座談会テクニカルセッション
GlusterFS座談会テクニカルセッションGlusterFS座談会テクニカルセッション
GlusterFS座談会テクニカルセッション
 
The Google File System (GFS)
The Google File System (GFS)The Google File System (GFS)
The Google File System (GFS)
 
Gluster the ugly parts with Jeff Darcy
Gluster  the ugly parts with Jeff DarcyGluster  the ugly parts with Jeff Darcy
Gluster the ugly parts with Jeff Darcy
 
Hands On Gluster with Jeff Darcy
Hands On Gluster with Jeff DarcyHands On Gluster with Jeff Darcy
Hands On Gluster with Jeff Darcy
 
Gluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmapGluster fs current_features_and_roadmap
Gluster fs current_features_and_roadmap
 
Deep Dive: Maximizing EC2 and EBS Performance
Deep Dive: Maximizing EC2 and EBS PerformanceDeep Dive: Maximizing EC2 and EBS Performance
Deep Dive: Maximizing EC2 and EBS Performance
 
(SDD416) Amazon EBS Deep Dive | AWS re:Invent 2014
(SDD416) Amazon EBS Deep Dive | AWS re:Invent 2014(SDD416) Amazon EBS Deep Dive | AWS re:Invent 2014
(SDD416) Amazon EBS Deep Dive | AWS re:Invent 2014
 
Red Hat Storage Server For AWS
Red Hat Storage Server For AWSRed Hat Storage Server For AWS
Red Hat Storage Server For AWS
 
멀티 티어 시스템을 위한 통합 성능 관리 방안_APM
멀티 티어 시스템을 위한 통합 성능 관리 방안_APM멀티 티어 시스템을 위한 통합 성능 관리 방안_APM
멀티 티어 시스템을 위한 통합 성능 관리 방안_APM
 
Challenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan LambrightChallenges with Gluster and Persistent Memory with Dan Lambright
Challenges with Gluster and Persistent Memory with Dan Lambright
 
Erasure Coding Costs and Benefits
Erasure Coding Costs and BenefitsErasure Coding Costs and Benefits
Erasure Coding Costs and Benefits
 
Trying and evaluating the new features of GlusterFS 3.5
Trying and evaluating the new features of GlusterFS 3.5Trying and evaluating the new features of GlusterFS 3.5
Trying and evaluating the new features of GlusterFS 3.5
 

Similar to Gluster for Geeks: Performance Tuning Tips & Tricks

Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Community
 
Designing Information Structures For Performance And Reliability
Designing Information Structures For Performance And ReliabilityDesigning Information Structures For Performance And Reliability
Designing Information Structures For Performance And Reliability
bryanrandol
 
Tips and Tricks for SAP Sybase IQ
Tips and Tricks for SAP  Sybase IQTips and Tricks for SAP  Sybase IQ
Tips and Tricks for SAP Sybase IQ
Don Brizendine
 

Similar to Gluster for Geeks: Performance Tuning Tips & Tricks (20)

Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
 
Tuning Linux Windows and Firebird for Heavy Workload
Tuning Linux Windows and Firebird for Heavy WorkloadTuning Linux Windows and Firebird for Heavy Workload
Tuning Linux Windows and Firebird for Heavy Workload
 
SUSE Storage: Sizing and Performance (Ceph)
SUSE Storage: Sizing and Performance (Ceph)SUSE Storage: Sizing and Performance (Ceph)
SUSE Storage: Sizing and Performance (Ceph)
 
Presentation db2 best practices for optimal performance
Presentation   db2 best practices for optimal performancePresentation   db2 best practices for optimal performance
Presentation db2 best practices for optimal performance
 
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
 
Capacity Planning
Capacity PlanningCapacity Planning
Capacity Planning
 
MongoDB Capacity Planning
MongoDB Capacity PlanningMongoDB Capacity Planning
MongoDB Capacity Planning
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
 
Presentation db2 best practices for optimal performance
Presentation   db2 best practices for optimal performancePresentation   db2 best practices for optimal performance
Presentation db2 best practices for optimal performance
 
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
 
071410 sun a_1515_feldman_stephen
071410 sun a_1515_feldman_stephen071410 sun a_1515_feldman_stephen
071410 sun a_1515_feldman_stephen
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Backup management with Ceph Storage - Camilo Echevarne, Félix Barbeira
Backup management with Ceph Storage - Camilo Echevarne, Félix BarbeiraBackup management with Ceph Storage - Camilo Echevarne, Félix Barbeira
Backup management with Ceph Storage - Camilo Echevarne, Félix Barbeira
 
Oracle real application_cluster
Oracle real application_clusterOracle real application_cluster
Oracle real application_cluster
 
Designing Information Structures For Performance And Reliability
Designing Information Structures For Performance And ReliabilityDesigning Information Structures For Performance And Reliability
Designing Information Structures For Performance And Reliability
 
Tips and Tricks for SAP Sybase IQ
Tips and Tricks for SAP  Sybase IQTips and Tricks for SAP  Sybase IQ
Tips and Tricks for SAP Sybase IQ
 
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
 
Taking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - ArchitectureTaking Splunk to the Next Level - Architecture
Taking Splunk to the Next Level - Architecture
 
Comparison of-foss-distributed-storage
Comparison of-foss-distributed-storageComparison of-foss-distributed-storage
Comparison of-foss-distributed-storage
 

More from GlusterFS

Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFS
GlusterFS
 
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS
 

More from GlusterFS (14)

Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFS
 
Introduction to GlusterFS Webinar - September 2011
Introduction to GlusterFS Webinar - September 2011Introduction to GlusterFS Webinar - September 2011
Introduction to GlusterFS Webinar - September 2011
 
Webinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS Storage
Webinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS StorageWebinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS Storage
Webinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS Storage
 
Intro to GlusterFS Webinar - August 2011
Intro to GlusterFS Webinar - August 2011Intro to GlusterFS Webinar - August 2011
Intro to GlusterFS Webinar - August 2011
 
Gluster Webinar: Introduction to GlusterFS v3.3
Gluster Webinar: Introduction to GlusterFS v3.3Gluster Webinar: Introduction to GlusterFS v3.3
Gluster Webinar: Introduction to GlusterFS v3.3
 
Cloud Storage Adoption, Practice, and Deployment
Cloud Storage Adoption, Practice, and DeploymentCloud Storage Adoption, Practice, and Deployment
Cloud Storage Adoption, Practice, and Deployment
 
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 Meetup
 
Gluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFSGluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFS
 
Award winning scale-up and scale-out storage for Xen
Award winning scale-up and scale-out storage for XenAward winning scale-up and scale-out storage for Xen
Award winning scale-up and scale-out storage for Xen
 
Future of cloud storage
Future of cloud storageFuture of cloud storage
Future of cloud storage
 
Gluster Webinar May 25: Whats New in GlusterFS 3.2
Gluster Webinar May 25: Whats New in GlusterFS 3.2Gluster Webinar May 25: Whats New in GlusterFS 3.2
Gluster Webinar May 25: Whats New in GlusterFS 3.2
 
Gluster Blog 11.15.2010
Gluster Blog 11.15.2010Gluster Blog 11.15.2010
Gluster Blog 11.15.2010
 
Gluster Storage Platform Installation Guide
Gluster Storage Platform Installation GuideGluster Storage Platform Installation Guide
Gluster Storage Platform Installation Guide
 
How to Install Gluster Storage Platform
How to Install Gluster Storage PlatformHow to Install Gluster Storage Platform
How to Install Gluster Storage Platform
 

Recently uploaded

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Gluster for Geeks: Performance Tuning Tips & Tricks

  • 1. Gluster for Geeks: Performance Tips & Tricks Jacob Shucart August 25th, 2011
  • 2. Some Housekeeping Items… Ask a question at any time How To Ask a Question? Questions will be answered at the end of the webinar Slides will be available after the webinar The webinar is being recorded A Better Way To Do Storage 2
  • 3. Gluster for Geeks The Gluster for Geeks webinar series is designed for technical audiences who are familiar with GlusterFS In this edition, “Performance tuning tips and tricks” we will discuss in detail the performance related considerations for a running a GlusterFS deployment A Better Way To Do Storage 3
  • 4. Topics Planning Configuration Implementing Tuning Benchmarking Top 5 Issues A Better Way To Do Storage 4
  • 5. Planning – Key Considerations Performance requirements – What performance do you need to hit & how do you plan to get to it? • Read • Write • Throughput • Availability For a given performance level what type is required? – E.g. for a throughput of X and capacity of Y what is needed? Workloads – What is the workload in the environment? – Small files? – Large files? – Is throughput your only consideration? – What is the application? A Better Way To Do Storage 5
  • 6. Planning - Sizing and Architecture Gluster performance relies on hardware/underlying infrastructure – CPU, memory, disks, network – Virtual machine & cloud infrastructure – Number of systems in the cluster depends on performance and capacity requirements – There are many ways to meet organizational needs – For on-prem 2U & 4U DAS systems and JBODS are great building blocks Examples: 3 common deployment scenarios – Capacity-centric environments • 2U & 4U DAS systems with multiple JBODS • Lower RAM and CPU requirements • Lower network requirements – Mixed capacity and performance environments • 2U & 4U DAS systems with 1-2 JBODS max • Higher RAM and CPU requirements • Low to high network requirements – High performance environments • 1U or 2U systems with no JBODS • Highest RAM and CPU requirements • Fast disks and fast network A Better Way To Do Storage 6
  • 7. Configuration Choosing the correct volume type for a workload Volume options include – Distribute – higher performance, no redundancy – Replicate(or distribute+replicate) – general purpose, HA, faster reads – Stripe(or distribute+stripe) – high concurrent reads, low writes, no redundancy Protocols & performance – GlusterFS gives the best overall performance (pNFS like functionality) – NFS gives excellent performance given right workload – CIFS should only be used for Windows systems Data flow – How do supported protocols differ? A Better Way To Do Storage 7
  • 8. Implementing – Cluster Hardware Configuration Node and cluster configurations – More CPU means greater parallel threads on servers – More RAM means more cached operations – More network means more throughput Dedicated backend network for node communication – Dedicated back end network should be used for NFS and CIFS – Recommend 10GBe minimum GlusterFS native only uses inter-node communication for management calls A Better Way To Do Storage 8
  • 9. Implementing Gluster - Fundamentals Distribute only • Non-redundant at the brick level • Cuts hardware, software costs in half. • Failure of a brick or node results in loss of access to the data on those bricks. • Writes destined to the failed brick will fail. • Redundant RAID, hardware is strongly recommended. A Better Way To Do Storage 9
  • 10. Implementing Gluster - Fundamentals Distribute with replica • Redundant at the brick level • Failure of a brick or node does not affect I/O. • Writes are written simultaneously to each replica. • Any number of replicas are supported. • Gluster Native, CIFS, and NFS support stateful failover. (Gluster Native only in AWS) • Redundant RAID, hardware is strongly recommended. A Better Way To Do Storage 10
  • 11. Implementing Gluster - Fundamentals Gluster Native client data flow A Better Way To Do Storage 11
  • 12. Implementing Gluster - Fundamentals NFS, CIFS dataflow A Better Way To Do Storage 12
  • 13. Tuning Key tuning parameters – performance.write-behind-window-size 65535 (in bytes) – performance.cache-refresh-timeout 1 (in seconds) – performance.cache-size 1073741824 (in bytes) – performance.read-ahead off (only for 1GbE) – Default settings are suitable for mixed workloads Tuning for different environments – For Amazon, m1.xlarge or greater – Understand hardware/firmware settings and their impact on performance(for example, CPU frequency scaling and IB, 10GbE and the TCP Offload Engine) A Better Way To Do Storage 13
  • 14. Benchmarking From the Gluster Performance white paper – iozone –R –l 3 –u 5 –r 512k –s 256m –F /mnt/1 /mnt/2 /mnt/3 /mnt/4 /mnt/5 – dd if=/dev/zero of=/mnt/test bs=1M count=1 Performance expectations – Get a baseline benchmark of disks on systems – What can you expect from your network? IOPS vs. throughput – Is your workload better measured in throughput – Certain operations have different impact(dir creation) – If IOPS is your measurement remember latency A Better Way To Do Storage 14
  • 15. Top 5 Causes for Performance Issues Straight from our professional services performance team 1. Underpowered/mis-configured disks 2. Underpowered/mis-configured network 3. Faulty hardware(broken/bad blocks/etc) 4. Too few servers 5. Wrong protocol for the job A Better Way To Do Storage 15
  • 16. Conclusion GlusterFS performance depends heavily on the underlying hardware You should understand your workloads to guide your hardware configuration The default parameters work well for general workloads Several tuning parameters are available When experiencing performance issues check the disks and network first A Better Way To Do Storage 16
  • 17. Polling Question What should we talk about in next months Gluster Geeks Only webinar? A. Setting up a basic Gluster cluster B. Gluster Geo-Replication C. Frequently Asked Questions D. Gluster Translators E. Other technical topics A Better Way To Do Storage 17
  • 18. Questions & Resources What are your performance questions? – Ask now using the Go-to-webinar questions panel Helpful resources – Performance white paper posted here: http://www.gluster.com/products/resources/ – Documentation: http://gluster.com/community/documentation – Questions?: http://community.gluster.org/ A Better Way To Do Storage 18