Apache Performance Tuning
Part 2: Scaling Out
Sander Temme <sander@temme.net>
Agenda
• Introduction
• Redundancy in Hardware
• Building Out: Separate Tiers
• Building Out: Load Balancing
• Caching Content
• Conclusion
Introduction
• Why do This?
– Scalability (Oh my gosh, I’m so popular!)
– Reliability (We need five nines!)
• Why NOT do This?
– It costs money
Redundancy in Hardware
• Moving Parts Break
– Hard Disks
– Power Supplies
– Hard Disks
– Fans
– Hard Disks
• Buy High Quality Disks
– Refurbished, OEM, Brand Name
– Which has longer warranty?
– Which is more reliable?
Server Configuration
• Mirror those Disks
– Install the RAID utility
– Have it warn you
– RAID is no good if you don’t learn of failures!
• Redundant Power Supplies
– On different circuits
Scaling Vertically
• Move Services to Other Hosts
• Pros:
– Less resource contention
– Specialized hardware
– Scale out tiers individually
• Cons:
– Development/Deployment harder
– More hosts to manage
Scaling Horizontally
• Multiple servers per tier
• All receive requests
• All serve same content
• Some arbitration scheme
Load Balancing Schemes
• DNS Tricks
• Peer Server Pools
– Network Load Balancing (Win2k3)
– Wackamole
• Load Balancing Appliance
– Box from F5, Juniper, Cisco, Foundry, …
– Linux Virtual Server
DNS Round-Robin
• Easy!
• Multiple A Records in DNS Zone File
• Not Smart:
– DNS Lookups are cached
– Load on Server
– Server Outage
Example Zone File
scalingout.org. 86400 IN SOA ns.scalingout.org. sctemme.scalingout.org. (
                2006051401 ; Serial
                86400      ; refresh (1 day)
                7200       ; retry (2 hours)
                8640000    ; expire (100 days)
                86400 )    ; minimum (1 day)
scalingout.org. IN NS bagheera.scalingout.org.
gw       IN A     10.11.0.1
bagheera IN A     10.11.0.2
; ...
mail     IN CNAME bagheera
ns       IN CNAME bagheera
www      IN A     10.11.0.113
         IN A     10.11.0.114
         IN A     10.11.0.115
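What the three A records for www amount to can be sketched in Python. This is a simplified model, assuming the name server rotates the address list on every answer and each client uses the first address it gets; the function names are hypothetical, and real resolvers and caches behave less neatly.

```python
from collections import Counter, deque

# The three A records for www from the zone file above.
WWW_RECORDS = ["10.11.0.113", "10.11.0.114", "10.11.0.115"]


def make_round_robin(records):
    """Return a lookup() that rotates the record list on every answer."""
    ring = deque(records)

    def lookup():
        answer = list(ring)   # full record set in its current order
        ring.rotate(-1)       # the next answer starts with the next address
        return answer

    return lookup


def simulate(lookups, down=frozenset()):
    """Count which address each client would try first.

    `down` is a set of failed addresses. DNS knows nothing about it,
    so those addresses are still handed out.
    """
    lookup = make_round_robin(WWW_RECORDS)
    first_choice = Counter(lookup()[0] for _ in range(lookups))
    failed = sum(n for addr, n in first_choice.items() if addr in down)
    return first_choice, failed


if __name__ == "__main__":
    hits, failed = simulate(300, down={"10.11.0.114"})
    print(dict(hits))
    print("requests sent to a dead server:", failed)
```

In this model the load spreads evenly, but a third of the clients still land on the server that is down, which is the "Server Outage" weakness of the scheme.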
Peer-based: NLB
• Windows 2000 Server Enterprise Ed., Windows Server 2003
• Up to 32 hosts in cluster
• All hosts assume cluster IP, MAC
• NLB makes LB decision
– Only one host gets to answer TCP handshake
• Should be application independent
Peer-based: Wackamole
• High Availability Solution
• When Host Fails
– Other hosts take over its IP addresses
– Distribute IP addresses among cluster
– Every IP address reliably available
• No Load Balancing!
– Use with RR DNS (or something)
http://www.backhand.org/wackamole/
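The takeover idea can be sketched in a few lines of Python. This is not Wackamole's actual algorithm, only an illustration of the outcome it aims for: every virtual IP address stays assigned to some live host. The host names are hypothetical.

```python
def assign_addresses(virtual_ips, live_hosts):
    """Spread every virtual IP across the hosts that are still alive."""
    if not live_hosts:
        raise RuntimeError("no live hosts left to hold the addresses")
    hosts = sorted(live_hosts)
    assignment = {host: [] for host in hosts}
    # Deal the addresses out in turn so no host is left idle.
    for i, ip in enumerate(sorted(virtual_ips)):
        assignment[hosts[i % len(hosts)]].append(ip)
    return assignment


if __name__ == "__main__":
    vips = ["10.11.0.113", "10.11.0.114", "10.11.0.115"]
    print(assign_addresses(vips, {"web1", "web2", "web3"}))
    # web2 fails: its address moves to one of the survivors.
    print(assign_addresses(vips, {"web1", "web3"}))
```

Note that nothing here balances load: a surviving host simply answers on more addresses, which is why the slide pairs this with round-robin DNS.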
Load Balancing Device
[Diagram: Client and Internet in front of the load balancing device]
Load Balancing
• One Load Balancer
• Many Web Servers
• Choice of Balancing Schemes
– Round-robin, Least Used, …
• Reliability
– Heartbeats, unavailable servers don’t receive requests
• Feature War
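The two balancing schemes named above, combined with a health flag, might look like this in Python. It is a minimal sketch under stated assumptions: the `Backend` and `Balancer` classes are hypothetical, the `healthy` flag stands in for whatever heartbeat the device runs, and "least used" is read as "fewest requests in flight".

```python
import itertools


class Backend:
    def __init__(self, name):
        self.name = name
        self.healthy = True   # would be updated by a heartbeat check
        self.active = 0       # requests currently in flight


class Balancer:
    def __init__(self, backends):
        self.backends = list(backends)
        self._cycle = itertools.cycle(self.backends)

    def round_robin(self):
        """Next healthy backend in fixed order."""
        for _ in range(len(self.backends)):
            backend = next(self._cycle)
            if backend.healthy:
                return backend
        raise RuntimeError("no healthy backends")

    def least_used(self):
        """Healthy backend with the fewest requests in flight."""
        candidates = [b for b in self.backends if b.healthy]
        if not candidates:
            raise RuntimeError("no healthy backends")
        return min(candidates, key=lambda b: b.active)


if __name__ == "__main__":
    pool = [Backend("web1"), Backend("web2"), Backend("web3")]
    lb = Balancer(pool)
    pool[1].healthy = False   # heartbeat failed: web2 is skipped
    print([lb.round_robin().name for _ in range(4)])
```

Either scheme only ever returns servers whose heartbeat is good, which is the reliability property DNS round-robin lacks.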
Linux Virtual Server
• Free, Open Source, etc.
• IP Virtual Server module in kernel
• Lots of auxiliary modules
– Like a box of Legos
– May come with Your Distribution
• Do It Yourself
http://www.linuxvirtualserver.org/
Example: mod_proxy_balancer
• New in Apache HTTP Server 2.2
• Part of mod_proxy
• Two Load Balancing Methods
– By number of requests
– By number of bytes
• Detects failed backends
Apache Configuration
Listen 80
LogLevel debug
TransferLog logs/access_log
LoadModule proxy_module modules/mod_proxy.so
LoadModule proxy_http_module modules/mod_proxy_http.so
LoadModule proxy_balancer_module modules/mod_proxy_balancer.so
ProxyPass / balancer://mycluster/
ProxyPassReverse / http://1.2.3.4:80
ProxyPassReverse / http://1.2.3.5:80
<Proxy balancer://mycluster>
BalancerMember http://1.2.3.4:80
BalancerMember http://1.2.3.5:80
</Proxy>
Example: Tomcat, mod_jk
• Apache + mod_jk
• Multiple Tomcat servers
• Balancer Worker
Apache Configuration
LoadModule jk_module /Volumes/Files/asf/httpd-r415210w/modules/mod_jk.so
JkMount /servlets-examples/* loadbalancer
JkMount /*.jsp loadbalancer
JkMount /jkmanager/* jkstatus
JkLogFile logs/jk_log
JkLogLevel debug
JkWorkerProperty worker.list=loadbalancer,jkstatus
JkWorkerProperty worker.tc1.port=15109
JkWorkerProperty worker.tc1.host=localhost
JkWorkerProperty worker.tc1.type=ajp13
JkWorkerProperty worker.tc1.lbfactor=1
JkWorkerProperty worker.tc2.port=15209
JkWorkerProperty worker.tc2.host=localhost
JkWorkerProperty worker.tc2.type=ajp13
JkWorkerProperty worker.tc2.lbfactor=1
JkWorkerProperty worker.tc3.port=15309
JkWorkerProperty worker.tc3.host=localhost
JkWorkerProperty worker.tc3.type=ajp13
JkWorkerProperty worker.tc3.lbfactor=1
JkWorkerProperty worker.loadbalancer.type=lb
JkWorkerProperty worker.loadbalancer.balance_workers=tc1, tc2, tc3
JkWorkerProperty worker.jkstatus.type=status
Tomcat Configuration
• Put same content on all Tomcats
• Edit conf/server.xml:
• jvmRoute must match jk worker name!
<Engine name="Catalina" defaultHost="localhost" jvmRoute="tc1">
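Why the names have to match can be shown with a small Python sketch. Tomcat appends ".<jvmRoute>" to the session IDs it issues, and the balancer can use that suffix to send follow-up requests back to the same instance. The function below is a hypothetical illustration of that lookup, not mod_jk's code; the worker list is taken from the mod_jk example.

```python
WORKERS = ["tc1", "tc2", "tc3"]   # balance_workers from the mod_jk example


def route_for(session_id, workers=WORKERS):
    """Return the worker named by the session ID suffix, or None.

    A suffix that names a known worker identifies the instance that
    owns the session. Anything else means the balancer is free to choose.
    """
    if session_id is None or "." not in session_id:
        return None
    route = session_id.rsplit(".", 1)[1]
    return route if route in workers else None


if __name__ == "__main__":
    print(route_for("A1B2C3D4.tc2"))     # suffix matches a worker name
    print(route_for("A1B2C3D4.node7"))   # jvmRoute does not match any worker
```

If an instance is configured with a jvmRoute that is not a worker name, the suffix matches nothing and the session is no longer routed back to its owner.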
Problem: Session State
• HTTP is Stateless
• Apps use Sessions
– Cookies
– URL Encoding
• Session created on single server
– Broken by Load Balancing
– PHP: sessions stored on disk
Solutions: Session State
• “Sticky” routing on Load Balancer
• Store State in DB
• Put benign State in Cookie
– But don’t trust the client too much
• Replicate Sessions on Back-end
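One way to keep state in a cookie without trusting the client is to sign it, so that any server in the pool can detect tampering. The sketch below uses Python's standard hmac module; the key, the cookie layout and the function names are assumptions made for illustration, and the value is signed, not encrypted, so the client can still read it.

```python
import hashlib
import hmac

# Hypothetical key: it must be shared by all servers and never sent to clients.
SECRET_KEY = b"replace-with-a-real-secret"


def _signature(value):
    return hmac.new(SECRET_KEY, value.encode("utf-8"), hashlib.sha256).hexdigest()


def sign(value):
    """Return 'value|signature', suitable for storing in a cookie."""
    return value + "|" + _signature(value)


def verify(cookie):
    """Return the original value if the signature checks out, else None."""
    value, sep, signature = cookie.rpartition("|")
    if not sep:
        return None
    expected = _signature(value)
    # compare_digest avoids leaking how many characters matched.
    if hmac.compare_digest(signature.encode("utf-8"), expected.encode("utf-8")):
        return value
    return None


if __name__ == "__main__":
    cookie = sign("theme=dark")
    print(verify(cookie))                           # the untouched value
    print(verify(cookie.replace("dark", "root")))   # tampered, so None
```

Because every server holds the same key, no session has to be looked up or replicated for this kind of state.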
Tomcat Session Replication
• Share HttpSession objects across instances
• One instance dies, session lives on
• Apache will route requests to other instance
• Uses IP Multicast
• Uses IP Multicast
Session Replication Config
• Uncomment <Cluster> element in server.xml
• Put empty <distributable /> element in <web-app> element in web.xml
Caching Content
• Dynamic Content is Expensive
• Static Content Relatively Cheap
• Several Approaches:
– Dynamic caching
– Pre-rendering popular pages (index.rss…)
mod_cache Configuration
<IfModule mod_cache.c>
<IfModule mod_disk_cache.c>
CacheRoot /raid1/cacheroot
CacheEnable disk /
# A page modified 100 min. ago will expire in 10 min.
CacheLastModifiedFactor .1
# Always check again after 6 hours
CacheMaxExpire 21600
</IfModule>
</IfModule>
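The arithmetic behind the two comments in this configuration can be written out in Python. This is a sketch of the heuristic only, assuming a response that carries a Last-Modified date but no explicit expiry; the function name is hypothetical and the defaults mirror CacheLastModifiedFactor and CacheMaxExpire above.

```python
def heuristic_lifetime(age_seconds, factor=0.1, max_expire=21600):
    """Seconds a response stays fresh when it carries no explicit expiry.

    age_seconds is the time since the response's Last-Modified date.
    The lifetime is a fraction of that age, capped at max_expire.
    """
    if age_seconds < 0:
        raise ValueError("Last-Modified lies in the future")
    return min(age_seconds * factor, max_expire)


if __name__ == "__main__":
    # Modified 100 minutes ago: fresh for about 10 minutes.
    print(heuristic_lifetime(100 * 60) / 60, "minutes")
    # Modified a month ago: the 6 hour cap applies.
    print(heuristic_lifetime(30 * 24 * 3600) / 3600, "hours")
```

Recently changed pages are therefore rechecked often, while pages that have been stable for a long time are still revalidated at least every six hours.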
Make Popular Pages Static
• RSS Feeds
• Popular catalog queries
• … (Check your access log)
Static Page Substitution
<Directory "/home/sctemme/inst/blog/httpd/htdocs">
Options +Indexes
Order allow,deny
Allow from all
RewriteEngine on
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule ^(.*)$ /cgi-bin/blosxom.cgi/$1 [L,QSA]
</Directory>
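The decision these rewrite rules make can be modelled in Python: a file or directory that exists under the document root is served as is, and everything else goes to the CGI script. This is a hypothetical illustration of the logic, not how mod_rewrite is implemented, and it skips details such as guarding against ".." in the path.

```python
import os
import tempfile


def resolve(docroot, url_path, handler="/cgi-bin/blosxom.cgi"):
    """Mimic the rewrite rules: an existing file or directory wins, else CGI."""
    relative = url_path.lstrip("/")
    candidate = os.path.join(docroot, relative)
    if os.path.isfile(candidate) or os.path.isdir(candidate):
        return ("static", candidate)
    return ("dynamic", handler + "/" + relative)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        # Pre-render the popular page by writing it into the document root.
        with open(os.path.join(root, "index.rss"), "w") as f:
            f.write("<rss/>")
        print(resolve(root, "/index.rss"))            # served from disk
        print(resolve(root, "/2008/04/post.html"))    # falls through to the CGI
```

Making a page static is then just a matter of writing its rendered output to the matching path, for example from a periodic job, and deleting the file to make it dynamic again.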
Tuning the Database Tier
• Not my area (sorry)
• Give Money to Oracle Consultants
– (or MySQL) (or …)
• Tip: Separate Read and Write Operations
– Replicate from Write db to Read db
– Read db data slightly stale
• Does it matter?
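In application code, the read/write split could be as simple as the following Python sketch. It assumes the application can classify a statement by its first keyword; the class, the verb list and the database names are hypothetical, and real code would also need to send reads that must not be stale to the write database.

```python
WRITE_VERBS = ("INSERT", "UPDATE", "DELETE", "REPLACE", "CREATE", "ALTER", "DROP")


class SplitRouter:
    def __init__(self, write_db, read_dbs):
        self.write_db = write_db
        self.read_dbs = list(read_dbs)
        self._next = 0

    def target_for(self, sql):
        """Pick the database a statement should go to."""
        words = sql.split(None, 1)
        verb = words[0].upper() if words else ""
        if verb in WRITE_VERBS or not self.read_dbs:
            return self.write_db
        # Reads rotate over the replicas, which may lag slightly behind.
        target = self.read_dbs[self._next % len(self.read_dbs)]
        self._next += 1
        return target


if __name__ == "__main__":
    router = SplitRouter("db-write", ["db-read1", "db-read2"])
    print(router.target_for("SELECT title FROM posts"))
    print(router.target_for("UPDATE posts SET title = 'x' WHERE id = 1"))
```

Whether the staleness matters depends on the page: a catalog listing usually tolerates it, while a page shown right after the user saved something may not.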
Putting it All Together
[Diagram: Client, Internet, Read-only and Write-only databases]
Monitoring the Farm
• Monitor for outages
– More boxes, more failure
– HA can mask failures
• Monitor for performance
– Utilization
– Trends
Monitoring Solutions
• Nagios
– Check services, hosts for outage
– Highly configurable, extendable
– Worth your time investment
• Ganglia
– Monitor for performance
– See Brad Nicholes’s session
Monitoring Caveats
• Takes Time, Effort
– Highly flexible products
• You can’t fix it
– If you don’t know it’s broken
• You can’t tune it
– If you don’t know the bottlenecks
Conference Roadmap
• Monitoring 2.0 - Zenoss, the next level of IT management (Training)
• Apache Performance Tuning Part 1: Scaling Up
• Load-balancing with Apache HTTPD 2.2 and later
• Scaling the download infrastructure with your success
• Break My Site
Current Version
http://people.apache.org/~sctemme/ApconEU2008/
Thank You
