Compare Clustering Methods for MS SQL Server

31,176 views

Published on

Clustering is very important technology for High Availability and it is important for DBA to understand benefits and pitfalls. With very few available techniques and a lot of gray areas right decision might help to avoid extra costs. Presentation is unveiling clustering basics, reviews and compares clustering technologies including Microsoft, XCOTO Gridscale, and HP PolyserveMatrix. This presentation can be helpful not only to beginners but to intermediate level DBAs and infrastructure managers.

Published in: Technology
0 Comments
16 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
31,176
On SlideShare
0
From Embeds
0
Number of Embeds
72
Actions
Shares
0
Downloads
0
Comments
0
Likes
16
Embeds 0
No embeds

No notes for slide

Compare Clustering Methods for MS SQL Server

  1. 1. Available Clustering Technologies<br />Alexander Prusakov<br />Senior SQL Server DBA<br />June, 2010<br />
  2. 2. Consultant - Sr. MS SQL Server DBA<br />20+ years of IT experience (business users, ERPs, databases and applications support, software development, data warehousing, etc.)<br />10+ years of experience with MS SQL Server and RDBMS<br />MBA, MCITP: DBA 2005/2008, IndyPASS, IndyNDA, IIBA member<br />Industries: banking/finance, energy, education, health care, manufacturing, IT consulting.<br />Speaker’s Background:<br />
  3. 3. What we are talking about and why<br />What is clustering: pure theory<br />Clustering with Microsoft, HP, and others<br />Pluses and minuses<br />Conclusion<br />Q&A<br />Agenda:<br />
  4. 4. Target: Hardware failure Database availability<br />Improve hardware quality<br />Duplicate components<br />Duplicate servers<br />Mirroring<br />Log shipping<br />Clustering<br />
  5. 5. Main Entry: cluster<br />Pronunciation: ’kləs-tər<br />Function: noun<br />Etymology: Middle English, from Old English clyster; akin to<br /> Old English clott clot<br />Date: before 12th century<br /><ul><li>A number of similar things that occur together
  6. 6. Two or more consecutive consonants or vowels in a segment of speech
  7. 7. A group of buildings and especially houses built close together on a sizable tract in order to preserve open spaces larger than the individual yard for common recreation
  8. 8. An aggregation of stars or galaxies that appear close together in the sky and are gravitationally associated</li></ul>Merriam-Webster Online.21 October 2009<br />
  9. 9. What we are talking aboutand why<br />
  10. 10. IMPORTANT!<br />Clustering is different from mirroring and/or replication;<br />
  11. 11. What is clustering: pure theory<br />Cluster (computing) - is a group of linked computers, working together closely so that in many respects they form a single computer. Clusters are usually deployed to improve performance and/or availability over that provided by a single computer. (From Wikipedia)<br />Cluster categorizations<br />Load-balancing clusters - clusters operate by distributing a workload evenly over multiple back end nodes.<br />Compute clusters - are used for primarily computational purposes, rather than handling IO-oriented operations such as web service or databases. For instance, a cluster might support computational simulations of weather or vehicle crashes. The primary distinction within compute clusters is how tightly-coupled the individual nodes are.<br />Grid computing - usually compute clusters, but more focused on throughput like a computing utility rather than running fewer, tightly-coupled jobs. Often, grids will incorporate heterogeneous collections of computers, possibly distributed geographically distributed nodes, sometimes administered by unrelated organizations.<br />High-availability (HA) clusters – (also known as failover clusters) are implemented primarily for the purpose of improving the availability of services. They operate by having redundant nodes to eliminate single points of failure. Two nodes is the minimum requirement to provide redundancy.<br />
  12. 12. What is clustering: pure theory (cont)<br />Cluster members are relatively close;<br />Hardware capabilities are important;<br />SCSI/iSCSI or Fiber drives only;<br />No dynamics disks;<br />MS DTC and Quorum – must exist<br /> (with size at least 1Gb each)<br />Two network cards: <br />Local – health check (LooksAlive test)<br />Public – access point<br />
  13. 13. What is clustering: pure theory (cont)<br />Cluster organization:<br /><ul><li>Logical host or Cluster logical hostis used to describe the network address which is used to access services provided by the cluster.
  14. 14. Nodeis physical computer with its own IP address, OS, applications.</li></ul>Examples:<br />MS SQL Server cluster – high availability<br />Scale up<br />Federation<br />Oracle Real Application Cluster – load balancing<br />Scale out<br />Scale up<br />
  15. 15. High-availability (HA) cluster:<br />
  16. 16. IMPORTANT!<br />Cluster is a way to reduce risk of failure:<br />Storage (if it is not clustered itself):<br />Power failure;<br />Storage hardware failure;<br />Geographical location:<br />Disaster (fire, flood, earthquake);<br />Power outage 2003 (New York, Cleveland, Ohio, Detroit, Michigan, Toronto and Ottawa);<br />User’s mistake:<br />From database drop to network cable pull out;<br />Probability of simultaneous hardware failure is not zero.<br />
  17. 17. Downtime<br />
  18. 18. Probability of failure<br />Mathematic theory.<br />Murphy's law:<br />Anything that can possibly go wrong, does<br />Too many, too close (hardware, software, network, humans…)<br />"Every program has at least one bug and can be shortened by at least one instruction — from which, by induction, it is evident that every program can be reduced to one instruction that does not work."<br /> by Ken Arnold<br />
  19. 19. Money, money, money<br />Time is money<br />More hardware means more money<br />More software means more money<br />More hardware and/or software means more money<br />Does it worth to go all the way? <br />
  20. 20. Cluster configuration<br />Active/Passive — Provides a fully redundant instance of each node, which is only brought online when its associated primary node fails.<br />Active/Active — Traffic intended for the failed node is either passed onto an existing node or load balanced across the remaining nodes.<br />N+1 — Provides a single extra node that is brought online to take over the role of the node that has failed.<br />N+M — In cases where a single cluster is managing many services, having only one dedicated failover node may not offer sufficient redundancy. In such cases, more than one (M) standby servers are included and available.<br />N-to-1 — Allows the failover standby node to become the active one temporarily, until the original node can be restored or brought back online, at which point the services or instances must be failed-back to it in order to restore High Availability. <br />N-to-N — A combination of Active/Active and N+M clusters.<br />
  21. 21. Load-balanced (LB) cluster:<br />
  22. 22.
  23. 23.
  24. 24.
  25. 25. IMPORTANT!<br />Clustering on virtual servers might not protect from hardware failure.<br />Adds complexity of sizing<br />
  26. 26. Compare direct costs<br />* - pricing details have to be complained to Microsoft Licensing<br />
  27. 27. Key features:<br /><ul><li>Microsoft Clustering for Standard Edition
  28. 28. 4 processors
  29. 29. Native clustering
  30. 30. Microsoft Clustering for Enterprise Edition 2005:
  31. 31. OS limited ## of CPUs;
  32. 32. Partitioning;
  33. 33. Parallel index operations;
  34. 34. On-line indexing;
  35. 35. Indexed view matching by the query processor;</li></li></ul><li>Key features:<br />Microsoft Clustering for Enterprise Edition 2005 (continue):<br />On-line restore;<br />Fast recovery;<br />Oracle replication;<br />Scale-out report server;<br />Data driven subscriptions;<br />Advances SSAS (like proactive cashing).<br />
  36. 36. Key features:<br />Microsoft Clustering for Enterprise Edition 2008:<br />Policy-Based Management;<br />Performance Data Collection;<br />Data Compression;<br />Backup Compression;<br />Resource Governor;<br />Transparent Data Encryption;<br />External Key Management / Extensible Key Management;<br />
  37. 37. Key features:<br />Microsoft Clustering for Enterprise Edition 2008 (continue):<br />Data Auditing;<br />Server Group Management;<br />Hot-Add CPUs and Hot-Add Memory;<br />Streamlined Installation;<br />Upgrade Advisor;<br />Partition Aligned Indexed Views;<br />Extended Events;<br />More, more, and more…;<br />
  38. 38. Key features:<br />HP PolyServe Matrix with MS SQL Server Standard Edition<br />Non-NTFS file system;<br />Increase disk size dynamically (no limits on disk size/RPM);<br />Usage of files over 2Tb in size (16Tb limit);<br />No need in identical hardware (Be careful!!!);<br />Any SQL instance can be moved to any note;<br />Software monitoring;<br />Dynamic Re-hosting;<br />4 or 8 Kb disk segment size;<br />Performance cost: 10% ?<br />
  39. 39. Key features:<br />XkotoGridscale:<br />Horizontal scalability ~ 0.85*n-servers;<br />Cost ~ 30% in software and hardware (SAN costs are not included);<br />Ability to test new database versions in real production environment;<br />Storage redundancy ;<br />Flexibility;<br />Time stamps?<br />
  40. 40. How Microsoft works:<br />
  41. 41.
  42. 42.
  43. 43. Windows cluster:<br />
  44. 44. How PolyServe works:<br />What happens during failover:<br />Removes the IP address for the Virtual SQL Server from the original node and then adds it to the backup node;<br />Updates registry keys and then starts SQL Server and SQL Agent on the backup node;<br />
  45. 45. PolyServe Matrix Management:<br />
  46. 46. PolyServe Matrix Disk Management:<br />
  47. 47. PolyServe Matrix Disk Management:<br />
  48. 48. PolyServe Matrix Disk Management:<br />
  49. 49. You get what you paid for:<br />Microsoft:<br />Hardware is very important;<br />Huge advantage in features;<br />You already have it all;<br />Easy to play with;<br />A lot of documentation;<br />A lot of knowledge around;<br />Cheaper labor (more people know).<br />
  50. 50. You get what you paid for:<br />HP:<br />HP – is HP (customer support, buyer experience);<br />Costly training and limited ($2500 pp);<br />Limited accessibility to working cluster;<br />Limited resource pull;<br />Limited second opinion option;<br />Clouded future.<br />
  51. 51. Think, analyze, think again – it is not black and white;<br />What is the real business need;<br />Create clear business requirements;<br />Do cost – benefit analysis because cost of the solution should be less than benefits from implementation;<br />Negotiate;<br />Create a clear plan;<br />Conclusion:<br />
  52. 52. 100% utilization is fiction:<br />80% per box utilization;<br />N+1 configuration;<br />Do you really like that vendor;<br />Evaluate alternatives (SSD, SST)<br />How good is your SysAdmin;<br />Do not be afraid – clustering is easy;<br />Just Do It!<br />Conclusion (continue):<br />
  53. 53. Internet – find your way.<br />http://www.sql-server-performance.com/articles/clustering/clustering_intro_p1.aspx<br />http://en.wikipedia.org/wiki/Computer_cluster<br />Vendor –pushing sales technique.<br />http://www.microsoft.com/windowsserver2003/enterprise/clustering.mspx<br />http://technet.microsoft.com/en-us/magazine/2007.03.sqlclusters.aspx<br />http://h18006.www1.hp.com/products/storage/software/polyserve/support/hp_polyserve.html<br />http://www.xkoto.com/products/gs-sql.php<br />Where to look for more:<br />
  54. 54. Email: a_prusakov@yahoo.com<br />Questions & Open Discussion<br />Thank You!<br />

×