Advanced Percona XtraDB Cluster in a nutshell... la suiteKenny Gryp
This document provides a hands-on tutorial for advanced Percona XtraDB Cluster users. It discusses setting up a 3 node PXC cluster environment in VirtualBox and bootstrapping the initial cluster. It then covers topics like avoiding state snapshot transfers when restarting MySQL, recovering from clean and unclean shutdowns, and reproducing and diagnosing different types of conflicts through examples.
Advanced percona xtra db cluster in a nutshell... la suite plsc2016Frederic Descamps
This is a tutorial I gave with my colleague Kenny Gryp at Percona Live 2016 in Santa Clara
Percona XtraDB Cluster is a high availability and high scalability solution for MySQL clustering. Percona XtraDB Cluster integrates Percona Server with the Galera synchronous replication library in a single product package, which enables you to create a cost-effective MySQL cluster.
For three years at Percona Live, we've introduced people to this technology... but what's next? This tutorial continues your education, and targets users that already have experience with Percona XtraDB Cluster and want to go further.
This tutorial will cover the following topics:
- Bootstrapping in details
- certification errors, understanding and preventing them
- Replication failures, how to deal with them
- Secrets of Galera Cache
- Mastering flow control
- Understanding and verifying replication throughput
- How to use WAN replication
- Implications of consistent reads
- Backups
- Load balancers and proxy protocol
Percona XtraDB Cluster is a high availability and high scalability solution for MySQL clustering. Percona XtraDB Cluster integrates Percona Server with the Galera synchronous replication library in a single product package which enables you to create a cost-effective MySQL cluster.
This tutorial will cover the following topics:
- Migration from standard MySQL Master-Slave Architecture to PXC
- Configuration differences between standard MySQLl and Xtradb Cluster
- How to add a node and what does SST, IST mean ? How to use them ?
- How to backup the cluster
- How to monitor the cluster
- 2 nodes servers- Why this isn't ideal but reasons and steps to setting it up anyway.
- Galera Arbitrator: Defining what it is.
- How to maintain the cluster
- Setting up load balancing for Xtradb cluster
- How to handle the cluster in the cloud
- Tips and tricks
- ... and if available cover PXC 5.6 with Galera 3 !!
This document discusses improvements made to LXC container support in Ganeti through a Google Summer of Code project. Key areas worked on included fixing existing LXC integration, adding unit tests, setting up quality assurance, and adding features like migration. The project was mentored by Hrvoje Ribicic. Future work may include two-level isolation of containers within VMs and live migration of LXC instances.
How to Troubleshoot OpenStack Without Losing SleepSadique Puthen
The complex architecture, design, and difficulties while troubleshooting amplifies the effort in debugging a problem with an OpenStack environment. This can give administrators and support associates sleepless nights if OpenStack native and supporting components are not configured properly and tuned for optimum performance, especially with large deployments that involve high availability and load balancing.
Advanced Percona XtraDB Cluster in a nutshell... la suiteKenny Gryp
This document provides a hands-on tutorial for advanced Percona XtraDB Cluster users. It discusses setting up a 3 node PXC cluster environment in VirtualBox and bootstrapping the initial cluster. It then covers topics like avoiding state snapshot transfers when restarting MySQL, recovering from clean and unclean shutdowns, and reproducing and diagnosing different types of conflicts through examples.
Advanced percona xtra db cluster in a nutshell... la suite plsc2016Frederic Descamps
This is a tutorial I gave with my colleague Kenny Gryp at Percona Live 2016 in Santa Clara
Percona XtraDB Cluster is a high availability and high scalability solution for MySQL clustering. Percona XtraDB Cluster integrates Percona Server with the Galera synchronous replication library in a single product package, which enables you to create a cost-effective MySQL cluster.
For three years at Percona Live, we've introduced people to this technology... but what's next? This tutorial continues your education, and targets users that already have experience with Percona XtraDB Cluster and want to go further.
This tutorial will cover the following topics:
- Bootstrapping in details
- certification errors, understanding and preventing them
- Replication failures, how to deal with them
- Secrets of Galera Cache
- Mastering flow control
- Understanding and verifying replication throughput
- How to use WAN replication
- Implications of consistent reads
- Backups
- Load balancers and proxy protocol
Percona XtraDB Cluster is a high availability and high scalability solution for MySQL clustering. Percona XtraDB Cluster integrates Percona Server with the Galera synchronous replication library in a single product package which enables you to create a cost-effective MySQL cluster.
This tutorial will cover the following topics:
- Migration from standard MySQL Master-Slave Architecture to PXC
- Configuration differences between standard MySQLl and Xtradb Cluster
- How to add a node and what does SST, IST mean ? How to use them ?
- How to backup the cluster
- How to monitor the cluster
- 2 nodes servers- Why this isn't ideal but reasons and steps to setting it up anyway.
- Galera Arbitrator: Defining what it is.
- How to maintain the cluster
- Setting up load balancing for Xtradb cluster
- How to handle the cluster in the cloud
- Tips and tricks
- ... and if available cover PXC 5.6 with Galera 3 !!
This document discusses improvements made to LXC container support in Ganeti through a Google Summer of Code project. Key areas worked on included fixing existing LXC integration, adding unit tests, setting up quality assurance, and adding features like migration. The project was mentored by Hrvoje Ribicic. Future work may include two-level isolation of containers within VMs and live migration of LXC instances.
How to Troubleshoot OpenStack Without Losing SleepSadique Puthen
The complex architecture, design, and difficulties while troubleshooting amplifies the effort in debugging a problem with an OpenStack environment. This can give administrators and support associates sleepless nights if OpenStack native and supporting components are not configured properly and tuned for optimum performance, especially with large deployments that involve high availability and load balancing.
Мастер-класс "Логическая репликация и Avito" / Константин Евтеев, Михаил Тюр...Ontico
HighLoad++ 2017
Зал «Кейптаун», 7 ноября, 12:00
Тезисы:
http://www.highload.ru/2017/abstracts/2868.html
В Avito объявления хранятся в базах данных Postgres. При этом уже на протяжении многих лет активно применяется логическая репликация. С помощью неё успешно решаются вопросы роста объема данных и количества запросов к ним, масштабирования и распределения нагрузки, доставки данных в DWH и поисковые подсистемы, меж-базные и меж-сервисные синхронизации данных и пр.
...
This document explains how to set up ProxySQL to log queries from users connecting directly to the database servers. It details installing and configuring ProxySQL to log queries to binary files, using a tool to convert the binary logs to text format, and setting up an ELK stack to index the query logs and make them searchable in Kibana. Filebeat is configured to ship the text query logs to Logstash, which parses them and sends the data to Elasticsearch. Kibana provides a web interface for viewing and analyzing the query logs.
The document discusses two scenarios for performing point-in-time recovery on a Galera cluster after a node truncates a production table. Scenario 1 involves stopping the application, restoring from backup, replaying binary logs up until the truncate event, and restarting nodes. Scenario 2 maintains application uptime by removing the problematic node, restoring its data, and reintegrating it while masking the truncate from other nodes. The document also provides tips for optimizing XtraBackup state snapshot transfers and migrating an asynchronous slave to a new master node.
This document discusses MySQL replication and clustering. It provides an overview of the processes for replication and clustering, including that replication has one master and slaves while clustering allows reading and writing on all nodes. It also gives quick how-to guides for setting up basic replication and clustering configurations.
Troubleshooting containerized triple o deploymentSadique Puthen
This document discusses troubleshooting containerized TripleO deployments. It provides an overview of traditional versus containerized TripleO deployments. Key aspects covered include building container images, registering images, deployment flow, and troubleshooting. It also discusses containerized components in the overcloud including HA pacemaker containers, standalone containers, containerized compute and Ceph nodes, and Neutron containers. Specific troubleshooting steps and files are outlined.
This document discusses the objectives, implementation, testing, and roadmap for multi-master replication in Postgres. The key goals are fault tolerance, allowing writes to any node, and compatibility with standalone Postgres. It uses logical replication and a transaction manager based on ClockSI to allow transactions to commit on any node. Testing involves starting docker containers to inject failures like network partitions and verify automatic recovery works as expected. The roadmap includes releasing a public beta, contributing patches to upstream Postgres, and discussing replication of catalog content.
1. A PostgreSQL database outage occurred at GitLab on January 31st due to a combination of factors including an increase in load, replication lag, and the deletion of the database directory.
2. Lessons learned include monitoring replication, using tools like pg_basebackup properly, and having backups and disaster recovery processes in place.
3. Recommended preventative measures include setting sane configuration values, automated testing of backups, assigning an owner for data durability, and improving documentation.
This document describes migrating a database from a standalone MySQL configuration to a Galera cluster for high availability and redundancy. It outlines the existing infrastructure including web, mail, and database servers managed by Puppet. It then details removing the existing MySQL data and joining the nodes to the new Galera cluster. Configuration files are shown for Galera settings like the state snapshot transfer method and slave threads. System information is displayed for one of the Galera nodes including the large production database size and high query throughput. The GitHub link shows example Puppet code to check the Galera cluster status and return errors if not in the primary or connected states.
MySQL async message subscription platformLouis liu
The document discusses Roma, an asynchronous MySQL message system that subscribes to messages from the Canal system. Roma stores these messages in its own storage (MetaQ) since Canal cannot reliably store messages. Clients can then subscribe to the messages from Roma. Roma acts as an intermediary between Canal and clients to ensure messages are reliably stored.
Anatomy of neutron from the eagle eyes of troubelshoortersSadique Puthen
This document summarizes the anatomy of OpenStack Neutron through examples of real-life troubleshooting scenarios. It explores four examples: security group rules not being effective, instances not getting IP addresses from DHCP, floating IP connections randomly failing, and slow provider network communications. For each example, it explains the root cause found by understanding Neutron's architecture and packet flows, and describes the troubleshooting steps taken such as examining logs, monitoring processes, and using tools like tcpdump. The goal is to demonstrate Neutron anatomy and troubleshooting methods rather than just state the problems and solutions.
Training Slides: Advanced 302: Performing Schema Changes in a Multi-Site/Mult...Continuent
Join us for this advanced training session as we explore how to perform schema changes within a complex multi-site/multi-master environment, with zero-downtime. This training is for anyone new to Continuent without prior experience, but will also serve as a wonderful refresher for any current users. Basic MySQL and MySQL replication knowledge is assumed.
Course Prerequisite Learning
Basics: Introduction to Clustering
Intermediate: Performing Cluster Maintenance with Zero-Downtime
Advanced: Multi-Site/Multi-Master Tungsten Clustering Deployments
AGENDA
- Re-cap multi-site/multi-master (MSMM) cluster architecture
- Discuss the complexities of schema changes in a live environment
- Review cluster states
- Demonstrate how to perform a schema change
- Re-cap of key command line tools and resources
Training Slides: Basics 105: Backup, Recovery and Provisioning Within Tungste...Continuent
Join us for this training session where we discuss tools for backing up MySQL databases within Tungsten Clustering, and how Tungsten Clustering makes it easy to reprovision databases and recover a cluster. This training is for all engineers using Tungsten Clustering and will explain the tools needed to create a sound backup and recovery strategy, and is also valuable for those who wish to review their current MySQL backup and MySQL recovery procedures.
AGENDA
- Methods and tools for taking a MySQL backup from within a Tungsten cluster
- mysqldump
- xtrackbackup
- Snapshots (lvm or other)
- File copy
- tungsten_provision_slave
- Restoring a MySQL backup from within a Tungsten cluster
- Verifying the backup contains the last binary log position, and the importance of having this
- Restoring backups into a cluster
- Provisioning slaves from an existing datasource
Orchestrator allows for easy MySQL failover by monitoring the cluster and promoting a new master when failures occur. Two test cases were demonstrated: 1) using a VIP and scripts to redirect connections during failover and 2) integrating with Proxysql to separate reads and writes and automatically redirect write transactions during failover while keeping read queries distributed. Both cases resulted in failover occurring within 16 seconds while maintaining application availability.
The document provides configuration instructions and guidelines for setting up streaming replication between a PostgreSQL master and standby server, including setting parameter values for wal_level, max_wal_senders, wal_keep_segments, creating a dedicated replication role, using pg_basebackup to initialize the standby, and various recovery target options to control the standby's behavior. It also discusses synchronous replication using replication slots and monitoring the replication process on both the master and standby servers.
Streaming Replication Made Easy in v9.3Sameer Kumar
This document discusses setting up streaming replication in PostgreSQL v9.3 to enable high availability. It covers preparing primary and standby servers, configuring wal_level and max_wal_senders on the primary, taking a backup and restoring on the standby, creating a recovery.conf file, starting the servers to test replication, triggering failover by promoting the standby, handling multiple replicas without rebuilding, and rebuilding the original primary as a new standby. Monitoring replication status is also addressed using views like pg_stat_replication.
This document outlines the steps to install Oracle Grid Infrastructure and configure high availability for an Oracle database cluster using Grid, NFS, and IP failover. It describes prerequisites like installing Oracle Grid and database packages, configuring shared storage, creating Oracle user accounts, and bonding network interfaces. The steps also include configuring the Oracle environment, installing the Grid software, and basic post-installation configuration to enable high availability functionality.
This document summarizes an introduction to OpenvSwitch presented by pichuang@sdnds-tw. It provides an overview of OpenvSwitch including that it is a production quality, multi-layer virtual switch that supports OpenFlow and is written in platform-independent C. It then describes some key OpenvSwitch features and components and how they interact, such as the datapath, ovs-vswitchd, and ovsdb-server. It concludes with suggestions for debugging and setting OpenvSwitch.
This document discusses Corosync and Pacemaker, which are open source cluster management and resource management tools. Corosync provides messaging, membership, and quorum services, while Pacemaker acts as a cluster resource manager that can monitor resources and move them between nodes as needed. Resources can be configured as clones, which have multiple instances across nodes, or as master-slave, where one node is active and others are backups. Pacemaker uses resource agents to monitor specific services and resources. The document provides examples of configuring and clustering an Apache web server and IP address.
MySQL replication allows data from a master database server to be copied to one or more slave database servers. It provides advantages like improving performance through load balancing, increasing data security with backups on slaves, and enabling analytics on slaves without impacting the master. Basic replication involves setting up a master server and slave server with unique IDs, configuring the master to log binary changes, and pointing the slave to the master so it can copy the binary log entries.
[Open Infrastructure & Cloud Native Days Korea 2019]
커뮤니티 버전의 OpenStack 과 Ceph를 활용하여 대고객서비스를 구축한 사례를 공유합니다. 유연성을 확보한 기업용 클라우드 서비스 구축 사례와 높은 수준의 보안을 요구하는 거래소 서비스를 구축, 운영한 사례를 소개합니다. 또한 이 프로젝트에 사용된 기술 스택 및 장애 해결사례와 최적화 방안을 소개합니다. 오픈스택은 역시 오픈소스컨설팅입니다.
#openstack #ceph #openinfraday #cloudnative #opensourceconsulting
This document provides documentation for Percona XtraDB Cluster, an open-source high availability and scalability solution for MySQL users. It includes sections on installation from binaries or source code, key features like high availability and multi-master replication, FAQs, how-tos, limitations, and other documentation. Percona XtraDB Cluster provides synchronous replication across multiple MySQL/Percona Server nodes, allowing for high availability and the ability to write to any node.
Percona Cluster Installation with High AvailabilityRam Gautam
This document provides configuration instructions for setting up a 3-node Percona XtraDB Cluster with high availability and load balancing. It describes:
1. Installing and configuring Percona 5.7 on three CentOS nodes to form a database cluster with a shared configuration and automatic replication.
2. Configuring HAProxy on two load balancing servers to distribute connections across the database nodes and provide health monitoring.
3. Configuring Keepalived on the load balancing servers to provide a virtual IP address and failover capability so that if one load balancer fails, the other will take over.
Мастер-класс "Логическая репликация и Avito" / Константин Евтеев, Михаил Тюр...Ontico
HighLoad++ 2017
Зал «Кейптаун», 7 ноября, 12:00
Тезисы:
http://www.highload.ru/2017/abstracts/2868.html
В Avito объявления хранятся в базах данных Postgres. При этом уже на протяжении многих лет активно применяется логическая репликация. С помощью неё успешно решаются вопросы роста объема данных и количества запросов к ним, масштабирования и распределения нагрузки, доставки данных в DWH и поисковые подсистемы, меж-базные и меж-сервисные синхронизации данных и пр.
...
This document explains how to set up ProxySQL to log queries from users connecting directly to the database servers. It details installing and configuring ProxySQL to log queries to binary files, using a tool to convert the binary logs to text format, and setting up an ELK stack to index the query logs and make them searchable in Kibana. Filebeat is configured to ship the text query logs to Logstash, which parses them and sends the data to Elasticsearch. Kibana provides a web interface for viewing and analyzing the query logs.
The document discusses two scenarios for performing point-in-time recovery on a Galera cluster after a node truncates a production table. Scenario 1 involves stopping the application, restoring from backup, replaying binary logs up until the truncate event, and restarting nodes. Scenario 2 maintains application uptime by removing the problematic node, restoring its data, and reintegrating it while masking the truncate from other nodes. The document also provides tips for optimizing XtraBackup state snapshot transfers and migrating an asynchronous slave to a new master node.
This document discusses MySQL replication and clustering. It provides an overview of the processes for replication and clustering, including that replication has one master and slaves while clustering allows reading and writing on all nodes. It also gives quick how-to guides for setting up basic replication and clustering configurations.
Troubleshooting containerized triple o deploymentSadique Puthen
This document discusses troubleshooting containerized TripleO deployments. It provides an overview of traditional versus containerized TripleO deployments. Key aspects covered include building container images, registering images, deployment flow, and troubleshooting. It also discusses containerized components in the overcloud including HA pacemaker containers, standalone containers, containerized compute and Ceph nodes, and Neutron containers. Specific troubleshooting steps and files are outlined.
This document discusses the objectives, implementation, testing, and roadmap for multi-master replication in Postgres. The key goals are fault tolerance, allowing writes to any node, and compatibility with standalone Postgres. It uses logical replication and a transaction manager based on ClockSI to allow transactions to commit on any node. Testing involves starting docker containers to inject failures like network partitions and verify automatic recovery works as expected. The roadmap includes releasing a public beta, contributing patches to upstream Postgres, and discussing replication of catalog content.
1. A PostgreSQL database outage occurred at GitLab on January 31st due to a combination of factors including an increase in load, replication lag, and the deletion of the database directory.
2. Lessons learned include monitoring replication, using tools like pg_basebackup properly, and having backups and disaster recovery processes in place.
3. Recommended preventative measures include setting sane configuration values, automated testing of backups, assigning an owner for data durability, and improving documentation.
This document describes migrating a database from a standalone MySQL configuration to a Galera cluster for high availability and redundancy. It outlines the existing infrastructure including web, mail, and database servers managed by Puppet. It then details removing the existing MySQL data and joining the nodes to the new Galera cluster. Configuration files are shown for Galera settings like the state snapshot transfer method and slave threads. System information is displayed for one of the Galera nodes including the large production database size and high query throughput. The GitHub link shows example Puppet code to check the Galera cluster status and return errors if not in the primary or connected states.
MySQL async message subscription platformLouis liu
The document discusses Roma, an asynchronous MySQL message system that subscribes to messages from the Canal system. Roma stores these messages in its own storage (MetaQ) since Canal cannot reliably store messages. Clients can then subscribe to the messages from Roma. Roma acts as an intermediary between Canal and clients to ensure messages are reliably stored.
Anatomy of neutron from the eagle eyes of troubelshoortersSadique Puthen
This document summarizes the anatomy of OpenStack Neutron through examples of real-life troubleshooting scenarios. It explores four examples: security group rules not being effective, instances not getting IP addresses from DHCP, floating IP connections randomly failing, and slow provider network communications. For each example, it explains the root cause found by understanding Neutron's architecture and packet flows, and describes the troubleshooting steps taken such as examining logs, monitoring processes, and using tools like tcpdump. The goal is to demonstrate Neutron anatomy and troubleshooting methods rather than just state the problems and solutions.
Training Slides: Advanced 302: Performing Schema Changes in a Multi-Site/Mult...Continuent
Join us for this advanced training session as we explore how to perform schema changes within a complex multi-site/multi-master environment, with zero-downtime. This training is for anyone new to Continuent without prior experience, but will also serve as a wonderful refresher for any current users. Basic MySQL and MySQL replication knowledge is assumed.
Course Prerequisite Learning
Basics: Introduction to Clustering
Intermediate: Performing Cluster Maintenance with Zero-Downtime
Advanced: Multi-Site/Multi-Master Tungsten Clustering Deployments
AGENDA
- Re-cap multi-site/multi-master (MSMM) cluster architecture
- Discuss the complexities of schema changes in a live environment
- Review cluster states
- Demonstrate how to perform a schema change
- Re-cap of key command line tools and resources
Training Slides: Basics 105: Backup, Recovery and Provisioning Within Tungste...Continuent
Join us for this training session where we discuss tools for backing up MySQL databases within Tungsten Clustering, and how Tungsten Clustering makes it easy to reprovision databases and recover a cluster. This training is for all engineers using Tungsten Clustering and will explain the tools needed to create a sound backup and recovery strategy, and is also valuable for those who wish to review their current MySQL backup and MySQL recovery procedures.
AGENDA
- Methods and tools for taking a MySQL backup from within a Tungsten cluster
- mysqldump
- xtrackbackup
- Snapshots (lvm or other)
- File copy
- tungsten_provision_slave
- Restoring a MySQL backup from within a Tungsten cluster
- Verifying the backup contains the last binary log position, and the importance of having this
- Restoring backups into a cluster
- Provisioning slaves from an existing datasource
Orchestrator allows for easy MySQL failover by monitoring the cluster and promoting a new master when failures occur. Two test cases were demonstrated: 1) using a VIP and scripts to redirect connections during failover and 2) integrating with Proxysql to separate reads and writes and automatically redirect write transactions during failover while keeping read queries distributed. Both cases resulted in failover occurring within 16 seconds while maintaining application availability.
The document provides configuration instructions and guidelines for setting up streaming replication between a PostgreSQL master and standby server, including setting parameter values for wal_level, max_wal_senders, wal_keep_segments, creating a dedicated replication role, using pg_basebackup to initialize the standby, and various recovery target options to control the standby's behavior. It also discusses synchronous replication using replication slots and monitoring the replication process on both the master and standby servers.
Streaming Replication Made Easy in v9.3Sameer Kumar
This document discusses setting up streaming replication in PostgreSQL v9.3 to enable high availability. It covers preparing primary and standby servers, configuring wal_level and max_wal_senders on the primary, taking a backup and restoring on the standby, creating a recovery.conf file, starting the servers to test replication, triggering failover by promoting the standby, handling multiple replicas without rebuilding, and rebuilding the original primary as a new standby. Monitoring replication status is also addressed using views like pg_stat_replication.
This document outlines the steps to install Oracle Grid Infrastructure and configure high availability for an Oracle database cluster using Grid, NFS, and IP failover. It describes prerequisites like installing Oracle Grid and database packages, configuring shared storage, creating Oracle user accounts, and bonding network interfaces. The steps also include configuring the Oracle environment, installing the Grid software, and basic post-installation configuration to enable high availability functionality.
This document summarizes an introduction to OpenvSwitch presented by pichuang@sdnds-tw. It provides an overview of OpenvSwitch including that it is a production quality, multi-layer virtual switch that supports OpenFlow and is written in platform-independent C. It then describes some key OpenvSwitch features and components and how they interact, such as the datapath, ovs-vswitchd, and ovsdb-server. It concludes with suggestions for debugging and setting OpenvSwitch.
This document discusses Corosync and Pacemaker, which are open source cluster management and resource management tools. Corosync provides messaging, membership, and quorum services, while Pacemaker acts as a cluster resource manager that can monitor resources and move them between nodes as needed. Resources can be configured as clones, which have multiple instances across nodes, or as master-slave, where one node is active and others are backups. Pacemaker uses resource agents to monitor specific services and resources. The document provides examples of configuring and clustering an Apache web server and IP address.
MySQL replication allows data from a master database server to be copied to one or more slave database servers. It provides advantages like improving performance through load balancing, increasing data security with backups on slaves, and enabling analytics on slaves without impacting the master. Basic replication involves setting up a master server and slave server with unique IDs, configuring the master to log binary changes, and pointing the slave to the master so it can copy the binary log entries.
[Open Infrastructure & Cloud Native Days Korea 2019]
커뮤니티 버전의 OpenStack 과 Ceph를 활용하여 대고객서비스를 구축한 사례를 공유합니다. 유연성을 확보한 기업용 클라우드 서비스 구축 사례와 높은 수준의 보안을 요구하는 거래소 서비스를 구축, 운영한 사례를 소개합니다. 또한 이 프로젝트에 사용된 기술 스택 및 장애 해결사례와 최적화 방안을 소개합니다. 오픈스택은 역시 오픈소스컨설팅입니다.
#openstack #ceph #openinfraday #cloudnative #opensourceconsulting
This document provides documentation for Percona XtraDB Cluster, an open-source high availability and scalability solution for MySQL users. It includes sections on installation from binaries or source code, key features like high availability and multi-master replication, FAQs, how-tos, limitations, and other documentation. Percona XtraDB Cluster provides synchronous replication across multiple MySQL/Percona Server nodes, allowing for high availability and the ability to write to any node.
Percona Cluster Installation with High AvailabilityRam Gautam
This document provides configuration instructions for setting up a 3-node Percona XtraDB Cluster with high availability and load balancing. It describes:
1. Installing and configuring Percona 5.7 on three CentOS nodes to form a database cluster with a shared configuration and automatic replication.
2. Configuring HAProxy on two load balancing servers to distribute connections across the database nodes and provide health monitoring.
3. Configuring Keepalived on the load balancing servers to provide a virtual IP address and failover capability so that if one load balancer fails, the other will take over.
MariaDB Auto-Clustering, Vertical and Horizontal Scaling within Jelastic PaaSJelastic Multi-Cloud PaaS
Availability and performance have a direct business impact for most of the companies nowadays. No one wants to lose money because of occasional downtime or data loss. Thus, to minimize the risk and ensure an extra level of redundancy, clustering and automatic scaling should be used. In this video Ruslan Synytsky presented how Jelastic PaaS implemented auto-clustering of MariaDB by providing the customers with different replication options out-of-box with no need in manual configurations. It is also detailed how to automate vertical and horizontal scaling of databases running in the cloud.
Video recording of the session https://www.youtube.com/watch?v=6MND3feb5zM
Container Orchestration from Theory to PracticeDocker, Inc.
Join Laura Frank and Stephen Day as they explain and examine technical concepts behind container orchestration systems, like distributed consensus, object models, and node topology. These concepts build the foundation of every modern orchestration system, and each technical explanation will be illustrated using Docker’s SwarmKit as a real-world example. Gain a deeper understanding of how orchestration systems like SwarmKit work in practice and walk away with more insights into your production applications.
Container orchestration from theory to practiceDocker, Inc.
"Join Laura Frank and Stephen Day as they explain and examine technical concepts behind container orchestration systems, like distributed consensus, object models, and node topology. These concepts build the foundation of every modern orchestration system, and each technical explanation will be illustrated using SwarmKit and Kubernetes as a real-world example. Gain a deeper understanding of how orchestration systems work in practice and walk away with more insights into your production applications."
Using Galera Cluster to Power Geo-distributed Applications on the WANphilip_stoev
This document discusses using Galera Cluster to power geo-distributed applications across wide area networks (WANs). Some key points:
- Galera Cluster allows MySQL databases to be replicated across multiple data centers separated by thousands of kilometers to achieve high availability and disaster recovery.
- Replicating data closer to applications improves performance by answering most read queries locally with up-to-date data cached in the InnoDB buffer pool.
- Galera is optimized for WAN replication with features like encrypting traffic, automatic detection and removal of unreliable nodes, and minimizing cross-data center traffic.
- Basic configuration involves specifying network locations, opening required ports, and configuring IPs for each node. Security must
Highly Available Load Balanced Galera MySql ClusterAmr Fawzy
Describing the major principles of well designed cloud system application including high availability and load balancing as well as implementing highly available load balanced galera mysql cluster
The document discusses best practices for Galera Cluster, a synchronous multi-master replication solution for MySQL/InnoDB. It covers topics like dealing with conflicts in a multi-master setup, performing state transfers when adding new nodes, different backup methods that assign a global transaction ID, and techniques for upgrading schemas in a clustered environment.
FOSDEM 2012: MySQL synchronous replication in practice with GaleraFromDual GmbH
This document provides an overview of MySQL synchronous replication using Galera. It discusses why Galera is used over other replication options, how to set up a Galera cluster with 3 nodes, the configuration required, and how to perform operations like starting/stopping nodes and performing a state snapshot transfer. It also covers characteristics of Galera like synchronous multi-master replication, automatic membership control, and parallel replication. The document is meant to facilitate questions and discussion about using Galera in practice.
Troy Lea's presentation on Monitoring VMware Virtualization Using vMA.
The presentation was given during the Nagios World Conference North America held Oct 13th - Oct 16th, 2014 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/conference
In the first part of Galera Cluster best practices series, we will discuss the following topics:
* ongoing monitoring of the cluster and detection of bottlenecks;
* fine-tuning the configuration based on the actual database workload;
* selecting the optimal State Snapshot Transfer (SST) method;
* backup strategies
(video:http://galeracluster.com/videos/2159/)
Topics covered in this presentation:
1. The difference between traditional (e.g. MySQL) replication and Galera Cluster
2. General Galera Cluster principles
Webinar slides: Introducing Galera 3.0 - Now supporting MySQL 5.6Severalnines
You'll learn how Galera integrates with MySQL 5.6 and Global Transaction IDs to enable cross-datacenter and cloud replication over high latency networks. The benefits are clear; a globally distributed MySQL setup across regions to deliver Severalnines availability and real-time responsiveness.
Galera Cluster for MySQL is a true multi-master MySQL replication plugin, and has been proven in mission-critical infrastructures of companies like Ping Identity, AVG Technologies, KPN and HP Cloud DNS. In this webcast you¹ll learn about the following Galera Cluster capabilities, including the latest innovations in the new 3.0 release:
Galera Cluster features and benefits
Support for MySQL 5.6
Integration with MySQL Global Transaction Identifiers
Mixing Galera synchronous replication and asynchronous MySQL replication
Deploying in WAN and Cloud environments
Handling high-latency networks
Management of Galera
This presentation was written by Wagner Bianchi for the presentation on the Oracle Consulting Team/Professional Services meeting that took place in San Francisco/CA.
From these slides you'll learn how Galera integrates with MySQL 5.6 and Global Transaction IDs to enable cross-datacenter and cloud replication over high latency networks.
ABOUT GALERA CLUSTER
Galera Cluster for MySQL is a true multi-master MySQL replication plugin, and has been proven in mission-critical infrastructures of companies like Ping Identity, AVG Technologies, KPN and HP Cloud DNS. In this webcast you¹ll learn about the following Galera Cluster capabilities, including the latest innovations in the new 3.0 release:
Galera Cluster features and benefits
Support for MySQL 5.6
Integration with MySQL Global Transaction Identifiers
Mixing Galera synchronous replication and asynchronous MySQL replication
Deploying in WAN and Cloud environments
Handling high-latency networks
Management of Galera
This document discusses Neutron networking status in OpenStack, including features like Distributed Virtual Router (DVR) support. DVR allows distributed routing to remove bottlenecks and enable one-hop east-west traffic between VMs on different hypervisors. The document provides configuration options for enabling DVR and an example multi-node Devstack configuration for testing DVR on compute and network nodes. It also includes diagrams illustrating how DVR works to deliver traffic between VMs on different networks and hypervisors.
This document provides an overview of MySQL Cluster architecture, maintenance, and support. It discusses the core architecture including data distribution across nodes, features like high availability and scalability, and the cluster structure with management, data, and SQL nodes. It also covers configuration files, database partitioning, basic administration functions like starting and stopping nodes, replication demonstrations, logging, backup and restore processes, and monitoring tools.
RAC-Installing your First Cluster and DatabaseNikhil Kumar
RAC - Installing your First RAC
Abstract : Oracle Real Application Clusters have been one of the hottest technologies in the market since 2001 prior this is know OPS in 8i. Oracle has brought revolution in the field of database by enhancing RAC technologies in it each version. This presentation will give introduction of RAC and features introduced in each version of RAC. This presentation contains the demo of building Oracle clusterware from the scratch. Also we will discuss the new components and its features during installation. This presentation and demo will be done on version 11GR2. Which will be used as a base for our next presentation Viz. Upgradation of RAC 11GR2 to 12C RAC.
This presentation will give brief insight information of RAC infrastructure setup. Sometimes DBA doesn’t fully aware of prerequisite and verification steps that needs to perform before installing clusterware, So this session will cover thing to consider before installing clusterware and best practices followed during the whole process.
Agenda
Introduction of RAC
Installation of Clusterware.
Creating diskgroup / Adding disk to Diskgroup using ASMCA.
Creation of ACFS Volume.
Installation of RAC Database using DBCA.
1) MySQL live migration involves a 3 step process of taking a baseline dump/snapshot from the source database, setting up replication between the source and destination for continuous data movement, and finally performing a switchover to migrate to the destination.
2) The baseline dump uses mysqldump to export data, users, privileges and stored objects from the source and import to the destination. The --master-data option captures binary log coordinates for setting up replication.
3) Replication is setup using the binary log coordinates to synchronize data between the source and destination. Monitoring replication status ensures the destination lag is low before switchover.
This document summarizes migrating from MySQL replication to Galera Cluster. It describes Galera Cluster as providing synchronous multi-master replication with automatic failover. The migration procedure involves converting an existing asynchronous slave to Galera, building up the Galera Cluster to the desired size, switching readers and writers over to the cluster, and then deactivating the original asynchronous replication. Key benefits of Galera Cluster include strong consistency, high availability, and ability to join new nodes automatically.
This document provides an introduction and overview of MySQL, including how to download and access MySQL, basic commands to manage databases and tables, examples of SQL queries, and how to modify data. It covers topics such as creating databases and tables, selecting, joining, aggregating data, and updating records in MySQL. Examples demonstrate how to retrieve customer names, loan amounts, branch details, and more from the sample banking database.
This document provides an introduction to MySQL, an open source relational database management system. It discusses that MySQL is pronounced "my-es-que-el" and includes both a SQL server and client programs. It also summarizes that MySQL AB is the commercial entity behind MySQL that provides marketing, development, services, support and consulting. Additionally, it notes that MySQL is the most popular open source database with over 100 million downloads, it is certified for SAP applications, and is widely used by developers along with PHP and Apache.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Best 20 SEO Techniques To Improve Website Visibility In SERPPixlogix Infotech
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
Building Production Ready Search Pipelines with Spark and MilvusZilliz
Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.
“An Outlook of the Ongoing and Future Relationship between Blockchain Technologies and Process-aware Information Systems.” Invited talk at the joint workshop on Blockchain for Information Systems (BC4IS) and Blockchain for Trusted Data Sharing (B4TDS), co-located with with the 36th International Conference on Advanced Information Systems Engineering (CAiSE), 3 June 2024, Limassol, Cyprus.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
Infrastructure Challenges in Scaling RAG with Custom AI modelsZilliz
Building Retrieval-Augmented Generation (RAG) systems with open-source and custom AI models is a complex task. This talk explores the challenges in productionizing RAG systems, including retrieval performance, response synthesis, and evaluation. We’ll discuss how to leverage open-source models like text embeddings, language models, and custom fine-tuned models to enhance RAG performance. Additionally, we’ll cover how BentoML can help orchestrate and scale these AI components efficiently, ensuring seamless deployment and management of RAG systems in the cloud.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
1. 1
Galera Cluster For MySQL
Alexey Yurchenko, Codership Oy
Percona Live MySQL User Conference and Expo, 2012
2. 2
Follow the tutorial
Using Percona XtraDB Cluster RPMs, at Amazon
EC2
Datacenters:
us-west-1
us-east-1
eu-west-1
(look for galera_tutorial_* images)
3. 3
Agenda
1. Introduction (30-45 min)
a. Galera architecture overview
b.Important terms and concepts
2. Migration from stock MySQL replication (30 min)
3. LAN cluster setup (30 min)
a. Node configuration
b.Joining the cluster, state transfer
c. Executing SQL load
4. Node failure and recovery under load (30 min)
a. Planned restart
b.Hard crash
c. Split-brain
5. WAN replication (30 min)
6. Troubleshooting
7. Q&A
4. 4
What Is Galera Cluster for MySQL?
MySQL
server ✕ N
Galera
Cluster
for
MySQL
=
5. 5
What is Galera Cluster for MySQL
Each node is fully representative of the cluster:
1. No single point of failure
2. Synchronous...............
3. 99.9% transparency
●
No data loss
●
No slave lag
●
No master failover
●
Full multi-master
6. 6
What Is Galera Cluster for MySQL?
MySQL
Galera
Cluster
for
MySQLMySQL
MySQL
MySQL
7. 7
What Is Galera Cluster for MySQL?
Galera
Cluster
for
MySQL
MySQL
server
Server
as a
part of
the
cluster
9. 9
What will happen 1
Master Slave
MySQL
master-slave replication
Clients
We start with regular MySQL master-slave
10. 10
What will happen 2
Master
Slave /
G. node1
MySQL
master-slave replication
Clients
Upgrade MySQL slave to Galera node
Single node
Galera cluster
11. 11
What will happen 3
Master
Slave /
G. node1
MySQL
master-slave replication
Clients
Add a second Galera node using state
snapshot transfer (SST) 2-node
Galera cluster
G. node2
12. 12
What will happen 4
Master
Slave /
G. node1
Clients
Failover client connections to Galera cluster
and stop master-slave. 2-node
Galera cluster
G. node2
13. 13
What will happen 5
Master /
G. node3
Slave /
G. node1
Clients
Upgrade MySQL master to Galera and join to
cluster using incremental state transfer (IST)
3-node Galera cluster
G. node2
14. 14
Converting slave to Galera node
1. Stop the slave: slave> STOP SLAVE;
2. Upgrade the software:
slave# rpm -e MySQL-server
slave# rpm -Uvh percona-xtrabackup-2.0.0-*.rpm
Percona-XtraDB-Cluster-galera-2.0-*.rpm
Percona-XtraDB-Cluster-server-5.5.20-23.4*.rpm
3. Configure wsrep options in my.cnf
4. Add log_slave_updates=1 to my.cnf
Remove read_only = 1 from my.cnf
5. Restart the server.
6. Start the slave: slave> START SLAVE;
15. 15
Sample my.cnf for slave after upgrade
[mysqld]
wsrep_cluster_address=/usr/lib64/libgalera_smm.so
wsrep_node_address=gcomm:// # NOTE: This must be changed to peer address ASAP!
wsrep_node_name=node1
wsrep_provider='/usr/lib64/galera/libgalera_smm.so'
wsrep_provider_options='gcache.size=1G;socket.ssl_key=my_key;socket.ssl_cert=my_cert'
wsrep_slave_threads=16
wsrep_sst_method=xtrabackup
wsrep_sst_auth=root:
innodb_buffer_pool_size=1G
innodb_log_file_size=256M
innodb_autoinc_lock_mode=2
innodb_flush_log_at_trx_commit=0
innodb_doublewrite=0
innodb_file_per_table=1
binlog_format=ROW
datadir=/var/lib/mysql
log-bin = mysql-bin
server-id = 2
relay-log = mysql-relay-bin
#read-only = 1
log-slave-updates = 1
17. 17
Important wsrep variables 2
wsrep_cluster_address:
Where to connect to cluster. Has a URI form:
'gcomm://another_node_address?opt1=val1&opt2=val2'
But normally just
'gcomm://another_node_address'
A special form to start a new cluster:
'gcomm://'
!!! Danger !!! Never leave it in my.cnf !!!
18. 18
Important wsrep variables 3
wsrep_node_address:
An optional address of the node. A short-cut
way to configure listen addresses for
replication and state transfers.
By default it will be initialized to the first
network interface returned by ifconfig. This
however is unreliable. For best results it must
be initialized explicitly.
19. 19
Important wsrep variables 4
wsrep_node_name:
An optional name for the node. It will be used
in logging and to identify the desired donor
for state transfer.
By default it will be initialized to hostname,
but that may turn out to be not unique (it does
not have to) or unwieldy.
20. 20
Important wsrep variables 5
wsrep_provider_options:
Semicolon-separated list of options specific to
provider. Some useful Galera options:
gcache.size – a size of the permanent
transaction on-disk cache.
socket.ssl_key, socket.ssl_cert – SSL key and
certificate files.
21. 21
Important wsrep variables 6
wsrep_slave_threads:
How many threads to launch for parallel
applying.
> 1 requires certain InnoDB settings. Applying
of STATEMENT-based events is always
serialized.
22. 22
Important wsrep variables 7
wsrep_sst_method:
Base package contains scripts for mysqldump,
rsync and xtrabackup based state snapshot
transfers. Own scripts can be crafted (handy for
backup). Default is mysqldump. It also requires
setting wsrep_sst_auth to MySQL root
user:password pair.
23. 23
Status of Galera Replication
wsrep_ready
wsrep_cluster_status
wsrep_connected
Wait for node to sync with
cluster
No
Non-Primary
Primary
Network partitioning!
wsrep_local_state_uuid
wsrep_last_committed
Yes
Yes
No: reconnect
24. 24
Starting a 2nd
Galera node
1. Install Percona RPMs. No need to configure
initial database, it will be brought by SST.
2. Edit my.cnf. Set wsrep_cluster_address to
slave node IP.
3. Start mysqld:
node2# service mysql start
4. Update my.cnf on slave:
set wsrep_cluster_address to 2nd
node IP.
25. 25
Important Concepts 1
Application State:
For Galera application state is a set of data that
application decides to replicate. By default it is a
whole of MySQL databases (i.e. every node is a
complete replica of another).
Application state is identified by a Global Transation
ID.
26. 26
Important Concepts 1
Global Transaction ID (GTID):
f7720ae0-6f9b-11e1-0800-598d1b386dce:13989753562f7720ae0-6f9b-11e1-0800-598d1b386dce 32520198989
CLUSTER/HISTORY/STATE
UUID
TRX/STATE
SEQNO
28. 28
Important Concepts 1
State Snapshot Transfer (SST):
A transfer of a consistent snapshot of a node
state corresponding to a certain GTID in order to
initialize the state of a newly joining cluster node
from an already initialized node (donor).
29. 29
Performance of Galera replication
wsrep_flow_control_paused: what fraction of the
time replication was paused.
wsrep_flow_control_sent: how many times this
node paused replication.
wsrep_local_recv_queue_avg: average length of
slave trx queue – a sign of slave side bottleneck.
wsrep_cert_deps_distance: how many transactions
can be applied in parallel.
wsrep_local_send_queue_avg: a sign of network
bottleneck.
30. 30
Failover to Galera cluster
1. Briefly pause the load.
2. Make sure slave caught up with the master:
master> SHOW MASTER STATUSG
slave> SHOW SLAVE STATUSG
3. Take note of GTID on slave:
slave> SHOW STATUS LIKE 'wsrep_%';
4. Set binary log format to ROW on slave:
slave> SET GLOBAL binlog_format=ROW;
5. Direct the load to slave and the 2nd
Galera node.
31. 31
Make former master a Galera node
1. Upgrade software.
2. Configure my.cnf.
Set wsrep_cluster_address to any of Galera
nodes' names.
3. Copy/forge grastate.dat file. Set state
information to the GTID we noted previous
page.
4. Start mysqld.
32. 32
Important Concepts 2
Incremental State Transfer (IST):
Catch up with the cluster by replaying
missing transactions. Requires
➔
known initial node state;
➔
enough transactions cached at the donor.
33. 33
Important Concepts 3
Node Failure:
For Galera node a peer crash is indistinguishable from
network failure. Hence node is considered failed when it
no longer can be communicated with. Communication is
verified by receiving messages or keepalives.
evs.inactive_timeout sets the timeout after which node
is considered inactive (dead).
evs.suspect_timeout sets the timeout after which the
node can be pronounced dead if everyone else agrees.
34. 34
Galera WAN Replication
Currently Galera has no notion of a local or
remote node – it works as long as TCP works.
May need tuning to be more tolerant to
network hiccups – Galera (wsrep provider)
options:
evs.keepalive_period = PT3S;
evs.inactive_check_period = PT10S;
evs.suspect_timeout = PT30S;
evs.inactive_timeout = PT1M;
evs.consensus_timeout = PT1M
41. 41
Backup, backup, backup
Just backup one of the nodes. But backup
without GTID is not that useful - backup with
GTID:
1) xtrabackup:
node# innobackupex --galera-info
creates xtrabackup_galera_info in the datadir
2) Custom backup with arbitrator:
node# garbd … --donor node1 --sst custom
calls wsrep_sst_custom on node1.