Raz Tamir
Agenda
●Introduction
●Terminology
●Architecture
●Getting Gluster Working
What is Gluster?
- Gluster was originally developed by Gluster, Inc., and then by Red Hat, Inc.,
after its acquisition of Gluster in 2011
- Gluster is an open source, distributed, scale-out storage system
- Runs on heterogeneous commodity hardware
- No centralized metadata server
Terminology
● Brick
- A brick is the basic unit of storage, represented by an export directory on a
server in the trusted storage pool
- e.g., NODE:/DIR
● Volume
- A logical collection of bricks. Most of the gluster management operations
happen on the volume
● Node
- Server running the gluster daemon and sharing volumes
● Trusted Storage Pool
- Collection of storage servers (nodes)
Trusted storage pool
A trusted network of nodes that host storage resources
Trusted storage pool commands
Add new node:
- gluster peer probe [node] command is used to add nodes to the
trusted storage pool
Remove node:
- gluster peer detach [node] command is used to remove nodes from
the trusted storage pool
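Membership can be verified at any time, for example after a probe, with the peer status command, run from any node already in the pool:
# gluster peer status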
Gluster main services
glusterd
- Volume management daemon
- Runs on all export nodes
glusterfsd
- GlusterFS brick daemon
- One process for each brick
- Managed by glusterd
Putting it all together
- Trusted Storage Pool: nodes hosting storage resources
- Bricks on those nodes are combined into a Volume
- The Volume is accessed by clients through a Mount Point
Scaling
Scaling Up:
- Add disks to a node (XFS is the recommended brick filesystem)
- Expand a gluster volume by adding bricks
# gluster volume add-brick test_volume node1:/data/Music
Add Brick successful
# gluster volume rebalance test_volume start
Starting rebalance on volume test_volume has been successful
Scaling
Scaling Out:
- Add gluster nodes to trusted storage pool
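Putting the two together, scaling out is a peer probe followed by an add-brick and an optional rebalance. A sketch, assuming a hypothetical new node node5 exporting /exp5:
# gluster peer probe node5
# gluster volume add-brick test_volume node5:/exp5
# gluster volume rebalance test_volume start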
Gluster volume types
Gluster storage supports different types of volumes based on the requirements.
Some volume types are good for scaling storage size, others are better for
improving performance, and some are good for both size and performance.
● Distributed Volume
● Replicated Volume
● Distributed Replicated Volume
● Striped Volume
Distributed Volume
- Files are distributed across the bricks in the volume
- A brick failure leads to the loss of the files stored on that brick; there is
no redundancy (comparable to file-level RAID 0)
Creating a Distributed Volume
gluster volume create NEW-VOLNAME [transport [tcp | rdma | tcp,rdma]] NEW-BRICK...
For example, to create a distributed volume with four storage servers using TCP:
# gluster volume create test-volume server1:/exp1 server2:/exp2
server3:/exp3 server4:/exp4
Creation of test-volume has been successful
Please start the volume to access data
Display the volume information
# gluster volume info test-volume
Volume Name: test-volume
Type: Distribute
Status: Created
Number of Bricks: 4
Transport-type: tcp
Bricks:
Brick1: server1:/exp1
Brick2: server2:/exp2
Brick3: server3:/exp3
Brick4: server4:/exp4
Replicated Volume
- An exact copy of the data is maintained on all bricks
- At least two bricks are required to create a replicated volume
Create a Replicated Volume
gluster volume create NEW-VOLNAME [replica COUNT] [transport [tcp | rdma |
tcp,rdma]] NEW-BRICK...
For example, to create a replicated volume with two storage servers:
# gluster volume create test-volume replica 2 transport tcp server1:/exp1
server2:/exp2
Creation of test-volume has been successful
Please start the volume to access data
Distributed Replicated Volume
Files are distributed across replicated sets of bricks
- The number of bricks must be a multiple of the replica count
- The order in which the bricks are specified determines how they are
replicated with each other
- Provides both scaling and high availability
Create a Distributed Replicated Volume
gluster volume create NEW-VOLNAME [replica COUNT] [transport [tcp | rdma |
tcp,rdma]] NEW-BRICK...
# gluster volume create test-volume replica 2 transport tcp
server1:/exp1 server2:/exp2 server3:/exp3 server4:/exp4
Creation of test-volume has been successful
Please start the volume to access data
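With replica 2, each pair of consecutive bricks forms a replica set, so the four bricks above pair up as (server1:/exp1, server2:/exp2) and (server3:/exp3, server4:/exp4). To keep both members of a replica set off the same server, list the first brick of every server before the second brick of any server. A hypothetical layout with two bricks per server:
# gluster volume create test-volume replica 2 transport tcp
server1:/brick1 server2:/brick1 server1:/brick2 server2:/brick2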
Striped Volume
- Data is divided into smaller chunks, which are striped across the bricks
- Load is distributed across the bricks and large files can be fetched faster
Create a Striped Volume
gluster volume create NEW-VOLNAME [stripe COUNT] [transport [tcp | rdma | tcp,rdma]]
NEW-BRICK...
For example, to create a striped volume across two storage servers:
# gluster volume create test-volume stripe 2 transport tcp server1:/exp1
server2:/exp2
Creation of test-volume has been successful
Please start the volume to access data
Which Volume Type Should I Use?
- Use distributed volumes where the requirement is to scale storage and the
redundancy is either not important or is provided by other hardware/software
layers
- Use replicated volumes in environments where high availability and high
reliability are critical
- Use distributed replicated volumes in environments where the
requirement is to scale storage and high reliability is critical. Distributed
replicated volumes also offer improved read performance in most environments
- Use striped volumes only in high concurrency environments accessing very
large files
Getting Gluster Working
Six step process:
- Install the Gluster packages
- Start the Gluster services
- Create a trusted storage pool
- Create new volumes
- Start volumes
- Mount the volumes on clients
Getting Gluster Working
Install the Gluster package on the server(s):
# yum install glusterfs-server
Start the GlusterFS management daemon:
# service glusterd start
Add storage servers to the trusted storage pool:
# gluster peer probe my_server.scl.lab.tlv.redhat.com
Create a volume on this server and start it:
# gluster volume create test-volume replica 2 transport tcp
server1:/exp1 server2:/exp2 server3:/exp3 server4:/exp4
# gluster volume start test-volume
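The remaining step is mounting the volume on a client with the native GlusterFS (FUSE) client. A sketch, assuming /gluster as a hypothetical mount point on the client:
# mkdir -p /gluster
# mount -t glusterfs my_server.scl.lab.tlv.redhat.com:/test-volume /gluster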
Questions?
Thank you!

Gluster Storage

Editor's Notes

  • #4 Gluster is a distributed filesystem that allows rapid provisioning of additional storage, based on your storage consumption needs. 1) Gluster runs in user space, eliminating the need for complex kernel patches or dependencies. 2) Gluster can run on almost any hardware. 3) Unlike other distributed file systems, Gluster does not create, store, or use a separate metadata index. Instead, it places and locates files by using a hashing algorithm, which removes a common source of I/O bottlenecks and a single point of failure. Data access is fully parallelized and performance scales linearly. A distributed file system is a client/server-based application that allows clients to access and process data stored on the server as if it were on their own computer. When a user accesses a file on the server, the server sends the user a copy of the file, which is cached on the user's computer while the data is being processed and is then returned to the server.
  • #5 Brick: a brick is the combination of a server FQDN or IP and an export directory. There is no limit on the number of bricks per node; ideally, each brick in a cluster should be the same size. Volume: a volume is a logical collection of bricks. The volume is the mount point for the end user, and the volume name is provided at creation time. Bricks from the same node can be part of different volumes. Trusted Storage Pool: covered on the next slide.
  • #6 Trusted Storage Pool: a running, trusted node "probes" a new member into the cluster, not vice versa. Additional nodes are added to the storage pool by using the probe command from a running, trusted node. Members (nodes) can be dynamically added to and removed from the pool. The glusterd service must be running on all storage servers before adding them to or removing them from the trusted storage pool. When the first server starts, the storage pool consists of that server alone.
  • #7 If you take a look at a Gluster node, the two primary services you will see running are these. glusterd - the volume management daemon, the main service that handles Gluster communications; it runs on every node, and you interact with it using the `gluster` command-line tool (examples later). glusterfsd - runs once for each brick.
  • #8 Trusted storage pools contain one or more nodes that host Gluster volumes
  • #9 A brick is a path to a directory located on the node where data will be read and written by clients Bricks are combined into volumes based on performance and reliability requirements
  • #10 Volumes are shared with Gluster clients through Gluster file system or NFS
  • #11 The most important property of Gluster is its ability to scale. I have a server, I add a disk to it and put a file system on it (XFS is recommended by Red Hat for more than 100 TB in total, but ext4 is also good). I tell Gluster that this file system is a brick, and now I have additional storage; this can be repeated over and over. We can add new bricks to an existing volume with the add-brick command, then run a rebalance (optional) to get the files distributed ideally. This involves distributing some existing files onto the new brick on rhs-lab3.
  • #12 But the real power of Gluster is scaling out: we can simply repeat the scaling-up process over and over again. We also talked about Gluster supporting heterogeneous environments, so, for example, deploying a larger node will work. You can also remove a node. We have that flexibility without having to interrupt the end user.
  • #13 As mentioned earlier, a volume is a collection of bricks, and most Gluster file system operations happen on the volume. The Gluster volume type is chosen based on your requirements and the type of data that needs to be stored. The volume type is specified when creating a volume, and it determines how and where data is placed.
  • #14 This is the default Gluster volume type: when creating a new volume without specifying a type, it is created as a distributed volume. In this volume type, a single file is stored on either brick 1 or brick 2, but not both. Hence there is no data redundancy, which is like file-level RAID 0. Use this volume type where scaling matters and redundancy requirements are either not important or provided by other hardware or software layers. How does a distributed volume work? It uses the Davies-Meyer hash algorithm: during file creation or retrieval, a hash function is computed on the file name, and this hash value is used to place or locate the file.
  • #15 No need to specify the type. The default transport type is tcp (it can be omitted).
  • #16 To see volume information for all volumes: `gluster volume info all`, or just omit the `all`: `gluster volume info`.
  • #17 In this volume type we overcome the data-loss problem of the previous (distributed) volume type by copying each file or directory to all bricks. The number of replicas in the volume is decided by the client when creating the volume (file-level RAID 1), and the number of bricks must be equal to the replica count. To protect against server and disk failures, it is recommended that the bricks of the volume come from different servers. The order in which bricks are specified determines how bricks are replicated with each other: every replica-count consecutive bricks (for example, every 2 bricks where 2 is the replica count) forms a replica set; if more bricks were specified, the next two bricks in sequence would replicate each other. One major advantage of such a volume is that even if one brick fails, the data can still be accessed from its replica brick (recovery on the last slide). Use this volume type in environments where high availability and high reliability are critical. When adding new bricks, the count of bricks we add must be equal to the replica count.
  • #18 Replica - the number of bricks to use; server1:/exp1 and server2:/exp2 will be identical. You could use the same server for the second brick, but there is no point in doing so.
  • #19 The order in which bricks are specified has a great effect on data protection: each replica_count consecutive bricks in the list you provide form a replica set. This type of volume is used when high reliability is critical, due to the redundancy it provides, and scaled storage is required. For example, with 4 bricks and a replica count of 2, the first 2 bricks become replicas of each other, then the next two, and so on. To make sure that replica-set members are not placed on the same node, list the first brick of every server, then the second brick of every server in the same order, and so on. Distributed replicated volumes also offer improved read performance in most environments.
  • #20 After the command: for example, a 4-node distributed (replicated) volume with a two-way mirror.
  • #21 Consider a large file stored on one brick that is frequently accessed by many clients at the same time. This puts too much load on a single brick and reduces performance. So the large file is divided into smaller chunks (equal in number to the bricks in the volume) and each chunk is stored on a brick. The number of bricks must be equal to the stripe count. Recommended only when very large files (greater than the size of a disk) are present. A brick failure can result in data loss, so redundancy via replication is highly recommended (striped replicated volumes).
  • #23 From the official Gluster documentation: if data redundancy is either not important or provided by other hardware/software layers, and scaling storage is important to you, use distributed volumes. When high availability and high reliability are critical, use replicated volumes. Distributed replicated volumes are a mix of the two - some of each. If very large files need to be accessed by clients, use striped volumes.
  • #25 You can view the cluster (trusted storage pool) status with `gluster peer status`. Finally, I can mount a Gluster file system on a client with the mount command: $ mount -t glusterfs my_server.scl.lab.tlv.redhat.com /gluster. Now I can create and edit files on the mount point (/gluster) as a single view of the file system.