High Availability Storage With GlusterFS On CentOS - Automatic File Replication
1. High Availability Storage
With GlusterFS On
CentOS - Automatic File
Replication (Mirror ) ,
striping , distributing
Across Two Storage
Servers
RedHatStorage
3. Introduction
Software Defined Storage with
Intel® Enabling Technologies
HP Software-Defined Storage
NetApp Software-Defined
Storage
IBM Spectrum Storage
EMC VIPR SOFTWARE-DEFINED
STORAGE
4. Introduction
Red Hat Storage is a software-only,
scale-out storage solution that
provides flexible and agile
unstructured data storage for the
enterprise
The product can be installed and
managed on-premise, or in a public
cloud
5. Introduction
What is open, software-defined
storage from red hat point of view?
Software-defined storage brings
virtualization to enterprise data
storage.
6. Introduction
What is open, software-defined
storage ?
You can abstract and pool
storage capacity and compute
and networking resources across
on-premise and cloud
environments to scale
independently and beyond
individual hardware
components.
7. Introduction
What is open, software-defined
storage ?
Our software-defined storage is
also open source. It draws on the
innovations of a community of
developers, partners, and
customers.
The result is agile, scalable,
loosely coupled environments for
unstructured data storage.
9. Introduction
RED HAT PRODUCTS
Red Hat Gluster Storage
Provides a scalable, reliable,
and cost-effective data
management platform,
streamlining file and object
access across physical, virtual,
and cloud environments.
10. Introduction
RED HAT PRODUCTS
Red Hat Gluster Storage
In a scale-out system, one of the
biggest challenges is keeping track
of the logical and physical
locations of data and metadata.
Most distributed systems solve
this problem by creating a
metadata server to track the
location of data and metadata
11. Introduction
RED HAT PRODUCTS
Red Hat Gluster Storage
As traditional systems add more
files, more servers,or more disks,
the central metadata server
becomes a performance
bottleneck, as well as a central
point of failure.
13. Introduction
RED HAT PRODUCTS
Red Hat Gluster Storage
Unlike other traditional storage
solutions, Red Hat Storage does
not need a metadata server, and
locates files algorithmically using
an elastic hashing algorithm.
This no-metadata server ensures
better performance, linear
scalability, and reliability.
15. Introduction
RED HAT PRODUCTS
Red Hat Gluster Storage
Red Hat Storage Server for On-
premise enables enterprises to treat
physical storage as a virtualized,
scalable, and centrally managed
storage pool by using commodity
storage hardware.
16. Introduction
RED HAT PRODUCTS
Red Hat Gluster Storage
It supports multi-tenancy by
partitioning users or groups into
logical volumes on shared storage
17. Introduction
About glusterFS
glusterFS aggregates various
storage servers over network
interconnects into one large
parallel network file system.
it is a key building block of Red
Hat Storage.
18. About glusterFS
We are living in a world where
data is growing in an
unpredictable way and our need
to store this data, whether it is
structured or unstructured.
Introduction
19. About glusterFS
Distributed computing systems
offer a wide array of advantages
over centralized computing
systems. Here data is stored in a
distributed way with several
nodes as servers.
Introduction
20. Introduction
What is GlusterFS ?
GlusterFS is a distributed file
system defined to be used in user
space, i.e. File System in User
Space (FUSE). It is a software
based file system which accounts
to its own flexibility feature.
21. Introduction
What is GlusterFS ?
GlusterFS is an open source,
distributed file system capable of
scaling to several petabytes and
handling thousands of clients.
GlusterFS clusters together storage
building blocks over TCP/IP
interconnect, aggregating disk and
memory resources and managing
data in a single global namespace.
23. Introduction
What is GlusterFS ?
The figure , illustrates how users
can access application data and
files in a Global namespace using
a variety of standard protocols.
24. Introduction
What is GlusterFS ?
No longer are users locked into
costly, legacy storage platforms.
GlusterFs manages most storage
platform from multi vendor .
GlusterFS gives users the ability to
deploy scale-out, virtualized
storage – scaling from terabytes to
petabytes in a centrally managed
and commoditized pool of storage.
25. Going Deep
Storage concepts in GlusterFS
Brick – Brick is basically any
directory that is meant to be
shared among the trusted storage
pool.
Trusted Storage Pool – is a
collection of these shared
files/directories, which are based
on the designed protocol.
26. Going Deep
Storage concepts in GlusterFS
Block Storage – They are devices
through which the data is being
moved across systems in the form
of blocks.
Cluster – In Red Hat Storage, both
cluster and trusted storage pool
convey the same meaning of
collaboration of storage servers
based on a defined protocol.
27. Going Deep
Storage concepts in GlusterFS
Distributed File System – A file
system in which data is spread
over different nodes where users
can access the file without
knowing the actual location of the
file. User doesn’t experience the
feel of remote access.
28. Going Deep
Storage concepts in GlusterFS
FUSE – It is a loadable kernel
module which allows users to
create file systems above kernel
without involving any of the kernel
code.
29. Going Deep
Storage concepts in GlusterFS
glusterd – glusterd is the GlusterFS
management daemon which is the
backbone of file system which will
be running throughout the whole
time whenever the servers are in
active state.
30. Going Deep
Different Types of Volumes
Representations of different
types of volumes and
combinations among these
basic volume types are also
allowed as shown below.
Until Software Defined Storage (SDS), storage system vendors shipped legions of vertically integrated, closed proprietary storage appliances. Due to closed architectures, large application developers (customers) had limited tools to customize storage interfaces and management for their needs. Then came Software Defined Storage, which separated the storage control plane from the storage media plane. The Virtual Storage Layer moved the control function off of proprietary storage appliances and onto servers, leaving the media-specific characteristics to the storage media plane. With this separation, cutting-edge developers at Couchbase, Citrusleaf, and Percona are using software-defined interfaces to make the storage media fit their needs.
The Virtual Storage Layer™ (VSL) separates the control plane from the storage media plane. Like OpenFlow, the VSL moves the control function to the server, opening the way for developers to define new interfaces to storage.
Next example – NoSQL applications. We all know that with the proliferation of social media, mobile photo-sharing, blogging, tweeting, and HTML documents in general, the amount of unstructured data in the world is exploding. Fully-normalized relational databases aren’t optimized for large volumes of unstructured data, hence the advent of alternative NoSQL database technologies. Key-value pairs are the native data structure within NoSQL applications. A key is used to find a data value. Key-value pairs are distributed throughout caching layers and across federated systems. Cached key-value pairs expire within a specified time period to make room for more recently used pairs. Groups of pairs are created, copied, moved, and deleted. Key-value pairs are the common currency of NoSQL applications. A significant amount of source code is required to continually translate between application-native key-value pair data structures and the storage system’s simple block interface. But what if the storage media exported a native key-value pair interface? Enter Software Defined Storage. NoSQL applications could perform key-value pair put operations instead of a series of simple block writes. To retrieve data values, NoSQL applications could submit a key and get the corresponding value in return. And key-value pairs with a limited lifespan could be sent with expiry timers, with automatic flagging for garbage collection at expiration.
Our software-defined storage is also open source. It draws on the innovations of a community of developers, partners, and customers.
The result is agile, scalable, loosely coupled environments for unstructured data storage.
In a scale-out system, one of the biggest challenges is keeping track of the logical and physical
locations of data and metadata. Most distributed systems solve this problem by creating a metadata
server to track the location of data and metadata. As traditional systems add more files, more servers,
or more disks, the central metadata server becomes a performance bottleneck, as well as a central
point of failure.
. As traditional systems add more files, more servers,
or more disks, the central metadata server becomes a performance bottleneck, as well as a central
point of failure.
. As traditional systems add more files, more servers,
or more disks, the central metadata server becomes a performance bottleneck, as well as a central
point of failure.
Figure 1, above, illustrates how users can access application data and files in a Global namespace using a variety of standard protocols.
No longer are users locked into costly, monolithic, legacy storage platforms. GlusterFS gives users the ability to deploy scale-out, virtualized storage – scaling from terabytes to petabytes in a centrally managed and commoditized pool of storage.
Block Storage – They are devices through which the data is being moved across systems in the form of blocks.
Cluster – In Red Hat Storage, both cluster and trusted storage pool convey the same meaning of collaboration of storage servers based on a defined protocol.
Distributed File System – A file system in which data is spread over different nodes where users can access the file without knowing the actual location of the file. User doesn’t experience the feel of remote access.
Distributed File System – A file system in which data is spread over different nodes where users can access the file without knowing the actual location of the file. User doesn’t experience the feel of remote access.
FUSE – It is a loadable kernel module which allows users to create file systems above kernel without involving any of the kernel code.
glusterd – glusterd is the GlusterFS management daemon which is the backbone of file system which will be running throughout the whole time whenever the servers are in active state.
glusterd – glusterd is the GlusterFS management daemon which is the backbone of file system which will be running throughout the whole time whenever the servers are in active state.