SlideShare a Scribd company logo
Towards
a
Grid
File
System
Based
on
a
Large-Scale

BLOB
Management
Service

Viet-Trung
Tran1,
Gabriel
Antoniu2,
Bogdan
Nicolae3,
Luc
Bougé1,
Osamu
Tatebe4

1
ENS
Cachan
-
Brittany,
France
2
INRIA
Centre
Rennes
-
Bretagne-Atlantique,
France
3
University
of
Rennes
1,
France
4
University
of
Tsukuba,
Japan




                                                       1
New
Challenges
for
Large-scale
Data
Storage

Scalable
storage
management
for
new-generation,
data-oriented
high-performance
applications
     Massive,
unstructured
data
objects
(Terabytes)
     Many
data
objects
(10³)
     High
concurrency
(10³
concurrent
clients)
     Fine-grain
access
(Megabytes)
     Large-scale
distributed
platform:
large
clusters,
grids,
clouds,
desktop
grids


Applications:
distributed,
with
high
throughput
under
concurrency
     E-science
Data-centric
applications
     Storage
for
cloud
services
     Map-Reduce-based
data
mining
     Checkpointing
on
desktop
grids




                                                                                       2
BlobSeer:
a
BLOB-based
Approach

Developed
by
the
KerData
team
at
INRIA
Rennes
     Recently
created
from
the
PARIS
project-team

Generic
data-management
platform
for
huge,
unstructured
data
     Huge
data
(TB)
     Highly
concurrent,
fine-grain
access
(MB):
R/W/A                    BLOB
     Prototype
available

Key
design
features
     Decentralized
metadata
management
     Beyond
MVCC:
multiversioning
exposed
to
the
user
     Lock-free
write
access
through
versioning
            Write-once
pages

A
back-end
for
higher-level,
sophisticated
data
management
systems
     Short
term:
highly
scalable
distributed
file
systems
     Middle
term:
storage
for
cloud
services
     Long
term:
extremely
large
distributed
databases       http://blobseer.gforge.inria.fr/


                                                                                                3
BlobSeer:
Design                                       http://blobseer.gforge.inria.fr


Each
blob
is
fragmented
into
equally-sized
“pages”
     Allows
huge
data
amounts
to
be
distributed
all
over
the
peers
     Avoids
contention
for
simultaneous
accesses
to
disjoint
parts
of
the
data
block


Metadata
:
locate
pages
that
make
up
a
given
blob
     Fine-grained
and
distributed
     Efficiently
managed
through
a
DHT


Versioning
     Update/append:
generate
new
pages
rather
than
overwrite
     Metadata
is
extended
to
incorporate
the
update
     Both
the
old
and
the
new
version
of
the
blob
are
accessible




                                                                                         4
BlobSeer:
Architecture                                         Providers



Clients
     Perform
fine
grain
blob
accesses
Providers
     Store
the
pages
of
the
blob
                                                              Provider
Provider
manager                          Clients             manager
     Monitors
the
providers
     Favors
data
load
balancing                         Version
                                                         manager
Metadata
providers
     Store
information
about
page
location
Version
manager
     Ensures
concurrency
control


                                                    Metadata providers


                                                                           5
Version
Management
in
BlobSeer


               Beyond Multiversion
                  Concurrency
                    Control




                                     6
Background:
Object-based
File
Systems
From
block-based
to
object-based
file
systems
       Block
management
at
the
file
server
       Object
management
is
delegated
to
object-based
storage
devices
(OSDs)
or
storage
        servers

Advantages
     Scalability:
offload
~90%
of
workload
from
the
metadata
server
     OSDs
and
storage
servers
are
more
autonomous,
self-managed
and
easier
to
share



Examples
of
object-based
file
systems
     Lustre
(Schwan,
2003)
     Ceph
(Weil
et
al.,
2006)
     GoogleFS
(Ghemawat
et
al.,
2003)
     XtreemFS
(Hupfeld
et
al.,
2008)




                                                                                           7
Towards
a
BLOB-based
File
System

Goal:
Build
a
BLOB-based
file
system,
able
to
cope
with
huge
data
and
heavy
access
  concurrency
in
a
large-scale
distributed
environment
Hierarchical
approach
     High-level
file
system
metadata
management:
the
Gfarm
grid
file
system
     Low-level
object
management:
the
BlobSeer
BLOB
management
system
                                                        Gfarm




                                                     BlobSeer




                                                                                     8
The
Gfarm
Grid
File
System

The
Gfarm
file
system
[University
of
Tsukuba,
Japan]
A
distributed
file
system
designed
for
working
at
the
Grid
scale
File
can
be
shared
among
all
nodes
and
clients
Applications
can
access
files
using
the
same
path
regardless
of
the
file
location
Main
components
     Gfarm's
metadata
server
     File
system
nodes
     Gfarm
clients



                                                        gfmd:
Gfarm
management
daemon
                                                        gfsd
:
Gfarm
storage
daemon


                                                                                    9
The
Gfarm
Grid
File
System
[2]

Advanced
features
     User
management
     Authentication
and
single
sign-on
based
on
Grid
Security
Infrastructure
(GSI)
     POSIX
file
system
API
(gfarm2fs)
and
Gfarm
API

Limitations
     No
file
striping,
thus
file
size
is
limited
     No
access
concurrency
     No
versioning
capability




                                                                                      10
Why
combine
Gfarm
and
BlobSeer?

        Gfarm               BlobSeer              Gfarm/BlobSeer

   POSIX
interface                               POSIX
interface
                       Lack
of
POSIX
file
   User
management                               User
management
                       system
interface
   Support
GSI                                   Support
GSI



   File
sizes
are
                                                 Access
concurrency
   limited             Access
concurrency
                                                 Huge
file
sizes
   Not
suitable
for    Fine-grain
access
                                                 Fine-grain
access
   concurrent
access   Versioning
                                                 Versioning
   No
versioning



General
idea:
Gfarm
handles
file
metadata,
BlobSeer
handles
file
data

                                                                      11
Coupling
Gfarm
and
BlobSeer
[1]


   The
first
approach                           Gfarm
        Each
file
system
node
(gfsd)
         connects
to
BlobSeer
to
store/get
         Gfarm
file
data                                                1

        It
must
manage
the
mapping
from                            2
         Gfarm
files
to
BLOBs                                   3
        It
always
acts
as
an
intermediary
for
         data
transfer
         Bottleneck
at
the
file
system
node                 4
                                                 BlobSeer
     




                                                                            12
Coupling
Gfarm
and
BlobSeer
[1]


   The
first
approach                           Gfarm
        Each
file
system
node
(gfsd)
         connects
to
BlobSeer
to
store/get
         Gfarm
file
data                                                1

        It
must
manage
the
mapping
from                            2
         Gfarm
files
to
BLOBs                                   3
        It
always
acts
as
an
intermediary
for
         data
transfer
         Bottleneck
at
the
file
system
node                 4
                                                 BlobSeer
     




                                                                            13
Coupling
Gfarm
and
BlobSeer
[2]


   Second
approach                                 Gfarm
        The
gfsd
maps
Gfarm
files
to
BLOBs,
         and
responds
the
client's
request
within
         the
BLOB
ID                                                        1

        Then,
the
client
directly
access
data
in                   2
         BlobSeer                                           3

                                                                4


                                                                        5




                                                                                14
Gfarm/BlobSeer:
Design
Considerations

   The
whole
system
is
dimensioned
for
a
multi-site
Grid
        One
BlobSeer
instance
at
each
site

   The
client
can
dynamically
switch
between
these
two
well
defined
access
    modes
        Remote
access
mode
(red
lines):
if
the
client
cannot
directly
access
BlobSeer
        BlobSeer
direct
access
mode
(blue
lines):
the
clients
can
directly
access
BlobSeer




                                                                                              15
Discussion


   The
integrated
system
can
support
huge
file
sizes
since
Gfarm
file
is
now
    transparently
striped
over
BlobSeer
data
providers.
   Remote
access
mode
        Only
required
modifying
the
source
code
of
the
gfsd
daemon
        Could
not
resolve
the
current
bottleneck
problem
on
the
gfsd
daemon

   BlobSeer
direct
access
mode
        Supports
concurrent
accesses
        Was
more
complicated
in
implementation,
a
new
access
mode
had
to
be
         introduced
into
Gfarm




                                                                                16
Experimental
Evaluation
on
Grid'5000
[1]

   Access
throughput
with
no
concurrency
   Gfarm
benchmark
measures
throughput
when
writing/reading
   Configuration:
1
gfmd,
1
gfsd,
1
client,
9

data
providers,
8MB
page
size
   BlobSeer
direct
access
mode
provides
a
higher
throughput




               Writing                                 Reading

                                                                                17
Experimental
Evaluation
on
Grid'5000
[2]

   Access
throughput
under
concurrency
   Configuration
        1
gfmd
        1
gfsd
        24
data
providers
        Each
client
accesses
1GB
of
a
10GB
file
        Page
size
8MB

   Gfarm
sequentializes
concurrent
accesses




                                                   18
Experimental
evaluation
on
Grid'5000
[3]


   Access
throughput
under
heavy
    concurrency
   Configuration
(deployed
on
157
nodes
of
    Rennes
site)
        1
gfmd
        1
gfsd
        Each
client
accesses
1GB
of
a
64GB
file
        Page
size
8MB
        Up
to
64
concurrent
clients
        64
data
providers
        24
metadata
providers
        1
version
manager
        1
page
manager



                                                   19
Introducting
Versioning
in
Gfarm/BlobSeer

   Versioning
is
the
capability
of
accessing
data
of
a
specified
file
version
   Not
only
to
rollback
data
when
desired,
but
also
to
access
different
file
    versions
within
the
same
computation
   Favors
efficient
access
concurrency
   Approach
        Delegate
versioning
management
to
BlobSeer
        A
Gfarm
file
is
mapped
to
a
single
BLOB
        A
file
version
is
mapped
to
the
corresponding
version
of
the
BLOB




                                                                                 20
Difficulties

Gfarm
does
not
support
versioning
Some
issues
we
had
to
deal
with:
New
extended
API
for
clients
A
coordination
between
clients
and
gfsd
daemons
via
new
RPC
calls
Modification
of
the
inner
data
structures
of
Gfarm
in
order
to
handle
file
versions
Versioning
is
not
a
standard
feature
of
POSIX
file
systems




                                                                             21
Versioning
interface

   Versioning
capability
was
fully
implemented
   At
Gfarm
API
level
        gfs_get_current_version(GFS_File
gf,size_t
*nversion)‫‏‬
        gfs_get_latest_version(GFS_File
gf,size_t
*nversion)‫‏‬
        gfs_set_version(GFS_File
gf,size_t
version)‫‏‬
        gfs_pio_vread(size_t
nversion,GFS_File
gf,
void
*buffer,
int
size,
int
*np)‫‏‬

   At
POSIX
file
system
level
        Defined
some
ioctl
commands
                                           fd = open(argv[1], 0_RDWR);

                                           np = pwrite(fd, buffer_w, BUFFER_SIZE,0);
                                           ioctl(fd, BLOB_GET_LATEST_VERSION, &nversion);

                                           ioctl(fd, BLOB_SET_ACTIVE_VERSION, &nversion);
                                           np = pread(fd, buffer_r, BUFFER_SIZE,0);

                                           ioctl(fd, BLOB_GET_ACTIVE_VERSION, &nversion);
                                           close(fd);



                                                                                            22
Conclusion

   The
experimental
results
suggest
that
our
prototype
well
exploited
the
    combined
advantages
of
Gfarm
and
BlobSeer.
        A
file
system
API
        Concurrent
accesses
        Huge
file
sizes
(tested
up
to
64GB
file)
        Versioning

   Future
work
        Experiment
the
system
on
a
more
complex
topology
as
several
sites
of
         Grid'5000,
and
with
a
WAN
network
        Ensure
the
semantic
of
consistency
since
Gfarm
has
not
yet
maintained
cache
         coherence
between
buffers
on
clients
        Compare
our
prototype
to
other
Grid
file
systems




                                                                                       23
Application
scenarios


Grid
and
cloud
storage
for
data-mining
applications
with
massive
data
Distributed
storage
for
large-scale
Petascale
computing
applications
Storage
for
desktop
grid
applications
with
high
write-throughput
requirements
Storage
support
for
extremely
large
databases




                                                                                24

More Related Content

Viewers also liked

A zoom on membase vng
A zoom on membase vngA zoom on membase vng
A zoom on membase vng
Viet-Trung TRAN
 
Tachyon memory centric, fault tolerance storage for cluster framworks
Tachyon  memory centric, fault tolerance storage for cluster framworksTachyon  memory centric, fault tolerance storage for cluster framworks
Tachyon memory centric, fault tolerance storage for cluster framworks
Viet-Trung TRAN
 
Security of In-car wireless networks
Security of In-car wireless networksSecurity of In-car wireless networks
Security of In-car wireless networks
Viet-Trung TRAN
 
Scalable Data Management Systems for Big Data
Scalable Data Management Systems for Big DataScalable Data Management Systems for Big Data
Scalable Data Management Systems for Big Data
Viet-Trung TRAN
 
Scalable metadata management at very large scale filesystems: A Survey
Scalable metadata management at very large scale filesystems: A SurveyScalable metadata management at very large scale filesystems: A Survey
Scalable metadata management at very large scale filesystems: A SurveyViet-Trung TRAN
 
Bitcoin P2P currency
Bitcoin P2P currencyBitcoin P2P currency
Bitcoin P2P currency
Viet-Trung TRAN
 
Efficient Support for MPI-I/O Atomicity
Efficient Support for MPI-I/O AtomicityEfficient Support for MPI-I/O Atomicity
Efficient Support for MPI-I/O Atomicity
Viet-Trung TRAN
 
Some thoughts on apache spark & shark
Some thoughts on apache spark & sharkSome thoughts on apache spark & shark
Some thoughts on apache spark & shark
Viet-Trung TRAN
 
Pyramid: A large-scale array-oriented active storage system
Pyramid: A large-scale array-oriented active storage systemPyramid: A large-scale array-oriented active storage system
Pyramid: A large-scale array-oriented active storage system
Viet-Trung TRAN
 
3 - Finding similar items
3 - Finding similar items3 - Finding similar items
3 - Finding similar items
Viet-Trung TRAN
 
6 clustering
6 clustering6 clustering
6 clustering
Viet-Trung TRAN
 
OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents
Viet-Trung TRAN
 
Introduction to distributed file systems
Introduction to distributed file systemsIntroduction to distributed file systems
Introduction to distributed file systems
Viet-Trung TRAN
 
Dimensionality reduction: SVD and its applications
Dimensionality reduction: SVD and its applicationsDimensionality reduction: SVD and its applications
Dimensionality reduction: SVD and its applications
Viet-Trung TRAN
 
Overview of big data in cloud computing
Overview of big data in cloud computingOverview of big data in cloud computing
Overview of big data in cloud computing
Viet-Trung TRAN
 
Introduction to BigData @TCTK2015
Introduction to BigData @TCTK2015Introduction to BigData @TCTK2015
Introduction to BigData @TCTK2015
Viet-Trung TRAN
 
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Viet-Trung TRAN
 
From neural networks to deep learning
From neural networks to deep learningFrom neural networks to deep learning
From neural networks to deep learning
Viet-Trung TRAN
 

Viewers also liked (18)

A zoom on membase vng
A zoom on membase vngA zoom on membase vng
A zoom on membase vng
 
Tachyon memory centric, fault tolerance storage for cluster framworks
Tachyon  memory centric, fault tolerance storage for cluster framworksTachyon  memory centric, fault tolerance storage for cluster framworks
Tachyon memory centric, fault tolerance storage for cluster framworks
 
Security of In-car wireless networks
Security of In-car wireless networksSecurity of In-car wireless networks
Security of In-car wireless networks
 
Scalable Data Management Systems for Big Data
Scalable Data Management Systems for Big DataScalable Data Management Systems for Big Data
Scalable Data Management Systems for Big Data
 
Scalable metadata management at very large scale filesystems: A Survey
Scalable metadata management at very large scale filesystems: A SurveyScalable metadata management at very large scale filesystems: A Survey
Scalable metadata management at very large scale filesystems: A Survey
 
Bitcoin P2P currency
Bitcoin P2P currencyBitcoin P2P currency
Bitcoin P2P currency
 
Efficient Support for MPI-I/O Atomicity
Efficient Support for MPI-I/O AtomicityEfficient Support for MPI-I/O Atomicity
Efficient Support for MPI-I/O Atomicity
 
Some thoughts on apache spark & shark
Some thoughts on apache spark & sharkSome thoughts on apache spark & shark
Some thoughts on apache spark & shark
 
Pyramid: A large-scale array-oriented active storage system
Pyramid: A large-scale array-oriented active storage systemPyramid: A large-scale array-oriented active storage system
Pyramid: A large-scale array-oriented active storage system
 
3 - Finding similar items
3 - Finding similar items3 - Finding similar items
3 - Finding similar items
 
6 clustering
6 clustering6 clustering
6 clustering
 
OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents OCR processing with deep learning: Apply to Vietnamese documents
OCR processing with deep learning: Apply to Vietnamese documents
 
Introduction to distributed file systems
Introduction to distributed file systemsIntroduction to distributed file systems
Introduction to distributed file systems
 
Dimensionality reduction: SVD and its applications
Dimensionality reduction: SVD and its applicationsDimensionality reduction: SVD and its applications
Dimensionality reduction: SVD and its applications
 
Overview of big data in cloud computing
Overview of big data in cloud computingOverview of big data in cloud computing
Overview of big data in cloud computing
 
Introduction to BigData @TCTK2015
Introduction to BigData @TCTK2015Introduction to BigData @TCTK2015
Introduction to BigData @TCTK2015
 
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
 
From neural networks to deep learning
From neural networks to deep learningFrom neural networks to deep learning
From neural networks to deep learning
 

Similar to Towards A Grid File System Based On A Large-Scale BLOB Management Service

Progress on adapting BlobSeer to WAN scale
Progress on adapting BlobSeer to WAN scaleProgress on adapting BlobSeer to WAN scale
Progress on adapting BlobSeer to WAN scale
Viet-Trung TRAN
 
Rhel cluster gfs_improveperformance
Rhel cluster gfs_improveperformanceRhel cluster gfs_improveperformance
Rhel cluster gfs_improveperformance
sprdd
 
Google file system GFS
Google file system GFSGoogle file system GFS
Google file system GFS
zihad164
 
GFS & HDFS Introduction
GFS & HDFS IntroductionGFS & HDFS Introduction
GFS & HDFS Introduction
Hariharan Ganesan
 
Google
GoogleGoogle
Google
rpaikrao
 
Cloud Storage Adoption, Practice, and Deployment
Cloud Storage Adoption, Practice, and DeploymentCloud Storage Adoption, Practice, and Deployment
Cloud Storage Adoption, Practice, and Deployment
GlusterFS
 
Spectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSpectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN Caching
Sandeep Patil
 
Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...
Trishali Nayar
 
Compression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of TradeoffsCompression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of Tradeoffs
DataWorks Summit
 
Gpfs introandsetup
Gpfs introandsetupGpfs introandsetup
Gpfs introandsetup
asihan
 
Compression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of TradeoffsCompression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of Tradeoffs
DataWorks Summit
 
Who Lives in Our Garden?
Who Lives in Our Garden?Who Lives in Our Garden?
Who Lives in Our Garden?
Altoros
 
GR8Conf 2011: Adopting Grails
GR8Conf 2011: Adopting GrailsGR8Conf 2011: Adopting Grails
GR8Conf 2011: Adopting Grails
GR8Conf
 
Adopting Grails - GR8Conf Europe
Adopting Grails - GR8Conf EuropeAdopting Grails - GR8Conf Europe
Adopting Grails - GR8Conf Europe
KlausBaumecker
 
AFS introduction
AFS introductionAFS introduction
AFS introduction
Manfred Furuholmen
 
Hadoop and HDFS
Hadoop and HDFSHadoop and HDFS
Hadoop and HDFS
SatyaHadoop
 
Hadoop Summit San Jose 2013: Compression Options in Hadoop - A Tale of Tradeo...
Hadoop Summit San Jose 2013: Compression Options in Hadoop - A Tale of Tradeo...Hadoop Summit San Jose 2013: Compression Options in Hadoop - A Tale of Tradeo...
Hadoop Summit San Jose 2013: Compression Options in Hadoop - A Tale of Tradeo...
Sumeet Singh
 
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio ManfredOSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
NETWAYS
 
Gluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFSGluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFS
GlusterFS
 
storage-systems.pptx
storage-systems.pptxstorage-systems.pptx
storage-systems.pptx
ShimoFcis
 

Similar to Towards A Grid File System Based On A Large-Scale BLOB Management Service (20)

Progress on adapting BlobSeer to WAN scale
Progress on adapting BlobSeer to WAN scaleProgress on adapting BlobSeer to WAN scale
Progress on adapting BlobSeer to WAN scale
 
Rhel cluster gfs_improveperformance
Rhel cluster gfs_improveperformanceRhel cluster gfs_improveperformance
Rhel cluster gfs_improveperformance
 
Google file system GFS
Google file system GFSGoogle file system GFS
Google file system GFS
 
GFS & HDFS Introduction
GFS & HDFS IntroductionGFS & HDFS Introduction
GFS & HDFS Introduction
 
Google
GoogleGoogle
Google
 
Cloud Storage Adoption, Practice, and Deployment
Cloud Storage Adoption, Practice, and DeploymentCloud Storage Adoption, Practice, and Deployment
Cloud Storage Adoption, Practice, and Deployment
 
Spectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSpectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN Caching
 
Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...
 
Compression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of TradeoffsCompression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of Tradeoffs
 
Gpfs introandsetup
Gpfs introandsetupGpfs introandsetup
Gpfs introandsetup
 
Compression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of TradeoffsCompression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of Tradeoffs
 
Who Lives in Our Garden?
Who Lives in Our Garden?Who Lives in Our Garden?
Who Lives in Our Garden?
 
GR8Conf 2011: Adopting Grails
GR8Conf 2011: Adopting GrailsGR8Conf 2011: Adopting Grails
GR8Conf 2011: Adopting Grails
 
Adopting Grails - GR8Conf Europe
Adopting Grails - GR8Conf EuropeAdopting Grails - GR8Conf Europe
Adopting Grails - GR8Conf Europe
 
AFS introduction
AFS introductionAFS introduction
AFS introduction
 
Hadoop and HDFS
Hadoop and HDFSHadoop and HDFS
Hadoop and HDFS
 
Hadoop Summit San Jose 2013: Compression Options in Hadoop - A Tale of Tradeo...
Hadoop Summit San Jose 2013: Compression Options in Hadoop - A Tale of Tradeo...Hadoop Summit San Jose 2013: Compression Options in Hadoop - A Tale of Tradeo...
Hadoop Summit San Jose 2013: Compression Options in Hadoop - A Tale of Tradeo...
 
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio ManfredOSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
 
Gluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFSGluster Webinar: Introduction to GlusterFS
Gluster Webinar: Introduction to GlusterFS
 
storage-systems.pptx
storage-systems.pptxstorage-systems.pptx
storage-systems.pptx
 

More from Viet-Trung TRAN

Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017
Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017
Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017
Viet-Trung TRAN
 
Dynamo: Amazon’s Highly Available Key-value Store
Dynamo: Amazon’s Highly Available Key-value StoreDynamo: Amazon’s Highly Available Key-value Store
Dynamo: Amazon’s Highly Available Key-value Store
Viet-Trung TRAN
 
Pregel: Hệ thống xử lý đồ thị lớn
Pregel: Hệ thống xử lý đồ thị lớnPregel: Hệ thống xử lý đồ thị lớn
Pregel: Hệ thống xử lý đồ thị lớn
Viet-Trung TRAN
 
Mapreduce simplified-data-processing
Mapreduce simplified-data-processingMapreduce simplified-data-processing
Mapreduce simplified-data-processing
Viet-Trung TRAN
 
Tìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của Facebook
Tìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của FacebookTìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của Facebook
Tìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của Facebook
Viet-Trung TRAN
 
giasan.vn real-estate analytics: a Vietnam case study
giasan.vn real-estate analytics: a Vietnam case studygiasan.vn real-estate analytics: a Vietnam case study
giasan.vn real-estate analytics: a Vietnam case study
Viet-Trung TRAN
 
Giasan.vn @rstars
Giasan.vn @rstarsGiasan.vn @rstars
Giasan.vn @rstars
Viet-Trung TRAN
 
A Vietnamese Language Model Based on Recurrent Neural Network
A Vietnamese Language Model Based on Recurrent Neural NetworkA Vietnamese Language Model Based on Recurrent Neural Network
A Vietnamese Language Model Based on Recurrent Neural Network
Viet-Trung TRAN
 
A Vietnamese Language Model Based on Recurrent Neural Network
A Vietnamese Language Model Based on Recurrent Neural NetworkA Vietnamese Language Model Based on Recurrent Neural Network
A Vietnamese Language Model Based on Recurrent Neural Network
Viet-Trung TRAN
 
Large-Scale Geographically Weighted Regression on Spark
Large-Scale Geographically Weighted Regression on SparkLarge-Scale Geographically Weighted Regression on Spark
Large-Scale Geographically Weighted Regression on Spark
Viet-Trung TRAN
 
Recent progress on distributing deep learning
Recent progress on distributing deep learningRecent progress on distributing deep learning
Recent progress on distributing deep learning
Viet-Trung TRAN
 
success factors for project proposals
success factors for project proposalssuccess factors for project proposals
success factors for project proposals
Viet-Trung TRAN
 
GPSinsights poster
GPSinsights posterGPSinsights poster
GPSinsights poster
Viet-Trung TRAN
 
Deep learning for nlp
Deep learning for nlpDeep learning for nlp
Deep learning for nlp
Viet-Trung TRAN
 
From decision trees to random forests
From decision trees to random forestsFrom decision trees to random forests
From decision trees to random forests
Viet-Trung TRAN
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
Viet-Trung TRAN
 
Introduction to mining massive datasets
Introduction to mining massive datasetsIntroduction to mining massive datasets
Introduction to mining massive datasets
Viet-Trung TRAN
 
2 association rules
2 association rules2 association rules
2 association rules
Viet-Trung TRAN
 
Interactive big data analytics
Interactive big data analyticsInteractive big data analytics
Interactive big data analytics
Viet-Trung TRAN
 
Hệ thống phân tích tình trạng giao thông: Ứng dụng công cụ xử lý dữ liệu lớn...
Hệ thống phân tích tình trạng giao thông:  Ứng dụng công cụ xử lý dữ liệu lớn...Hệ thống phân tích tình trạng giao thông:  Ứng dụng công cụ xử lý dữ liệu lớn...
Hệ thống phân tích tình trạng giao thông: Ứng dụng công cụ xử lý dữ liệu lớn...
Viet-Trung TRAN
 

More from Viet-Trung TRAN (20)

Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017
Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017
Bắt đầu tìm hiểu về dữ liệu lớn như thế nào - 2017
 
Dynamo: Amazon’s Highly Available Key-value Store
Dynamo: Amazon’s Highly Available Key-value StoreDynamo: Amazon’s Highly Available Key-value Store
Dynamo: Amazon’s Highly Available Key-value Store
 
Pregel: Hệ thống xử lý đồ thị lớn
Pregel: Hệ thống xử lý đồ thị lớnPregel: Hệ thống xử lý đồ thị lớn
Pregel: Hệ thống xử lý đồ thị lớn
 
Mapreduce simplified-data-processing
Mapreduce simplified-data-processingMapreduce simplified-data-processing
Mapreduce simplified-data-processing
 
Tìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của Facebook
Tìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của FacebookTìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của Facebook
Tìm kiếm needle trong Haystack: Hệ thống lưu trữ ảnh của Facebook
 
giasan.vn real-estate analytics: a Vietnam case study
giasan.vn real-estate analytics: a Vietnam case studygiasan.vn real-estate analytics: a Vietnam case study
giasan.vn real-estate analytics: a Vietnam case study
 
Giasan.vn @rstars
Giasan.vn @rstarsGiasan.vn @rstars
Giasan.vn @rstars
 
A Vietnamese Language Model Based on Recurrent Neural Network
A Vietnamese Language Model Based on Recurrent Neural NetworkA Vietnamese Language Model Based on Recurrent Neural Network
A Vietnamese Language Model Based on Recurrent Neural Network
 
A Vietnamese Language Model Based on Recurrent Neural Network
A Vietnamese Language Model Based on Recurrent Neural NetworkA Vietnamese Language Model Based on Recurrent Neural Network
A Vietnamese Language Model Based on Recurrent Neural Network
 
Large-Scale Geographically Weighted Regression on Spark
Large-Scale Geographically Weighted Regression on SparkLarge-Scale Geographically Weighted Regression on Spark
Large-Scale Geographically Weighted Regression on Spark
 
Recent progress on distributing deep learning
Recent progress on distributing deep learningRecent progress on distributing deep learning
Recent progress on distributing deep learning
 
success factors for project proposals
success factors for project proposalssuccess factors for project proposals
success factors for project proposals
 
GPSinsights poster
GPSinsights posterGPSinsights poster
GPSinsights poster
 
Deep learning for nlp
Deep learning for nlpDeep learning for nlp
Deep learning for nlp
 
From decision trees to random forests
From decision trees to random forestsFrom decision trees to random forests
From decision trees to random forests
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
 
Introduction to mining massive datasets
Introduction to mining massive datasetsIntroduction to mining massive datasets
Introduction to mining massive datasets
 
2 association rules
2 association rules2 association rules
2 association rules
 
Interactive big data analytics
Interactive big data analyticsInteractive big data analytics
Interactive big data analytics
 
Hệ thống phân tích tình trạng giao thông: Ứng dụng công cụ xử lý dữ liệu lớn...
Hệ thống phân tích tình trạng giao thông:  Ứng dụng công cụ xử lý dữ liệu lớn...Hệ thống phân tích tình trạng giao thông:  Ứng dụng công cụ xử lý dữ liệu lớn...
Hệ thống phân tích tình trạng giao thông: Ứng dụng công cụ xử lý dữ liệu lớn...
 

Recently uploaded

Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
saastr
 
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
Edge AI and Vision Alliance
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
marufrahmanstratejm
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 

Recently uploaded (20)

Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
 
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 

Towards A Grid File System Based On A Large-Scale BLOB Management Service