SlideShare a Scribd company logo
OSDC 2012
XtreemFS
Extreme cloud file system?!
Udo Seidel
OSDC 2012
Agenda
● Background/motivation
● High level overview
● High Availability
● Security
● Summary
OSDC 2012
Distributed file systems
● Part of shared file systems family
● Around for a while
● “back” in scope
● Storage challenges
– More
– Faster
– Cheaper
● XaaS
OSDC 2012
Shared file systems family
● Multiple server access the same data
● Different approaches
● Network based, e.g. NFS, CIFS
● Clustered
– Shared disk, e.g. CXFS, CFS, GFS(2), OCFS2
– Distributed, e.g. Lustre, CephFS, GlusterFS .... and
XtreemFS
OSDC 2012
Distributed file systems – why?
● More efficient utilization of distributed hardware
● Storage
● CPU/Network
● Scalability ... capacity demands
● Amount
● I/O requirements
OSDC 2012
Distributed file systems – which?
● HDFS (Hadoop)
● CephFS ..
● GlusterFS .. RedHat
● ...
● XtreemFS
OSDC 2012
History
● European Research project (2006-2010)
● Part of XtreemOS
● Linux based grid O/S
● Member of OpenGridForum
● Need of distributed file system
OSDC 2012
XtreemFS and storage
● Distributed file system => distributed storage
● Object base storage approach
OSDC 2012
Storage – looking back
● Not very intelligent
● Simple and well documented interface, e.g.
SCSI standard
● Storage management outside the disks
OSDC 2012
Storage – these days
● Storage hardware powerful => Re-define: tasks
of storage hardware and attached computer
● Shift of responsibilities towards storage
● Block allocation
● Space management
● Storage objects instead of blocks
● Extension of interface -> OSD standard
OSDC 2012
Object Based Storage I
● Objects of quite general nature
● Files
● Partitions
● ID for each storage object
● Separation of meta data operation and storing
file data
● HA not covered at all
● Object based Storage Devices
OSDC 2012
Object Based Storage II
● OSD software implementation
● Usual an additional layer between between
computer and storage
● Presents object-based file system to the computer
● Use a “normal” file system to store data on the
storage
● Delivered as part of XtreemFS
● File systems: LUSTRE, EXOFS, CephFS
OSDC 2012
Implementation I
● Java
● Supported O/S
– Linux
– MacOS X with manual work
– Free/Net/OpenBSD?
– No Windows anymore
● Server and Client (fuse) ... both in user space
● Non-privileged user
OSDC 2012
Implementation II
● IP based
● Different ports for different XtreemFS services
● Clear text vs. encrypted
● Object based storage
● Software implementation
● OSD features in XtreemFS code
– Copy on write
– Snapshotting
OSDC 2012
XtreemFS – the architecture I
● 4 components
● Object based Storage Devices
● Meta Data and Replica Catalogue Servers
● Directory Service
● Clients ;-)
OSDC 2012
XtreemFS – the architecture II
OSDC 2012
XtreemFS services
● Several
● OSD
● MRC
● Volumes
● UUID's
● Abstraction from network
● Change requires outage
● Plans for topology
OSDC 2012
XtreemFS – DIR/MRC data
● Data stored locally
● BabuDB
● Independent of OSD
● Write buffers
ModusModus DescriptionDescription
ASYNC Asynchronous log entry write
FSYNC Fsync() called after log entry write and before
ack'ing of operation
SYNC_WRITE Synchronous log entry write, ack'ing of operation
before meta data update
SYNC_WRITE_METADATA Synchronous log entry write and meta data
update before ack'ing of operation
OSDC 2012
XtreemFS – OSD data
● File cut in 128 Kbyte pieces
● Default: entire file on one OSD
● Distribution across multiple OSD's possible
● RAID 0 implemented
● RAID 5 planned
● Parallel reads/writes
OSDC 2012
XtreemFS interfaces
● HTTP
● Read-only
● Read-write planned
● Command line
● All purposes
OSDC 2012
XtreemFS interfaces
OSDC 2012
XtreemFS – high level summary
● Multi-platform
● Abstraction via UUID
● Communication separation
● Freedom of choice of OSD backend file system
● HPC out of scope
OSDC 2012
XtreemFS – HA in general
● One part: OSD
● Replication via policies
● Other part: MRC and DIR
● Local data stored in BabuDB's
● Synchronization via BabuDB methods
OSDC 2012
XtreemFS – HA for MRC/DIR
● Master/slave
● Master changes -> log file without buffering
● Log file entries propagation to slaves
● Quorum needed => at least 3 instances
● No automation for DIR
● Synchronization
● in clear text
● Encryption via SSL possible
OSDC 2012
XtreemFS OSD replication
● File replication
● Read-only
– Since 1.0
– Easy to handle
● Read-write
– Only since 1.3
– Later more
● Copies
● Full
● Partial aka on-demand
OSDC 2012
XtreemFS r/o replication
● Arbitrary amount of replicas
● Equally treated replicas
● Only OSD local access
● No sync needed
● Use case
● Static files :-)
● Low bandwidth (partial replica)
● Big static files (partial replica)
OSDC 2012
XtreemFS r/w replication
● Primary/secondary
● Election on demand with leases
● Read/write access
● First primary
● Propagated to secondaries
OSDC 2012
XtreemFS r/w replication - failure
● Secondary
● Behaviour configurable
● Write failure vs. Write on remaining
– Quorum needed
● Primary
● Behaviour configurable
● Write failure vs. Write on remaining
– Quorum needed
OSDC 2012
XtreemFS OSD/replica policies
● OSD selection for new files
● Replica selection for new/additional copies
● Categories: filter, group, sort
● Combination of rules
Policy Category
Standard OSD filter
FQDN based filter, group, sort
UUID based filter
Data center topology group, sort
random sort
OSDC 2012
XtreemFS HA summary
● Homework needed for DIR and MRC
● OSD
● Lateness of OSD read-write replication
● OSD Read-only replication
– Mature and WAN ready
● Access time improvement via striping
● Flexibility of policies
OSDC 2012
XtreemFS encryption
● Not on file system level
● For communication
● Interaction of DIR, MRC and OSD
● Data replication for HA for DIR and/or MRC
OSDC 2012
XtreemFS channel encryption
● Via SSL
● PCKS#12 or Java Key Store (JKS)
● Locally stored
– service/client certificates
– root CA certificates
● Two modes
● All-Or-Nothing approach
● Grid-SSL
– just authentication
OSDC 2012
XtreemFS secure channel
encryption
● Password protection of certificates
● MRC/DIR/OSD: stored service configuration
● Client: via CLI!!
OSDC 2012
XtreemFS encryption summary
● Data encryption on POSIX layer?
● SSL obvious choice for TCP/IP channels
● Missing PKI contradicts scalability
● Password protection needs re-design
OSDC 2012
Summary
● High self-defined goals
● Some dropped?
● Some partially implemented
● Ok for R&D Labs
● HA and housekeeping improvement needed
● Encryption w/o PKI
OSDC 2012
References
● http://www.xtreemfs.org
● http://babudb.googlecode.com
OSDC 2012
Thank you!

More Related Content

What's hot

Sdc challenges-2012
Sdc challenges-2012Sdc challenges-2012
Sdc challenges-2012
Gluster.org
 
Cncf meetup-rook
Cncf meetup-rookCncf meetup-rook
Cncf meetup-rook
Juraj Hantak
 
Cncf meetup-rook
Cncf meetup-rookCncf meetup-rook
Cncf meetup-rook
Juraj Hantak
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdose
Gluster.org
 
State of the_gluster_-_lceu
State of the_gluster_-_lceuState of the_gluster_-_lceu
State of the_gluster_-_lceu
Gluster.org
 
Gluster Data Tiering
Gluster Data TieringGluster Data Tiering
Gluster Data Tiering
Joseph Elwin Fernandes
 
Lt2013 glusterfs.talk
Lt2013 glusterfs.talkLt2013 glusterfs.talk
Lt2013 glusterfs.talk
Udo Seidel
 
Disperse xlator ramon_datalab
Disperse xlator ramon_datalabDisperse xlator ramon_datalab
Disperse xlator ramon_datalab
Gluster.org
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep Dive
Gluster.org
 
Glusterfs and openstack
Glusterfs  and openstackGlusterfs  and openstack
Glusterfs and openstack
openstackindia
 
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vosOSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
NETWAYS
 
Hungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsHungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applications
Ferenc Szalai
 
GlusterFS And Big Data
GlusterFS And Big DataGlusterFS And Big Data
GlusterFS And Big Data
Lalatendu Mohanty
 
GlusterFS Talk for CentOS Dojo Bangalore
GlusterFS Talk for CentOS Dojo BangaloreGlusterFS Talk for CentOS Dojo Bangalore
GlusterFS Talk for CentOS Dojo Bangalore
Raghavendra Talur
 
Comparison between OCFS2 and GFS2
Comparison between OCFS2 and GFS2Comparison between OCFS2 and GFS2
Comparison between OCFS2 and GFS2
Gang He
 
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Gluster.org
 
Cache Tiering and Erasure Coding
Cache Tiering and Erasure CodingCache Tiering and Erasure Coding
Cache Tiering and Erasure Coding
Shinobu Kinjo
 
Storage as a Service with Gluster
Storage as a Service with GlusterStorage as a Service with Gluster
Storage as a Service with Gluster
Vijay Bellur
 
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Atin Mukherjee
 
Life as a GlusterFS Consultant with Ivan Rossi
Life as a GlusterFS Consultant with Ivan RossiLife as a GlusterFS Consultant with Ivan Rossi
Life as a GlusterFS Consultant with Ivan Rossi
Gluster.org
 

What's hot (20)

Sdc challenges-2012
Sdc challenges-2012Sdc challenges-2012
Sdc challenges-2012
 
Cncf meetup-rook
Cncf meetup-rookCncf meetup-rook
Cncf meetup-rook
 
Cncf meetup-rook
Cncf meetup-rookCncf meetup-rook
Cncf meetup-rook
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdose
 
State of the_gluster_-_lceu
State of the_gluster_-_lceuState of the_gluster_-_lceu
State of the_gluster_-_lceu
 
Gluster Data Tiering
Gluster Data TieringGluster Data Tiering
Gluster Data Tiering
 
Lt2013 glusterfs.talk
Lt2013 glusterfs.talkLt2013 glusterfs.talk
Lt2013 glusterfs.talk
 
Disperse xlator ramon_datalab
Disperse xlator ramon_datalabDisperse xlator ramon_datalab
Disperse xlator ramon_datalab
 
Dustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep DiveDustin Black - Red Hat Storage Server Administration Deep Dive
Dustin Black - Red Hat Storage Server Administration Deep Dive
 
Glusterfs and openstack
Glusterfs  and openstackGlusterfs  and openstack
Glusterfs and openstack
 
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vosOSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
 
Hungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsHungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applications
 
GlusterFS And Big Data
GlusterFS And Big DataGlusterFS And Big Data
GlusterFS And Big Data
 
GlusterFS Talk for CentOS Dojo Bangalore
GlusterFS Talk for CentOS Dojo BangaloreGlusterFS Talk for CentOS Dojo Bangalore
GlusterFS Talk for CentOS Dojo Bangalore
 
Comparison between OCFS2 and GFS2
Comparison between OCFS2 and GFS2Comparison between OCFS2 and GFS2
Comparison between OCFS2 and GFS2
 
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
 
Cache Tiering and Erasure Coding
Cache Tiering and Erasure CodingCache Tiering and Erasure Coding
Cache Tiering and Erasure Coding
 
Storage as a Service with Gluster
Storage as a Service with GlusterStorage as a Service with Gluster
Storage as a Service with Gluster
 
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
 
Life as a GlusterFS Consultant with Ivan Rossi
Life as a GlusterFS Consultant with Ivan RossiLife as a GlusterFS Consultant with Ivan Rossi
Life as a GlusterFS Consultant with Ivan Rossi
 

Similar to OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel

Osdc2012 xtfs.talk
Osdc2012 xtfs.talkOsdc2012 xtfs.talk
Osdc2012 xtfs.talk
Udo Seidel
 
Linuxtag.ceph.talk
Linuxtag.ceph.talkLinuxtag.ceph.talk
Linuxtag.ceph.talk
Udo Seidel
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
sprdd
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
sprdd
 
adp.ceph.openstack.talk
adp.ceph.openstack.talkadp.ceph.openstack.talk
adp.ceph.openstack.talk
Udo Seidel
 
2021.02 new in Ceph Pacific Dashboard
2021.02 new in Ceph Pacific Dashboard2021.02 new in Ceph Pacific Dashboard
2021.02 new in Ceph Pacific Dashboard
Ceph Community
 
Community Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonCommunity Update at OpenStack Summit Boston
Community Update at OpenStack Summit Boston
Sage Weil
 
Cephfsglusterfs.talk
Cephfsglusterfs.talkCephfsglusterfs.talk
Cephfsglusterfs.talk
Udo Seidel
 
CephFS in Jewel: Stable at Last
CephFS in Jewel: Stable at LastCephFS in Jewel: Stable at Last
CephFS in Jewel: Stable at Last
Ceph Community
 
MongoDB: Advantages of an Open Source NoSQL Database
MongoDB: Advantages of an Open Source NoSQL DatabaseMongoDB: Advantages of an Open Source NoSQL Database
MongoDB: Advantages of an Open Source NoSQL Database
FITC
 
CS 626 - March : Capsicum: Practical Capabilities for UNIX
CS 626 - March : Capsicum: Practical Capabilities for UNIXCS 626 - March : Capsicum: Practical Capabilities for UNIX
CS 626 - March : Capsicum: Practical Capabilities for UNIX
ruchith
 
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
Dávid Kőszeghy
 
OSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage SystemOSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage System
NETWAYS
 
Hadoop operations basic
Hadoop operations basicHadoop operations basic
Hadoop operations basic
Hafizur Rahman
 
An Introduction to Redis for .NET Developers.pdf
An Introduction to Redis for .NET Developers.pdfAn Introduction to Redis for .NET Developers.pdf
An Introduction to Redis for .NET Developers.pdf
Stephen Lorello
 
INFINISTORE(tm) - Scalable Open Source Storage Arhcitecture
INFINISTORE(tm) - Scalable Open Source Storage ArhcitectureINFINISTORE(tm) - Scalable Open Source Storage Arhcitecture
INFINISTORE(tm) - Scalable Open Source Storage Arhcitecture
Thomas Uhl
 
Red Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red Hat Gluster Storage - Direction, Roadmap and Use-CasesRed Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red_Hat_Storage
 
Ceph storage for ocp deploying and managing ceph on top of open shift conta...
Ceph storage for ocp   deploying and managing ceph on top of open shift conta...Ceph storage for ocp   deploying and managing ceph on top of open shift conta...
Ceph storage for ocp deploying and managing ceph on top of open shift conta...
OrFriedmann
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red_Hat_Storage
 
Quick-and-Easy Deployment of a Ceph Storage Cluster
Quick-and-Easy Deployment of a Ceph Storage ClusterQuick-and-Easy Deployment of a Ceph Storage Cluster
Quick-and-Easy Deployment of a Ceph Storage Cluster
Patrick Quairoli
 

Similar to OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel (20)

Osdc2012 xtfs.talk
Osdc2012 xtfs.talkOsdc2012 xtfs.talk
Osdc2012 xtfs.talk
 
Linuxtag.ceph.talk
Linuxtag.ceph.talkLinuxtag.ceph.talk
Linuxtag.ceph.talk
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
 
adp.ceph.openstack.talk
adp.ceph.openstack.talkadp.ceph.openstack.talk
adp.ceph.openstack.talk
 
2021.02 new in Ceph Pacific Dashboard
2021.02 new in Ceph Pacific Dashboard2021.02 new in Ceph Pacific Dashboard
2021.02 new in Ceph Pacific Dashboard
 
Community Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonCommunity Update at OpenStack Summit Boston
Community Update at OpenStack Summit Boston
 
Cephfsglusterfs.talk
Cephfsglusterfs.talkCephfsglusterfs.talk
Cephfsglusterfs.talk
 
CephFS in Jewel: Stable at Last
CephFS in Jewel: Stable at LastCephFS in Jewel: Stable at Last
CephFS in Jewel: Stable at Last
 
MongoDB: Advantages of an Open Source NoSQL Database
MongoDB: Advantages of an Open Source NoSQL DatabaseMongoDB: Advantages of an Open Source NoSQL Database
MongoDB: Advantages of an Open Source NoSQL Database
 
CS 626 - March : Capsicum: Practical Capabilities for UNIX
CS 626 - March : Capsicum: Practical Capabilities for UNIXCS 626 - March : Capsicum: Practical Capabilities for UNIX
CS 626 - March : Capsicum: Practical Capabilities for UNIX
 
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
 
OSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage SystemOSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage System
 
Hadoop operations basic
Hadoop operations basicHadoop operations basic
Hadoop operations basic
 
An Introduction to Redis for .NET Developers.pdf
An Introduction to Redis for .NET Developers.pdfAn Introduction to Redis for .NET Developers.pdf
An Introduction to Redis for .NET Developers.pdf
 
INFINISTORE(tm) - Scalable Open Source Storage Arhcitecture
INFINISTORE(tm) - Scalable Open Source Storage ArhcitectureINFINISTORE(tm) - Scalable Open Source Storage Arhcitecture
INFINISTORE(tm) - Scalable Open Source Storage Arhcitecture
 
Red Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red Hat Gluster Storage - Direction, Roadmap and Use-CasesRed Hat Gluster Storage - Direction, Roadmap and Use-Cases
Red Hat Gluster Storage - Direction, Roadmap and Use-Cases
 
Ceph storage for ocp deploying and managing ceph on top of open shift conta...
Ceph storage for ocp   deploying and managing ceph on top of open shift conta...Ceph storage for ocp   deploying and managing ceph on top of open shift conta...
Ceph storage for ocp deploying and managing ceph on top of open shift conta...
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
 
Quick-and-Easy Deployment of a Ceph Storage Cluster
Quick-and-Easy Deployment of a Ceph Storage ClusterQuick-and-Easy Deployment of a Ceph Storage Cluster
Quick-and-Easy Deployment of a Ceph Storage Cluster
 

Recently uploaded

8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
kalichargn70th171
 
Using Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional SafetyUsing Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional Safety
Ayan Halder
 
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeA Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
Aftab Hussain
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
Shane Coughlan
 
Hand Rolled Applicative User Validation Code Kata
Hand Rolled Applicative User ValidationCode KataHand Rolled Applicative User ValidationCode Kata
Hand Rolled Applicative User Validation Code Kata
Philip Schwarz
 
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...Introducing Crescat - Event Management Software for Venues, Festivals and Eve...
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...
Crescat
 
GreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-JurisicGreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-Jurisic
Green Software Development
 
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CDKuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
rodomar2
 
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise EditionWhy Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Envertis Software Solutions
 
Empowering Growth with Best Software Development Company in Noida - Deuglo
Empowering Growth with Best Software  Development Company in Noida - DeugloEmpowering Growth with Best Software  Development Company in Noida - Deuglo
Empowering Growth with Best Software Development Company in Noida - Deuglo
Deuglo Infosystem Pvt Ltd
 
DDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systemsDDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systems
Gerardo Pardo-Castellote
 
Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
Remote DBA Services
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
Octavian Nadolu
 
E-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet DynamicsE-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet Dynamics
Hornet Dynamics
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
Google
 
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit ParisNeo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j
 
What is Master Data Management by PiLog Group
What is Master Data Management by PiLog GroupWhat is Master Data Management by PiLog Group
What is Master Data Management by PiLog Group
aymanquadri279
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
Aftab Hussain
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata
 
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
Łukasz Chruściel
 

Recently uploaded (20)

8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
 
Using Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional SafetyUsing Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional Safety
 
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeA Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
 
Hand Rolled Applicative User Validation Code Kata
Hand Rolled Applicative User ValidationCode KataHand Rolled Applicative User ValidationCode Kata
Hand Rolled Applicative User Validation Code Kata
 
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...Introducing Crescat - Event Management Software for Venues, Festivals and Eve...
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...
 
GreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-JurisicGreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-Jurisic
 
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CDKuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
 
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise EditionWhy Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
Why Choose Odoo 17 Community & How it differs from Odoo 17 Enterprise Edition
 
Empowering Growth with Best Software Development Company in Noida - Deuglo
Empowering Growth with Best Software  Development Company in Noida - DeugloEmpowering Growth with Best Software  Development Company in Noida - Deuglo
Empowering Growth with Best Software Development Company in Noida - Deuglo
 
DDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systemsDDS-Security 1.2 - What's New? Stronger security for long-running systems
DDS-Security 1.2 - What's New? Stronger security for long-running systems
 
Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
 
E-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet DynamicsE-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet Dynamics
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
 
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit ParisNeo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
 
What is Master Data Management by PiLog Group
What is Master Data Management by PiLog GroupWhat is Master Data Management by PiLog Group
What is Master Data Management by PiLog Group
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
 
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
 

OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel

  • 1. OSDC 2012 XtreemFS Extreme cloud file system?! Udo Seidel
  • 2. OSDC 2012 Agenda ● Background/motivation ● High level overview ● High Availability ● Security ● Summary
  • 3. OSDC 2012 Distributed file systems ● Part of shared file systems family ● Around for a while ● “back” in scope ● Storage challenges – More – Faster – Cheaper ● XaaS
  • 4. OSDC 2012 Shared file systems family ● Multiple server access the same data ● Different approaches ● Network based, e.g. NFS, CIFS ● Clustered – Shared disk, e.g. CXFS, CFS, GFS(2), OCFS2 – Distributed, e.g. Lustre, CephFS, GlusterFS .... and XtreemFS
  • 5. OSDC 2012 Distributed file systems – why? ● More efficient utilization of distributed hardware ● Storage ● CPU/Network ● Scalability ... capacity demands ● Amount ● I/O requirements
  • 6. OSDC 2012 Distributed file systems – which? ● HDFS (Hadoop) ● CephFS .. ● GlusterFS .. RedHat ● ... ● XtreemFS
  • 7. OSDC 2012 History ● European Research project (2006-2010) ● Part of XtreemOS ● Linux based grid O/S ● Member of OpenGridForum ● Need of distributed file system
  • 8. OSDC 2012 XtreemFS and storage ● Distributed file system => distributed storage ● Object base storage approach
  • 9. OSDC 2012 Storage – looking back ● Not very intelligent ● Simple and well documented interface, e.g. SCSI standard ● Storage management outside the disks
  • 10. OSDC 2012 Storage – these days ● Storage hardware powerful => Re-define: tasks of storage hardware and attached computer ● Shift of responsibilities towards storage ● Block allocation ● Space management ● Storage objects instead of blocks ● Extension of interface -> OSD standard
  • 11. OSDC 2012 Object Based Storage I ● Objects of quite general nature ● Files ● Partitions ● ID for each storage object ● Separation of meta data operation and storing file data ● HA not covered at all ● Object based Storage Devices
  • 12. OSDC 2012 Object Based Storage II ● OSD software implementation ● Usual an additional layer between between computer and storage ● Presents object-based file system to the computer ● Use a “normal” file system to store data on the storage ● Delivered as part of XtreemFS ● File systems: LUSTRE, EXOFS, CephFS
  • 13. OSDC 2012 Implementation I ● Java ● Supported O/S – Linux – MacOS X with manual work – Free/Net/OpenBSD? – No Windows anymore ● Server and Client (fuse) ... both in user space ● Non-privileged user
  • 14. OSDC 2012 Implementation II ● IP based ● Different ports for different XtreemFS services ● Clear text vs. encrypted ● Object based storage ● Software implementation ● OSD features in XtreemFS code – Copy on write – Snapshotting
  • 15. OSDC 2012 XtreemFS – the architecture I ● 4 components ● Object based Storage Devices ● Meta Data and Replica Catalogue Servers ● Directory Service ● Clients ;-)
  • 16. OSDC 2012 XtreemFS – the architecture II
  • 17. OSDC 2012 XtreemFS services ● Several ● OSD ● MRC ● Volumes ● UUID's ● Abstraction from network ● Change requires outage ● Plans for topology
  • 18. OSDC 2012 XtreemFS – DIR/MRC data ● Data stored locally ● BabuDB ● Independent of OSD ● Write buffers ModusModus DescriptionDescription ASYNC Asynchronous log entry write FSYNC Fsync() called after log entry write and before ack'ing of operation SYNC_WRITE Synchronous log entry write, ack'ing of operation before meta data update SYNC_WRITE_METADATA Synchronous log entry write and meta data update before ack'ing of operation
  • 19. OSDC 2012 XtreemFS – OSD data ● File cut in 128 Kbyte pieces ● Default: entire file on one OSD ● Distribution across multiple OSD's possible ● RAID 0 implemented ● RAID 5 planned ● Parallel reads/writes
  • 20. OSDC 2012 XtreemFS interfaces ● HTTP ● Read-only ● Read-write planned ● Command line ● All purposes
  • 22. OSDC 2012 XtreemFS – high level summary ● Multi-platform ● Abstraction via UUID ● Communication separation ● Freedom of choice of OSD backend file system ● HPC out of scope
  • 23. OSDC 2012 XtreemFS – HA in general ● One part: OSD ● Replication via policies ● Other part: MRC and DIR ● Local data stored in BabuDB's ● Synchronization via BabuDB methods
  • 24. OSDC 2012 XtreemFS – HA for MRC/DIR ● Master/slave ● Master changes -> log file without buffering ● Log file entries propagation to slaves ● Quorum needed => at least 3 instances ● No automation for DIR ● Synchronization ● in clear text ● Encryption via SSL possible
  • 25. OSDC 2012 XtreemFS OSD replication ● File replication ● Read-only – Since 1.0 – Easy to handle ● Read-write – Only since 1.3 – Later more ● Copies ● Full ● Partial aka on-demand
  • 26. OSDC 2012 XtreemFS r/o replication ● Arbitrary amount of replicas ● Equally treated replicas ● Only OSD local access ● No sync needed ● Use case ● Static files :-) ● Low bandwidth (partial replica) ● Big static files (partial replica)
  • 27. OSDC 2012 XtreemFS r/w replication ● Primary/secondary ● Election on demand with leases ● Read/write access ● First primary ● Propagated to secondaries
  • 28. OSDC 2012 XtreemFS r/w replication - failure ● Secondary ● Behaviour configurable ● Write failure vs. Write on remaining – Quorum needed ● Primary ● Behaviour configurable ● Write failure vs. Write on remaining – Quorum needed
  • 29. OSDC 2012 XtreemFS OSD/replica policies ● OSD selection for new files ● Replica selection for new/additional copies ● Categories: filter, group, sort ● Combination of rules Policy Category Standard OSD filter FQDN based filter, group, sort UUID based filter Data center topology group, sort random sort
  • 30. OSDC 2012 XtreemFS HA summary ● Homework needed for DIR and MRC ● OSD ● Lateness of OSD read-write replication ● OSD Read-only replication – Mature and WAN ready ● Access time improvement via striping ● Flexibility of policies
  • 31. OSDC 2012 XtreemFS encryption ● Not on file system level ● For communication ● Interaction of DIR, MRC and OSD ● Data replication for HA for DIR and/or MRC
  • 32. OSDC 2012 XtreemFS channel encryption ● Via SSL ● PCKS#12 or Java Key Store (JKS) ● Locally stored – service/client certificates – root CA certificates ● Two modes ● All-Or-Nothing approach ● Grid-SSL – just authentication
  • 33. OSDC 2012 XtreemFS secure channel encryption ● Password protection of certificates ● MRC/DIR/OSD: stored service configuration ● Client: via CLI!!
  • 34. OSDC 2012 XtreemFS encryption summary ● Data encryption on POSIX layer? ● SSL obvious choice for TCP/IP channels ● Missing PKI contradicts scalability ● Password protection needs re-design
  • 35. OSDC 2012 Summary ● High self-defined goals ● Some dropped? ● Some partially implemented ● Ok for R&D Labs ● HA and housekeeping improvement needed ● Encryption w/o PKI