The Future of GlusterFS and Gluster.org John Mark Walker GlusterFS Community Guy Red Hat, Inc. January 25, 2012
The Roots of GlusterFS <ul><ul><li>Distributed storage solutions difficult to find
Decided to write their own
No filesystem experts – Pro & Con
Applied lessons from microkernel architecture </li><ul><li>GNU Hurd </li></ul></ul></ul>
The Roots of GlusterFS <ul><li>All storage solutions were either </li><ul><li>Too expensive. or...
Not scalable, or…
Single purpose, or…
Don’t support legacy apps, or…
Don't support new apps, or...
Do some combo of the above, but not very well </li></ul></ul>
The Roots of GlusterFS <ul><li>The challenge: </li><ul><li>Create a storage system that was… </li><ul><li>Scalable
Seamlessly integrated in the data center
Future-proof </li></ul></ul><li>The solution: GlusterFS </li><ul><li>Scalable, with DHT
POSIX-compliant
Stackable
User-space </li></ul></ul>
GlusterFS Client Architecture <ul><li>Creating a file system in user space </li><ul><li>Utilizes fuse module </li><ul><li>...
No Centralized Metadata Client A Client B Client C Server X Files Extended Attr. Server Y Files Extended Attr. Server Z Fi...
What is a Translator? <ul><li>Add/remove layers
Reorder layers
Move layers between client and server
Implement new layers </li><ul><li>e.g. encryption </li></ul><li>Replace old layers </li><ul><li>e.g. replication </li></ul...
Some Features <ul><li>Distributed, replicated and/or striped volumes
Global namespace
High availability
Geo-replication
Rebalancing
Remove or replace bricks
Self healing
volume profile and top metrics </li></ul>
No one ever expects the Red Hat acquisition
Red Hat Invests in GlusterFS <ul><li>Unstructured data volume to grow 44x by 2020
Cloud and virtualization are driving scale-out storage growth
Upcoming SlideShare
Loading in …5
×

The Future of GlusterFS and Gluster.org

6,892 views

Published on

These are the slides from a webinar I did today (Jan 26, 2012). It's all about where the GlusterFS project stands today and where it's going.

Published in: Technology
1 Comment
5 Likes
Statistics
Notes
  • Although it's a commonplace that GlusterFS storage capacity scales up to petabytes (a million 'GBs' / a quadrillion bytes**) technically speaking, its upper limit is much higher than that: about 72 brontobytes.

    This is approximately one byte for every single cell in the bodies of every single person in the United States.

    (**quadrillion short-scale, not long-scale)
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
No Downloads
Views
Total views
6,892
On SlideShare
0
From Embeds
0
Number of Embeds
1,443
Actions
Shares
0
Downloads
161
Comments
1
Likes
5
Embeds 0
No embeds

No notes for slide

The Future of GlusterFS and Gluster.org

  1. The Future of GlusterFS and Gluster.org John Mark Walker GlusterFS Community Guy Red Hat, Inc. January 25, 2012
  2. The Roots of GlusterFS <ul><ul><li>Distributed storage solutions difficult to find
  3. Decided to write their own
  4. No filesystem experts – Pro & Con
  5. Applied lessons from microkernel architecture </li><ul><li>GNU Hurd </li></ul></ul></ul>
  6. The Roots of GlusterFS <ul><li>All storage solutions were either </li><ul><li>Too expensive. or...
  7. Not scalable, or…
  8. Single purpose, or…
  9. Don’t support legacy apps, or…
  10. Don't support new apps, or...
  11. Do some combo of the above, but not very well </li></ul></ul>
  12. The Roots of GlusterFS <ul><li>The challenge: </li><ul><li>Create a storage system that was… </li><ul><li>Scalable
  13. Seamlessly integrated in the data center
  14. Future-proof </li></ul></ul><li>The solution: GlusterFS </li><ul><li>Scalable, with DHT
  15. POSIX-compliant
  16. Stackable
  17. User-space </li></ul></ul>
  18. GlusterFS Client Architecture <ul><li>Creating a file system in user space </li><ul><li>Utilizes fuse module </li><ul><li>Kernel goes through fuse, which hands off to glusterd </li></ul></ul></ul>Linux kernel Fuse Ext4 glusterd … … Applications
  19. No Centralized Metadata Client A Client B Client C Server X Files Extended Attr. Server Y Files Extended Attr. Server Z Files Extended Attr.
  20. What is a Translator? <ul><li>Add/remove layers
  21. Reorder layers
  22. Move layers between client and server
  23. Implement new layers </li><ul><li>e.g. encryption </li></ul><li>Replace old layers </li><ul><li>e.g. replication </li></ul></ul>FUSE Interface Layer Performance Layer Distribution Layer Replication Layer Protocol Layer Local Filesystem Later
  24. Some Features <ul><li>Distributed, replicated and/or striped volumes
  25. Global namespace
  26. High availability
  27. Geo-replication
  28. Rebalancing
  29. Remove or replace bricks
  30. Self healing
  31. volume profile and top metrics </li></ul>
  32. No one ever expects the Red Hat acquisition
  33. Red Hat Invests in GlusterFS <ul><li>Unstructured data volume to grow 44x by 2020
  34. Cloud and virtualization are driving scale-out storage growth
  35. Scale-out storage shipments to exceed 63,000 PB by 2015 (74% CAGR)
  36. 40% of core cloud spend related to storage
  37. GlusterFS-based solutions up to 50% less than other storage systems </li></ul>
  38. Red Hat Invests in GlusterFS <ul><li>GlusterFS adds to the Red Hat stack </li><ul><li>Complements other Red Hat offerings
  39. Many integration points </li></ul><li>More engineers hacking on GlusterFS than ever before </li></ul>RHEL RHEV Bare Metal Clouds GlusterFS Unified Storage JBoss
  40. Red Hat Invests in GlusterFS <ul><li>Acceleration of community investment </li><ul><li>GlusterFS needs to be “bigger than Red Hat”
  41. Transformation of GlusterFS from product to project </li><ul><li>From “open core” to upstream </li></ul><li>More resources for engineering and community outreach
  42. Red Hat's success rests on economies of scale </li><ul><li>Critical mass of users and developers </li></ul></ul></ul>
  43. Join a Winning Team <ul><li>We're hiring hackers and engineers
  44. Looking for community collaborators </li><ul><li>ISVs, students, IT professionals, fans, et al. </li></ul></ul>“ Join me, and together, we can rule the galaxy...”
  45. The Immediate Future
  46. The Gluster Community <ul><li>300,000+ downloads </li><ul><li>~35,000 /month
  47. >300% increase Y/Y </li></ul><li>1000+ deployments </li><ul><li>45 countries </li></ul><li>2,000+ registered users </li><ul><li>Mailing lists, Forums, etc. </li></ul></ul>Global adoption
  48. The Gluster Community <ul><li>Why are we changing? </li><ul><li>Only 1 non-Red Hat core contributor </li><ul><li>There were 2, but he acquired us </li></ul><li>Want to be the software standard for distributed storage
  49. Want to be more inclusive, more community-driven </li></ul></ul>Goal: create global ecosystem that supports ISVs, service providers and more
  50. Towards “Real” Open Source <ul><li>GlusterFS, prior to acquisition </li><ul><li>“ Open Core”
  51. Tied directly to Gluster products </li><ul><li>No differentiation </li></ul><li>Very little outside collaboration
  52. Contributors had to assign copyright to Gluster </li><ul><li>Discouraged would-be contributors </li></ul></ul></ul>
  53. Towards “Real” Open Source “ Open Core” <ul><ul><li>All engineering controlled by project/product sponsor
  54. No innovation outside of core engineering team
  55. All open source features also in commercial product
  56. Many features in Commercial product not in open source code </li></ul></ul>Commercial Product Open Source Code
  57. Towards “Real” Open Source “ Real” Open Source <ul><ul><li>Many points of collaboration and innovation in open source project
  58. Engineering team from multiple sources
  59. Project and product do not completely overlap
  60. Commercial products are hardened, more secure and thoroughly tested </li></ul></ul>Open Source Code Commercial Products
  61. Towards “Real” Open Source “ Real” Open Source <ul><ul><li>Enables more innovation on the fringes
  62. Engineering team from multiple sources
  63. Open source project is “upstream” from commercial product
  64. “ Downstream” products are hardened, more secure and thoroughly tested </li></ul></ul>Fedora Linux RHEL
  65. Towards “Real” Open Source “ Real” Open Source <ul><ul><li>Enables more innovation on the fringes
  66. Engineering team from multiple sources
  67. Open source project is “upstream” from commercial product
  68. “ Downstream” products are hardened, more secure and thoroughly tested </li></ul></ul>GlusterFS Red Hat Storage
  69. Project Roadmaps
  70. GlusterFS 3.3 ETA in Q2/Q3 2012 What's New in GlusterFS 3.3 <ul><li>New features </li><ul><li>Unified File & Object access
  71. Hadoop / HDFS compatibility </li></ul><li>New Volume Type </li><ul><li>Replicated + striped (+ distributed) volumes </li></ul><li>Enhancements to Distributed volumes (DHT translator) </li><ul><li>Rebalance can migrate open files
  72. Remove-brick can migrate data to remaining bricks </li></ul><li>Enhancements to Replicated volumes (AFR translator) </li><ul><li>Change replica count on an active volume, add replication to distribute-only volumes
  73. Granular locking – Much faster self-healing for large files
  74. Proactive self-heal process starts without FS stat
  75. Round-trip reduction for lower latency
  76. Quorum enforcement - avoid split brain scenarios </li></ul></ul>
  77. File and Object Storage <ul><li>Traditional SAN / NAS support either file or block storage
  78. New storage methodologies implement RESTful APIs over HTTP
  79. Demand for unifying the storage infrastructure increasing
  80. Treats files as objects and volumes as buckets
  81. Available now in 3.3 betas
  82. Soon to be backported to 3.2.x
  83. Contributing to OpenStack project </li><ul><li>Re-factored Swift API </li></ul></ul>
  84. Technology Integrations GlusterFS used as VM storage system <ul><ul><li>Pause and re-start VM’s, even on another hypervisor
  85. HA and DR for VM’s
  86. Faster VM deployment
  87. V-motion –like capability </li></ul></ul>Shared storage ISOs and appliances <ul><ul><li>oVirt / RHEV
  88. CloudStack
  89. OpenStack </li></ul></ul>Goal: The standard for cloud storage OpenStack Imaging Services Unified File & Object Storage … Compute API Layer Mobile Apps. Web Clients. Enterprise Software Ecosystem
  90. HDFS/Hadoop Compatibility <ul><li>HDFS compatibility library </li><ul><li>Simultaneous file and object access within Hadoop </li></ul><li>Benefits </li><ul><li>Legacy app access to MapReduce applications
  91. Enables data storage consolidation </li></ul><li>Simplify and unify storage deployments
  92. Provide users with file level access to data
  93. Enable legacy applications to access data via NFS </li><ul><li>Analytic apps can access data without modification </li></ul></ul>
  94. The Gluster Community <ul><li>What is changing? </li><ul><li>HekaFS / CloudFS being folded into Gluster project </li><ul><li>HekaFS == GlusterFS + multi-tenancy and SSL for auth and data encryption
  95. HekaFS.org
  96. ETA ~9 months </li></ul></ul></ul>
  97. What else?
  98. GlusterFS Advisory Board <ul><li>Advisory board </li><ul><li>Consists of industry and community leaders from Facebook, Citrix, Fedora, and OpenStack </li><ul><li>Richard Wareing, Storage Engineer, Facebook
  99. Jeff Darcy, Filesystem Engineer, Red Hat; Founder, HekaFS Project
  100. AB Periasamy, Co-Founder, GlusterFS project
  101. Ewan Mellor, Xen Engineer, Citrix; Member, OpenStack project
  102. David Nalley, CloudStack Community Mgr; Fedora Advisory Board
  103. Louis Zuckerman, Sr. System Administrator, Picture Marketing
  104. Joe Julian, Sr. System Administrator, Ed Wyse Beauty Products
  105. Greg DeKoenigsberg, Community VP, Eucalyptus; co-founder, Fedora
  106. John Mark Walker, Gluster.org Community Guy (Chair) </li></ul></ul></ul>
  107. Gluster.org Web Site <ul><li>Services for users and developers </li><ul><li>Developer section with comprehensive docs
  108. Collaborative project hosting
  109. Continuing development of end user documentation and interactive tools </li></ul><li>Published roadmaps </li><ul><li>Transparent feature development </li></ul></ul>
  110. GlusterFS Downloads <ul><li>Where's the code? </li><ul><li>GlusterFS 3.3 </li><ul><li>Simultaneous file + object
  111. HDFS compatibility
  112. Improved self-healing + VM hosting </li><ul><li>Granular locking </li></ul><li>Beta 3 due Feb/Mar 2012
  113. http://download.gluster.org/pub/gluster/glusterfs </li></ul></ul></ul>
  114. Gluster.org Services <ul><li>Gluster.org </li><ul><li>Portal into all things GlusterFS </li></ul><li>Community.gluster.org </li><ul><li>Self-support site; Q&A; HOWTOs; tutorials </li></ul><li>Patch review, CI </li><ul><li>review.gluster.com </li></ul><li>#gluster </li><ul><li>IRC channel on Freenode </li></ul></ul>
  115. Development Process <ul><li>Source code </li><ul><li>Hosted at github.com/gluster </li></ul><li>Bugs and Feature Requests </li><ul><li>Bugzilla.redhat.com – select GlusterFS from menu </li></ul><li>Patches </li><ul><li>Submit via Gerritt at review.gluster.com </li></ul><li>See Development Work Flow doc: </li><ul><li>gluster.org/community/documentation/index.php/Development_Work_Flow </li></ul></ul>
  116. Thank You <ul><li>GlusterFS contacts </li><ul><li>Gluster.org/interact/mailinglists
  117. @RedHatStorage & @GlusterOrg
  118. #gluster on Freenode </li></ul><li>My contact info </li><ul><li>[email_address]
  119. Twitter & identi.ca: @johnmark </li></ul></ul>

×