State of the Gluster - LCEU

20160401 guster-roadmap

Published in: Technology

  1. The State of the Gluster Community. John Mark Walker, Gluster Community Guy. November 8, 2012
  2. 11/08/12 Topics ● What is GlusterFS ● GlusterFS 3.3 ● Ways to Access GlusterFS Volumes ● GlusterFS client (FUSE) ● Libgfapi – QEMU block device driver ● Translators ● Swift API – Unified File and Object ● The Roadmap
  3. Simple Economics ● Simplicity, scalability, less cost [Diagram labels: Multi-Tenant, Virtualized, Automated, Commoditized, Scale on Demand, In the Cloud, Scale Out, Open Source]
  4. What is Open Source: “Open Core” [Diagram labels: Open Source Code, Commercial Product] ● All engineering controlled by project/product sponsor ● No innovation outside of core engineering team ● All open source features also in commercial product ● Many features in commercial product not in open source code
  5. What is Open Source: “Real” Open Source [Diagram labels: Open Source Code, Commercial Products] ● Many points of collaboration and innovation in open source project ● Engineering team from multiple sources ● Project and product do not completely overlap ● Commercial products are hardened, more secure and thoroughly tested
  6. What is Open Source: “Real” Open Source [Diagram labels: GlusterFS, Red Hat Storage] ● Enables more innovation on the fringes ● Engineering team from multiple sources ● Open source project is “upstream” from commercial product ● “Downstream” products are hardened, more secure and thoroughly tested
  7. Simplicity Bias ● FC, FCoE, iSCSI → HTTP, Sockets ● Modified BSD OS → Linux / User Space / C, Python & Java ● Appliance based → Application based
  8. Scale-out Open Source is the winner
  9. [Photo captions: Bengaluru Office, Conference Room, US Head Office, Bengaluru Office]
  10. Community Deployments
  11. Not a Storage Company ● At first a cluster-building company ● Engineering team excelled at building open source HPC systems
  12. Necessity: The Mother of Invention
  13. The big idea: Storage should be simple
  14. What is Simple Storage? ● Low-risk, easy to deploy and administer, data consistency, open source, software-only, user space
  15. What is GlusterFS, Really? Gluster is a unified, distributed storage system ● User space, global namespace, stackable, POSIX-y, scale-out NAS platform, inspired by GNU Hurd
  16. Some Features ● No single point of failure ● DHT ● Synchronous and asynchronous replication ● Proactive self-healing
  17. What Can You Do With It? ● Media – Docs, Photos, Video ● Shared storage – multi-tenant environments ● Big Data – Log Files, RFID Data ● Objects – Long Tail Data
  18. Standard Deployment ● Distributed over multiple servers ● Replicate volumes ● On top of a disk file system with xattr support (e.g. XFS, ext4) ● Multi-protocol access
  19. Storage for Any Environment: Scale-out NAS for On-premises and Public Clouds ● Standardized NAS infrastructure ● On-premises and public cloud ● POSIX-ish ● Apps move easily between environments ● Replicate between both [Diagram labels: Public Cloud, On-premises]
  20. First Versions ● Toolkit for building storage systems ● Very hacker-friendly ● Community integral part of development – Drove feature development – Repeatable use cases
  21. Mid-2011 Snapshot ● Scale-out NAS ● Distributed and replicated ● NFS, CIFS and native GlusterFS ● User-space, stackable architecture ● Lots of users, not many devs → A good platform to build on
  22. GlusterFS 3.3: Building on the Foundation ● Granular locking ● Proactive self-healing ● Improved rebalancing ● More access methods
  23. Granular Locking – Server fails, comes back – Files evaluated – Block-by-block until healed [Diagram: Server 1 and Server 2, each running GlusterFS; Virtual Disks 1-1, 1-2, 2-1, 2-2; blocks compared]
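
The block-by-block healing on the Granular Locking slide can be illustrated with a small sketch. This is a toy model, not GlusterFS's self-heal code: the `heal` function, the tiny `BLOCK_SIZE`, and the use of SHA-256 to compare blocks are all assumptions made for illustration. The point is only that a recovered replica needs to copy the blocks that differ, not the whole file, so a real implementation could lock one block at a time.

```python
import hashlib

BLOCK_SIZE = 4  # hypothetical, tiny on purpose so the demo is readable


def split_blocks(data: bytes, size: int = BLOCK_SIZE) -> list:
    """Cut a byte string into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def heal(good: bytes, stale: bytes, size: int = BLOCK_SIZE):
    """Rebuild the stale copy from the good copy, block by block.

    Returns the healed bytes and the indices of the blocks that had to be
    copied from the good replica. Blocks that already match are left alone.
    """
    good_blocks = split_blocks(good, size)
    stale_blocks = split_blocks(stale, size)
    healed, copied = [], []
    for index, good_block in enumerate(good_blocks):
        stale_block = stale_blocks[index] if index < len(stale_blocks) else None
        same = (
            stale_block is not None
            and hashlib.sha256(stale_block).digest()
            == hashlib.sha256(good_block).digest()
        )
        if same:
            healed.append(stale_block)
        else:
            healed.append(good_block)  # only this block crosses the wire
            copied.append(index)
    return b"".join(healed), copied


healed, copied = heal(b"aaaabbbbcccc", b"aaaaXXXXcccc")
print(healed, copied)
```

Only the middle block differs in this example, so only that block is copied.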
  24. Proactive Self-healing – Performed server-to-server – Recovered node queries peers [Diagram labels: hidden Symlinks 1–3; Files 1–3; Server 1 (good), Server 2 (recovered), Server 3 (good), Server 4 (good); Replicated; Distributed; Self-healing]
  25. Easier Rebalancing – Now faster ● Previously, created an entirely new hash set, moving data unnecessarily ● Now recreates the hash map and compares it to the old one – Easier to decommission server nodes – Proof point for synchronous translator API
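
The "recreate the hash map and compare to the old one" idea from the rebalancing slide can be sketched as follows. This is a minimal illustration under loud assumptions: the `place` function uses MD5 and a simple modulo, which is not GlusterFS's actual DHT layout (a modulo scheme in particular relocates more files than a range-based layout would), and `plan_rebalance` is a hypothetical helper name. It shows only the comparison step: compute each file's old and new location and schedule a move when they differ.

```python
import hashlib


def place(name: str, servers: list) -> str:
    """Pick a server for a file by hashing its name (toy placement rule)."""
    digest = hashlib.md5(name.encode("utf-8")).digest()
    return servers[int.from_bytes(digest[:4], "big") % len(servers)]


def plan_rebalance(files: list, old_servers: list, new_servers: list) -> dict:
    """Compare old and new placement; return moves only for files that changed."""
    moves = {}
    for name in files:
        old = place(name, old_servers)
        new = place(name, new_servers)
        if old != new:
            moves[name] = (old, new)
    return moves


files = [f"file{i}" for i in range(100)]
moves = plan_rebalance(files, ["server1", "server2"], ["server1", "server2", "server3"])
print(f"{len(moves)} of {len(files)} files need to move")
```

Decommissioning a node is the same comparison with the node removed from `new_servers`.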
  26. Unified File and Object (UFO) – S3, Swift-style object storage – Access via UFO or Gluster mount [Diagram labels: Client, Proxy, Account, Container, Object, HTTP Request ID=/dir/sub/sub2/file, Directory, Volume, File, Client, NFS or GlusterFS Mount]
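
The UFO diagram pairs object-storage terms (Account, Container, Object) with file-system terms (Volume, Directory, File). A rough sketch of that correspondence is below. It assumes an account maps to a volume, a container to a directory, and an object to a file under it; the function name `object_to_file_path` and the mount root `/mnt/gluster` are hypothetical and not part of any GlusterFS or Swift API.

```python
from pathlib import PurePosixPath


def object_to_file_path(account: str, container: str, obj: str,
                        mount_root: str = "/mnt/gluster") -> str:
    """Map an (account, container, object) triple to a path on a mounted volume.

    Assumed mapping: account -> volume, container -> directory, object -> file.
    Leading slashes are stripped from the object name so it cannot reset the path.
    """
    return str(PurePosixPath(mount_root) / account / container / obj.lstrip("/"))


print(object_to_file_path("vol1", "dir", "sub/sub2/file"))
```

Under this mapping, the same data is reachable as an object over HTTP and as a regular file through an NFS or GlusterFS mount.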
  27. Unified File and Object (UFO) – Your gateway to the cloud – Your data, accessed your way
  28. HDFS Compatibility – Run MapReduce jobs on GlusterFS – Add unstructured data to Hadoop [Diagram labels: Hadoop Server, GlusterFS nodes, Local Disk, GlusterFS HDFS Connector (Jar file)]
  29. Section 4: Coming Attractions
  30. API Check ● Ways to interface with GlusterFS – Translators (stackable, async and sync) – FUSE mount (GlusterFS client) – Libgfapi (FUSE bypass)
  31. API Check ● Ways to interface with GlusterFS – Marker framework (geo-replication, quickly ID changes) – UFO RESTful API – HDFS library – Management API (oVirt 3.1)
  32. Better VM Image Handling – Better responsiveness for random I/O use cases – Contribution: Block Device Translator
  33. Enabling GlusterFS for Virtualization Use ● QEMU-GlusterFS integration ● Native integration, no FUSE mount ● Gluster as QEMU block back end ● QEMU talks to Gluster, and Gluster hides the different image formats and storage types underneath ● Block device support in GlusterFS via Block Device translator ● Logical volumes as VM images
  34. GlusterFS & QEMU
  35. Libglusterfs Client API – Previously abandoned – Brought back to life, in part because of QEMU FUSE bypass contributions
  36. Split Brain – Nodes cannot see each other, but can all still write – Often due to network outages – Sometimes results in conflicts – Up to 3.2, GlusterFS had no concept of “quorum”
  37. Quorum Enforcement – Which node has valid data? – If quorum, keep writing, else stop ● Configurable option [Diagram: Server 1, Server 2, Server 3 and a broken connection; labels: no quorum, stops writing; quorum, keeps writing]
  38. Quorum Enforcement – After connection restored, self-heal kicks off [Diagram: Replicas 1–3, labeled either quorum (keeps writing) or no quorum (stops writing); self-heal]
  39. Enhanced Quorum – Quorum tracking on the servers – Need quorum for any management changes – Third-party arbiters / observers so never N=2
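
The "if quorum, keep writing, else stop" rule from the quorum slides can be sketched as a strict-majority check. This is an illustrative model only, not GlusterFS's quorum implementation or its configuration options; `has_quorum` and `write_decisions` are hypothetical names, and the partition scenario is made up for the example.

```python
def has_quorum(visible_replicas: int, total_replicas: int) -> bool:
    """A node has quorum when it can see a strict majority of replicas (itself included)."""
    return visible_replicas * 2 > total_replicas


def write_decisions(partitions: list) -> dict:
    """Given groups of nodes that can still see each other, decide who may keep writing."""
    total = sum(len(group) for group in partitions)
    return {
        node: has_quorum(len(group), total)
        for group in partitions
        for node in group
    }


# A broken connection isolates one of three servers.
print(write_decisions([["server1"], ["server2", "server3"]]))

# With only two replicas, a split leaves neither side with a majority.
print(write_decisions([["server1"], ["server2"]]))
```

The second case shows why the Enhanced Quorum slide wants a third-party arbiter or observer: with N=2, any partition stops writes on both sides.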
  40. Management UI & REST API – Collaboration with oVirt project – Management GUI for admins – RESTful gateway for devs – First community release... ?
  41. 41. 11/08/12
  42. Multi-tenancy & Encryption – HekaFS created this for cloud deployments – In-flight data encryption
  43. Down the Road – Multi-master geo-rep – Snapshots – Versioning – Geo-rep sparse replicas – File compression & de-dupe
  44. Server-side Processing – Implementing gfind, glocate – Fast traversal of metadata in xattrs (responsive find and locate) – Inotify-esque behavior: triggers based on I/O activity, e.g. file close ● Why rely on Hadoop batch processing?
  45. Goal: Intelligent Storage ● Just storing and retrieving data is not enough ● Should be able to store, analyze, transform, mutilate, and retrieve ● Intelligent storage gives sysadmins and developers the ultimate data Swiss Army knife
  46. Thank you! John Mark Walker, Gluster Community Guy, johnmark@redhat.com
