MogileFS Architecture

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Notes on slide 1

    Voici la première déclinaison produit de notre stratégie Open Storage avec la série Sun Storage 7000 Unified Storage System. Cette offre se compose de trois modèles offrant de 2 à 576 To de volumètrie brut. Ces modèles sont le S7110, le S7210 et enfin le S7410 qui existe également en version cluster.

    Voici la première déclinaison produit de notre stratégie Open Storage avec la série Sun Storage 7000 Unified Storage System. Cette offre se compose de trois modèles offrant de 2 à 576 To de volumètrie brut. Ces modèles sont le S7110, le S7210 et enfin le S7410 qui existe également en version cluster.

    Voici la première déclinaison produit de notre stratégie Open Storage avec la série Sun Storage 7000 Unified Storage System. Cette offre se compose de trois modèles offrant de 2 à 576 To de volumètrie brut. Ces modèles sont le S7110, le S7210 et enfin le S7410 qui existe également en version cluster.

    LIVEJOURNAL

    Merci de votre attention

    Favorites, Groups & Events

    MogileFS Architecture - Presentation Transcript

    1. > MogileFS Architecture Philippe Julio – Principal Field Technologist Sun Microsystems France November, 2009
    2. Data Management Vision « The Data are not created relevant, they become so ! » November, 2009 2
    3. What Is MogileFS ? MogileFS is an open source distributed filesystem, flexible and high available on a network of commodity hardware MogileFS is an anagram for "OMG Files" and was created for LiveJournal to handle the storage, replication and retrieval of the large amount of file uploads. November, 2009 3
    4. Data Management Data-Driven on-Line Websites • To run the apps : messages, posts, blog entries, video clips, maps, web graph... • To give the data context : friends networks, social networks, collaborative filtering... • To keep the applications running : web logs, system logs, system metrics, database query logs... November, 2009 4
    5. Sun's Cloud Computing Strategy Products Expertise Q-layer and and Products Expertise Technologies andServices and Technologies Services xVM OpenOpen Partners Community Communities November, 2009 5
    6. Sun's Open Source Strategy Developer Preference User Preference Value Proposition ● More core developers • Free to use ● Business Deployment ● More deploying • More platform choice ● Sun's target market developers • More suppliers ● Binary distribution ● More partners • Larger user community ● Pay for value November, 2009 6
    7. New Data Management Economics Compute Trend Data Trend New Analytics Emerge Semi-structured Data (MapReduce...) (MogileFS, Bigtable, HDFS...) Master/Slave Semi- Object Store structured Architectural shift to the cloud Database and HPC-style workloads ScaleDB, Big Table, SimpleDB HBase Master/Master Open source, general purpose datawarehouse Distributed FS Federated/ Proprietary, dedicated Sharded datawarehouse Unstructured Structured Data Data OLTP is the datawarehouse November, 2009 7
    8. Changing Software Economics 1998 2008 2009 November, 2009 8
    9. Who Used MogileFS ? • Danga Interactive project • Six appart acquisition 220 million pages view / Month 5 billion pages view / Month November, 2009 9
    10. How to Interface your Application with MogileFS ? • Implement “Save As Cloud...” to Write to the cloud • Implement “Open From Cloud...” to Read from the cloud • MogileFS 2.32 http://search.cpan.org/~bradfitz/mogilefs-server • Documentation Guide http://www.danga.com/mogilefs • Develop your application with MogileFS : http://mogilefs.pbworks.com/ObtainingMogileFS November, 2009 10
    11. OpenSolaris for MogileFS Network AutoMagic Automated Install Security D-Light Image Packaging Containers System ZFS DTrace Time Slider CIFS Predictive Self Healing Distribution Solaris Virtualization Constructor Technology Open Storage COMSTAR November, 2009 11
    12. MogileFS Architecture Infrastructure as a Services General Purpose Storage Servers • Combine server with disk & networking • Specialized software enables general purpose systems designs to provide high performance data services Sun's Open Platform direction Data moves to the infrastructure Sun Fire x4xxx (Data Compute and Store) Sun Sparc Enterprise T5xxx (Data Compute and Store) Sun Storage 7xxx (Data Store) November, 2009 12
    13. MogileFS Architecture Sizing • Application Servers sizing > Number of concurrent low users > Number of concurrent medium users > Number of concurrent high users • MogileFS Servers sizing > Useable Data Volume : Customer data volume (customer need) > Number of replication blocks : 2 minimum (3 recommanded) > 2 CPU quad-core > 16 GB RAM minimum > Work Data Volume (metadata, namespace...) > Raw Data Volume (Useable Data Volume * Nb of replication blocks) + Work Data Volume > Number of cluster nodes Max (Number of replication blocks, (Number of Trackers + Number of Strorage Nodes)) • No RAID factor, No HBA port November, 2009 13
    14. MogileFS Architecture x86 Components Low Cost Server & Storage : Sun Fire x4xxx Interface Sun Virtualization Technologie : Solaris Containers Performance • NFS • HTTP • WebDAV Sun Fire x4540 ● 2 CPU Quad Core AMD Sun Fire x4240 ● Up to 64 GB RAM ● 2 CPU Quad Core AMD 48 x 1TB SATA II 7200 RPM Sun Fire x4140 ● ● Up to 128GB RAM ● 4 RU ● 2 CPU Quad Core AMD ● Up to 4,67 TB Disks ● Up to 128 GB RAM ● 16 x 300 GB SAS 10000 RPM ● Up to 2,34 TB Disks ● 2 RU ● 8 x 300 GB SAS 10000 RPM ● 1 RU Capacity November, 2009 14
    15. MogileFS Architecture CMT Components Low Cost Server : Sun Sparc Enterprise T5xxx Interface Sun Virtualization Technologies : Solaris Containers, LDoms Performance • NFS • HTTP Sun Enterprise T5240 • .WebDAV ● 2 CPU Octo Core UST2+ ● Up to 256 GB RAM Sun Enterprise T5440 ● Up to 4,68 TB Disks ● 2 CPU Octo Ccore UST2+ ● 16 x 300 GB SAS 10000 RPM ● Up to 512GB RAM ● 2 RU Sun Blade T6340 ● Up to 1,17 TB Disks ● 2 CPU Octo Core UST2+ ● 4 x 300 GB SAS 10000 RPM ● Up to 256 GB RAM ● 4 RU ● Up to 600 GB Disk ● 2 x 300 GB 10000 RPM ● 10 RU Capacity November, 2009 15
    16. MogileFS Architecture Open Storage Components Low Cost Storage : Sun Storage 7xxx OpenStorage Interfaces Performance • NFS • WebDAV • HTTP • D-Trace Alalytics Sun Storage 7410 ● 16GB, 64GB and 128GB Sun Storage 7210 RAM options ● 32GB and 64GB RAM options ● Up to 288TB Sun Storage 7110 ● Up to 142TB ● Hybrid Storage Pool I/O ● 8GB RAM ● Hybrid Storage Pool I/O Acceleration ● 14 x 300 GB SAS 10K Acceleration ● Read/Write Flash/SSD options RPM Drives ● Write Flash/SSD options for higher performance ● Up to 4, 2TB for higher performance Capacity November, 2009 16
    17. MogileFS Architecture Key Enablers • A scalable, Fault tolerant, High performance distributed file system • No Single Point of Failure Tracker Storage Node • Automatic file replication (3 replications recommanded) • Better than RAID Host1 Host4 • Flat NameSpace • Share-Nothing • No RAID required Tracker Storage Node • Local filesystem agnostic • Tracker client transfert (mogilefsd) - Replication -- Deletion - Query - Reaper - Monitor Host2 Host5 • Files are broken up and spread over the Storage Node (mogstored) HTTP and WebDAV server • Database MySQL stores the MogileFS metadata (the Database Storage Node namespace, and which files are where) • Client Library : Ruby, Perl, Java, Python, PHP… Host3 Host6 November, 2009 17
    18. MogileFS Architecture Overview database Client Library Tracker Tracker Storage Node Storage Node November, 2009 18
    19. MogileFS Architecture World Wide IP Cluster HA Storage Node Tracker Storage Node Storage Node Tracker Storage Node Storage Node Storage Node Tracker Database Storage Node Hybrid Server High Avalability High Scalability (compute and storage) with load balancing and by adding for all Cluster nodes Cluster Storage Nodes November, 2009 19
    20. MogileFS Architecture Software Components • MogileFS • Memcached • Perlbal > Is an open source > Is a high-performance, > Is Perl-based reverse proxy distributed filesystem distributed memory object load balancer and web server > 3 components : (storage caching system, generic in > Is a single-threaded event- nodes, trackers, and the nature, but intended for use in based server supporting database speeding up dynamic web HTTP load balancing, web applications by alleviating serving, and a mix of the two > Local disks on storage database load nodes can be formatted > dropped the database load to > The management interface with your filesystem of provides extremely detailed choice (ZFS, NFS...) almost nothing, yielding faster and powerful statistics in page load times for users, addition to runtime better resource utilization, and configuration faster access to the databases on a memcache > Supports the concept of miss having per-service (and global) plugins that can override many parts of request handling and behavior. November, 2009 20
    21. MogileFS Architecture High Availability MogileFS Architecture Database Tracker Storage Storage Storage Single Point Single Point Of Failure 0 Of Failure 1 2 3 4 (SPOF) (SPOF) MogileFS High Availability Architecture Database Tracker Tracker Tracker Storage Storage Storage Storage Active Passive Cluster 0 0' Cluster 1 2 3 4 5 6 7 Node Node (Metadata) 2 Trackers nodes for HA Cluster 4 Storage Nodes November, 2009 and more than 2 for Load Balancing 21
    22. MogileFS Architecture High Availability 2 x HD 300 GB 2 x HD 300 GB Active Database node Passive Database Node + Metadata + Metadata Solaris Cluster Sun StorageTek Array Disks Mirroring RAID1 12 Disks SAS 146 GB November, 2009 22
    23. MogileFS Architecture x86 Reference Architecture Cloud Storage Model Sun Fire x4140 Application Servers ● 2 CPU Quad Core AMD ● 32 GB RAM SF x4140 SF x4140 ● 2 x 300 GB SAS 10000 RPM SF x4140 SF x4140 SF x4540 Sun Fire 4540 ● 2 CPU Quad Core AMD ● 32 GB RAM Database Tracker ● 48 x 1TB SATA II 7200 RPM Storage Node Interfaces • WebDAV OpenSolaris on each server • NFS MogileFS on each cluster node • HTTP November, 2009 23
    24. MogileFS Architecture CMT Reference Architecture Cloud Storage Model Sun Enterprise T5240 Application Servers ● 2 CPU Octo Core UST2+ ● 32 GB RAM SE T5240 SE T5240 ● 2 x 300 GB SAS 10000 RPM SE T5240 SE T5240 SE T5240 Sun Enterprise T5240 ● 2 CPU Oc*to Core UST2+ Database Tracker Storage Node ● 32 GB RAM ● 16 x 300 GB 10000 RPM Interfaces • WebDAV OpenSolaris on each server • NFS MogileFS on each cluster node • HTTP November, 2009 24
    25. MogileFS Architecture Open Storage Reference Architecture SF x4140 Cloud Storage Model Database Sun Fire x4140 Tracker ● 2 CPU Quad Core ● 32 GB RAM Tracker ● 4 x 300 GB SAS 10000 RPM Tracker OpenStorage Storage Node Switch Storage Node Storage Node Sun Storage 7410 ● 64 GB RAM Storage Node ● 288 TB Interfaces Hybrid Storage Pool I/O WebDAV ● Storage Node Acceleration • ● Read/Write Flash/SSD • NFS Storage Node options for higher performance • HTTP Storage Node OpenSolaris on each server MogileFS on each cluster node Performances analysis with D-Trace Analytics on Sun Storage 7xxx November, 2009 25
    26. MogileFS Architecture Pricing Model - Example • Utility pay-as-you-go pricing model, competitive to market • Based on capacity > Volume of data used > Volume of data transfer > ... • Easy sign-up and self provisioning November, 2009 26
    27. MogileFS Infrastructure Value Proposition Manage terabytes of • Mean processing time • TECHNOLOGY / METHODOLOGY CMT, AMD and INTEL Processors KEY PERFORMANCE INDICATORS CUSTOMER REFERENCE • Volume of data used storage with little effort • Multi OS : Linux, Solaris 10 • Volume of data transfer Objective : manage terabytes of storage • Open Solaris • Number of requests with little effort • Sun Cloud Storage • Number of Storage nodes Solution : more hundred of servers • Open Storage : Sun Storage 7000 Unified Storage • Number of Trackers Customer Benefit : this client was able System • Return On Investment to utilize hundreds of machines • Sun Fire x4540 (Hybrid Server with 48TB) concurrently. No captial expense for • Total Cost of Ownership equipment. Ability to deploy in hours not • Infratsructure SwaP ratio • MogileFS is a distributed applications with high in months density of data • Use rate of equipment environment • Time to deploy a new service • MySQL is the database for MogileFS metadata storage • Time to Market • Memcached is a high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load • Perlbal is a reverse proxy for load balancer and web server • Solaris Containers • Sun xVM Ops Center for patching and provisonning • Cloud Computing Workshop SERVICES • Solaris Cluster • Sun Cloud Services (compute, storage) • Product Deployment Services • Solaris ZFS • Sun Learning Services • Sun Managed Services • Sun Support Services • Sun Global Financial Services Operation November, 2009 27
    28. Philippe Julio philippe.julio@sun.com http://blogs.sun.com/philippejulio
    SlideShare Zeitgeist 2009

    + Sun MicrosystemsSun Microsystems Nominate

    custom

    256 views, 0 favs, 1 embeds more stats

    MogileFS is an open source distributed file system. more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 256
      • 246 on SlideShare
      • 10 from embeds
    • Comments 0
    • Favorites 0
    • Downloads 11
    Most viewed embeds
    • 10 views on http://blogs.sun.com

    more

    All embeds
    • 10 views on http://blogs.sun.com

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories