SlideShare a Scribd company logo
1 of 38
Working with Volumes
  7/6/2012

© 2012 MapR Technologies   Volumes 1
Working with Volumes
   Agenda
   • Volumes
   • Exercise: Volumes
   • Snapshots
   • Exercise: Snapshots & Schedules
   • Mirrors
   • Exercise: Mirrors & Schedules
   • Best Practices



© 2012 MapR Technologies      Volumes 2
Working with Volumes
   Objectives
   At the end of this module you will be able to:
   • Explain how volumes relate to the MapR storage architecture
   • Describe a typical volume layout
   • Create, modify, move and set permissions on a volume
   • Explain how MapR snapshots work and how they are different from other
     types of snapshots
   • Create a snapshot and a snapshot schedule
   • Describe mirror volumes
   • Create a mirror volume and a mirror sync schedule


© 2012 MapR Technologies          Volumes 3
Volumes




© 2012 MapR Technologies    Volumes 4
What is a Volume?

                                            Logical unit of storage with
                                             policy for:
                                             –   Replication factor
                                             –   Ownership
                                             –   Data protection
                                             –   Data placement
                                            Contains directories and files
                                            Made of containers
                                            Unique to the MapR distribution


                            Volumes help you manage data
© 2012 MapR Technologies            Volumes 5
Volumes Let You Manage Data


                                      Backup and load balancing
                                       –   All or just a portion
                                      Point-in-time recovery
                                      Data placement
                                      Ownership/Permissions on
                                       Volumes




© 2012 MapR Technologies   Volumes 6
Storage Architecture


                                      Nodes
                                      Disks
                                      Storage Pools
                                      Containers
                                         –   Distributed across cluster
                                         –   16(default)-32 GB




© 2012 MapR Technologies     Volumes 7
Replication

                                               1

                    1                              2

           3               2

                               1   4

               1                               2

      2        4

                               3
                                               3
                                                           1       2       3
                    4                      4
                                                       4       5       6
                                                           7       8       9


© 2012 MapR Technologies               Volumes 8
Typical Volume Layout

                                           /




      /binaries            /hbase      /projects   /users    /var/mapr




        /build             /test        /mjones    /jsmith     local...



                  Recommendation: set up a volume per user
© 2012 MapR Technologies            Volumes 9
Applying Policy

                                Volume permissions
                                Quota
                                Topology
                                Replication Factor
                                Snapshots
                                Schedules
                                Mirrors




© 2012 MapR Technologies       Volumes 10
Permissions


                              Delegate all or a subset

                               –   dump – Dump the volume
                               –   restore – Mirror or restore the volume
                               –   m – Modify properties, create/delete snapshots
                               –   d – Delete the volume
                               –   fc – Full control (admin and ACL change)



        Volume permissions—not file and directory permissions

© 2012 MapR Technologies            Volumes 11
Quota



                                       Volume Quotas:
                                        –   Hard quota
                                        –   Advisory quota




                           Limited also by per-user quotas

© 2012 MapR Technologies               Volumes 12
Topology

                               /
                           /rack1
                                                Physical topology describes cluster
                                                 layout
       /rack1      /rack2       /rack3
                                                 –   Generally corresponds to racks
                                                     (or power drops)
                                                Volume topology specifies what
                                                 physical topology the volume uses
                                                 –   Expressed as a path
                                                 –   Limits placement of data




                                         Topology is a label

© 2012 MapR Technologies                    Volumes 13
Replication Factor


                                      Desired
                                       –   Re-replicate after configurable
                                           timeout
                                      Minimum
                                       –   Re-replicate immediately




                       Determined by data protection needs

© 2012 MapR Technologies          Volumes 14
Exercise:
                           Volumes



© 2012 MapR Technologies    Volumes 15
Creating a Volume
                                               maprcli volume create
                                                -name name
                                                -mount 1
                                                -path path




                      Name and mount path can be different
© 2012 MapR Technologies          Volumes 16
Setting Permissions
                                              maprcli acl edit
                                               -name <name>
                                               -type volume
                                               -user <user>




                           Use acl edit not acl set

© 2012 MapR Technologies         Volumes 17
Modifying a Volume
                                         maprcli volume modify
                                          -name name
                                          -quota quota




© 2012 MapR Technologies    Volumes 18
Moving a Volume
                                             maprcli volume move
                                              -name name
                                              -topology path




© 2012 MapR Technologies        Volumes 19
Snapshots




© 2012 MapR Technologies    Volumes 20
What is a Snapshot?

                                   Point-in-time image of a volume
                                   Guards against error replication
                                   No initial space penalty
                                   In streaming data, no space
                                    penalty at all!
                                   Nearly instantaneous




                     Supports Recovery Point Objective (RPO)

© 2012 MapR Technologies          Volumes 21
Snapshot Architecture


                                    Snapshot
                           Current View of redirected
                           New writes Snapshot
                             Previous area the Data
                              Original to write
                              Continuing toVolume
                               Writing




© 2012 MapR Technologies          Volumes 22
Where Do Snapshots Go?


                              The .snapshot directory
                               –   Top level of every volume
                               –   Does not appear in ls
                                   (to prevent recursion)
                               –   Same path via NFS or hadoop shell




© 2012 MapR Technologies           Volumes 23
Schedule


                               Collection of rules
                               Makes things happen
                               Applied to a volume:
                                –   Normal: snapshot
                                –   Mirror: sync
                               Specifies how long to keep
                                snapshots




© 2012 MapR Technologies    Volumes 24
Exercise:
                    Snapshots & Schedules



© 2012 MapR Technologies   Volumes 25
Creating a Snapshot

                            maprcli volume snapshot create
                             -volume name
                             -snapshotname name




© 2012 MapR Technologies      Volumes 26
Creating a Schedule
                             maprcli schedule create
                                -schedule
                                     '{"name":"Schedule-1",
                                     "rules":
                                           [{"frequency":"weekly",
                                           "date":"sun",
                                           "time":"7”,
                                           "retain":"2w"}]}'




© 2012 MapR Technologies      Volumes 27
Applying a Schedule
                                          maprcli schedule list
                                          maprcli volume modify
                                           -name name
                                           -schedule ID




© 2012 MapR Technologies     Volumes 28
Mirrors




© 2012 MapR Technologies   Volumes 29
What is a Mirror?


                                    A full, read-only copy of a volume
                                    Use cases:
                                     –   Remote backup
                                     –   Research/production
                                     –   Load balancing
                                     –   Cascading mirrors to distribute data




                             Can be mounted or not…
© 2012 MapR Technologies          Volumes 30
Navigating Mirrors

                      mapr.cluster.root             //            /’/’

                Volume “a” at /a       X/a/a             a’
                                                         /a’   Mirror (mounted or not)



                                 /b
                                  /b            b’
                                                /b’


                           /c
                            /c          c’
                                         /c’

             X/d /d              d’
                                 /d’             d’’
                                                /d’’


© 2012 MapR Technologies               Volumes 31
Writing to a Mirror Anyway


                              The .rw directory
                     .rw       –   Top level of the cluster
                               –   Does not appear in ls
                                   (to prevent recursion)
                               –   Same path via NFS or hadoop shell




© 2012 MapR Technologies           Volumes 32
Exercise:
                           Mirrors & Schedules



© 2012 MapR Technologies         Volumes 33
Creating a Mirror
                                         maprcli volume create
                                             -name name
                                             -source name
                                             -type 1
                                             -schedule ID




© 2012 MapR Technologies        Volumes 34
Syncing a Mirror
      Two ways to sync:         maprcli volume mirror start
                                 -name mirror


                                maprcli volume mirror push
                                -name source

                                             …and of course you can
                                              start the mirror in the
                                              MapR Control System
                                              too.

© 2012 MapR Technologies        Volumes 35
Best Practices




© 2012 MapR Technologies      Volumes 36
Best Practices

     Set up a /data topology and put all nodes and volumes in it
     Set up a /decommisioning topology with nothing in it
      –   Moving nodes to /decommissioning migrates data off when needed
     If desired, set up a /cldb topology to isolate CLDB data
     Set up a volume per user or project




© 2012 MapR Technologies          Volumes 37
Questions




© 2012 MapR Technologies   Volumes 38

More Related Content

What's hot

What's New and Upcoming in HDFS - the Hadoop Distributed File System
What's New and Upcoming in HDFS - the Hadoop Distributed File SystemWhat's New and Upcoming in HDFS - the Hadoop Distributed File System
What's New and Upcoming in HDFS - the Hadoop Distributed File SystemCloudera, Inc.
 
TriHUG - Beyond Batch
TriHUG - Beyond BatchTriHUG - Beyond Batch
TriHUG - Beyond Batchboorad
 
MapReduce Container ReUse
MapReduce Container ReUseMapReduce Container ReUse
MapReduce Container ReUseHortonworks
 
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, ClouderaHadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, ClouderaCloudera, Inc.
 
Gluster Webinar May 25: Whats New in GlusterFS 3.2
Gluster Webinar May 25: Whats New in GlusterFS 3.2Gluster Webinar May 25: Whats New in GlusterFS 3.2
Gluster Webinar May 25: Whats New in GlusterFS 3.2GlusterFS
 
How to Increase Performance of Your Hadoop Cluster
How to Increase Performance of Your Hadoop ClusterHow to Increase Performance of Your Hadoop Cluster
How to Increase Performance of Your Hadoop ClusterAltoros
 
CCNxCon 2012: Session #7: Off-Path Caching in CCN
CCNxCon 2012: Session #7: Off-Path Caching in CCNCCNxCon 2012: Session #7: Off-Path Caching in CCN
CCNxCon 2012: Session #7: Off-Path Caching in CCNPARC, a Xerox company
 
Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...
Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...
Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...xKinAnx
 
Nn ha hadoop world.final
Nn ha hadoop world.finalNn ha hadoop world.final
Nn ha hadoop world.finalHortonworks
 
An introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAn introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAlessio Villardita
 
Strata + Hadoop World 2012: HDFS: Now and Future
Strata + Hadoop World 2012: HDFS: Now and FutureStrata + Hadoop World 2012: HDFS: Now and Future
Strata + Hadoop World 2012: HDFS: Now and FutureCloudera, Inc.
 
TSM og replikering
TSM og replikering TSM og replikering
TSM og replikering Solv AS
 
Ambari Meetup: NameNode HA
Ambari Meetup: NameNode HAAmbari Meetup: NameNode HA
Ambari Meetup: NameNode HAHortonworks
 

What's hot (20)

What's New and Upcoming in HDFS - the Hadoop Distributed File System
What's New and Upcoming in HDFS - the Hadoop Distributed File SystemWhat's New and Upcoming in HDFS - the Hadoop Distributed File System
What's New and Upcoming in HDFS - the Hadoop Distributed File System
 
TriHUG - Beyond Batch
TriHUG - Beyond BatchTriHUG - Beyond Batch
TriHUG - Beyond Batch
 
MapReduce Container ReUse
MapReduce Container ReUseMapReduce Container ReUse
MapReduce Container ReUse
 
HDF-EOS APIs, tools, etc.
HDF-EOS APIs, tools, etc.HDF-EOS APIs, tools, etc.
HDF-EOS APIs, tools, etc.
 
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, ClouderaHadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
 
Gluster Webinar May 25: Whats New in GlusterFS 3.2
Gluster Webinar May 25: Whats New in GlusterFS 3.2Gluster Webinar May 25: Whats New in GlusterFS 3.2
Gluster Webinar May 25: Whats New in GlusterFS 3.2
 
How to Increase Performance of Your Hadoop Cluster
How to Increase Performance of Your Hadoop ClusterHow to Increase Performance of Your Hadoop Cluster
How to Increase Performance of Your Hadoop Cluster
 
Hitachi Data Services. Business Continuity
Hitachi Data Services. Business ContinuityHitachi Data Services. Business Continuity
Hitachi Data Services. Business Continuity
 
CCNxCon 2012: Session #7: Off-Path Caching in CCN
CCNxCon 2012: Session #7: Off-Path Caching in CCNCCNxCon 2012: Session #7: Off-Path Caching in CCN
CCNxCon 2012: Session #7: Off-Path Caching in CCN
 
NFS and ODBC
NFS and ODBCNFS and ODBC
NFS and ODBC
 
Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...
Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...
Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...
 
Database backup 110810
Database backup 110810Database backup 110810
Database backup 110810
 
What is HDF-EOS?
What is HDF-EOS?What is HDF-EOS?
What is HDF-EOS?
 
Nn ha hadoop world.final
Nn ha hadoop world.finalNn ha hadoop world.final
Nn ha hadoop world.final
 
An introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAn introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale Computers
 
Strata + Hadoop World 2012: HDFS: Now and Future
Strata + Hadoop World 2012: HDFS: Now and FutureStrata + Hadoop World 2012: HDFS: Now and Future
Strata + Hadoop World 2012: HDFS: Now and Future
 
San Admin
San AdminSan Admin
San Admin
 
TSM og replikering
TSM og replikering TSM og replikering
TSM og replikering
 
HDF-EOS Development: Current Status and Tools
HDF-EOS Development: Current Status and ToolsHDF-EOS Development: Current Status and Tools
HDF-EOS Development: Current Status and Tools
 
Ambari Meetup: NameNode HA
Ambari Meetup: NameNode HAAmbari Meetup: NameNode HA
Ambari Meetup: NameNode HA
 

Similar to 50a volumes

80a disaster recovery
80a disaster recovery80a disaster recovery
80a disaster recoverymapr-academy
 
70a monitoring & troubleshooting
70a monitoring & troubleshooting70a monitoring & troubleshooting
70a monitoring & troubleshootingmapr-academy
 
Greenplum Database on HDFS
Greenplum Database on HDFSGreenplum Database on HDFS
Greenplum Database on HDFSDataWorks Summit
 
Design, Scale and Performance of MapR's Distribution for Hadoop
Design, Scale and Performance of MapR's Distribution for HadoopDesign, Scale and Performance of MapR's Distribution for Hadoop
Design, Scale and Performance of MapR's Distribution for Hadoopmcsrivas
 
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?  Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You? EMC
 
41a managing services
41a managing services41a managing services
41a managing servicesmapr-academy
 
Next Generation Hadoop: High Availability for YARN
Next Generation Hadoop: High Availability for YARN Next Generation Hadoop: High Availability for YARN
Next Generation Hadoop: High Availability for YARN Arinto Murdopo
 
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...lucenerevolution
 
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...lucenerevolution
 
Greenplum Database Overview
Greenplum Database Overview Greenplum Database Overview
Greenplum Database Overview EMC
 
Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010Gavin Heavyside
 
State of Resource Management in Big Data
State of Resource Management in Big DataState of Resource Management in Big Data
State of Resource Management in Big DataKhalid Ahmed
 
State of Resource Management in Big Data
State of Resource Management in Big DataState of Resource Management in Big Data
State of Resource Management in Big DataYong Feng
 

Similar to 50a volumes (20)

22 configuration
22 configuration22 configuration
22 configuration
 
80a disaster recovery
80a disaster recovery80a disaster recovery
80a disaster recovery
 
70a monitoring & troubleshooting
70a monitoring & troubleshooting70a monitoring & troubleshooting
70a monitoring & troubleshooting
 
Greenplum Database on HDFS
Greenplum Database on HDFSGreenplum Database on HDFS
Greenplum Database on HDFS
 
HBase with MapR
HBase with MapRHBase with MapR
HBase with MapR
 
48a tuning
48a tuning48a tuning
48a tuning
 
Design, Scale and Performance of MapR's Distribution for Hadoop
Design, Scale and Performance of MapR's Distribution for HadoopDesign, Scale and Performance of MapR's Distribution for Hadoop
Design, Scale and Performance of MapR's Distribution for Hadoop
 
Spark
SparkSpark
Spark
 
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?  Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
 
41a managing services
41a managing services41a managing services
41a managing services
 
13c planning
13c planning13c planning
13c planning
 
Next Generation Hadoop: High Availability for YARN
Next Generation Hadoop: High Availability for YARN Next Generation Hadoop: High Availability for YARN
Next Generation Hadoop: High Availability for YARN
 
12a architecture
12a architecture12a architecture
12a architecture
 
Introduction to map reduce
Introduction to map reduceIntroduction to map reduce
Introduction to map reduce
 
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
 
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
 
Greenplum Database Overview
Greenplum Database Overview Greenplum Database Overview
Greenplum Database Overview
 
Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010
 
State of Resource Management in Big Data
State of Resource Management in Big DataState of Resource Management in Big Data
State of Resource Management in Big Data
 
State of Resource Management in Big Data
State of Resource Management in Big DataState of Resource Management in Big Data
State of Resource Management in Big Data
 

More from mapr-academy

42 lab-managing services
42 lab-managing services42 lab-managing services
42 lab-managing servicesmapr-academy
 
3 map r installation & setup administration course description
3 map r installation & setup administration course description3 map r installation & setup administration course description
3 map r installation & setup administration course descriptionmapr-academy
 

More from mapr-academy (7)

53 lab-nfs
53 lab-nfs53 lab-nfs
53 lab-nfs
 
51 lab-volumes
51 lab-volumes51 lab-volumes
51 lab-volumes
 
42 lab-managing services
42 lab-managing services42 lab-managing services
42 lab-managing services
 
14 lab-planing
14 lab-planing14 lab-planing
14 lab-planing
 
20a installation
20a installation20a installation
20a installation
 
10c introduction
10c introduction10c introduction
10c introduction
 
3 map r installation & setup administration course description
3 map r installation & setup administration course description3 map r installation & setup administration course description
3 map r installation & setup administration course description
 

Recently uploaded

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 

Recently uploaded (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

50a volumes

  • 1. Working with Volumes 7/6/2012 © 2012 MapR Technologies Volumes 1
  • 2. Working with Volumes Agenda • Volumes • Exercise: Volumes • Snapshots • Exercise: Snapshots & Schedules • Mirrors • Exercise: Mirrors & Schedules • Best Practices © 2012 MapR Technologies Volumes 2
  • 3. Working with Volumes Objectives At the end of this module you will be able to: • Explain how volumes relate to the MapR storage architecture • Describe a typical volume layout • Create, modify, move and set permissions on a volume • Explain how MapR snapshots work and how they are different from other types of snapshots • Create a snapshot and a snapshot schedule • Describe mirror volumes • Create a mirror volume and a mirror sync schedule © 2012 MapR Technologies Volumes 3
  • 4. Volumes © 2012 MapR Technologies Volumes 4
  • 5. What is a Volume?  Logical unit of storage with policy for: – Replication factor – Ownership – Data protection – Data placement  Contains directories and files  Made of containers  Unique to the MapR distribution Volumes help you manage data © 2012 MapR Technologies Volumes 5
  • 6. Volumes Let You Manage Data  Backup and load balancing – All or just a portion  Point-in-time recovery  Data placement  Ownership/Permissions on Volumes © 2012 MapR Technologies Volumes 6
  • 7. Storage Architecture  Nodes  Disks  Storage Pools  Containers – Distributed across cluster – 16(default)-32 GB © 2012 MapR Technologies Volumes 7
  • 8. Replication 1 1 2 3 2 1 4 1 2 2 4 3 3 1 2 3 4 4 4 5 6 7 8 9 © 2012 MapR Technologies Volumes 8
  • 9. Typical Volume Layout / /binaries /hbase /projects /users /var/mapr /build /test /mjones /jsmith local... Recommendation: set up a volume per user © 2012 MapR Technologies Volumes 9
  • 10. Applying Policy  Volume permissions  Quota  Topology  Replication Factor  Snapshots  Schedules  Mirrors © 2012 MapR Technologies Volumes 10
  • 11. Permissions  Delegate all or a subset – dump – Dump the volume – restore – Mirror or restore the volume – m – Modify properties, create/delete snapshots – d – Delete the volume – fc – Full control (admin and ACL change) Volume permissions—not file and directory permissions © 2012 MapR Technologies Volumes 11
  • 12. Quota  Volume Quotas: – Hard quota – Advisory quota Limited also by per-user quotas © 2012 MapR Technologies Volumes 12
  • 13. Topology / /rack1  Physical topology describes cluster layout /rack1 /rack2 /rack3 – Generally corresponds to racks (or power drops)  Volume topology specifies what physical topology the volume uses – Expressed as a path – Limits placement of data Topology is a label © 2012 MapR Technologies Volumes 13
  • 14. Replication Factor  Desired – Re-replicate after configurable timeout  Minimum – Re-replicate immediately Determined by data protection needs © 2012 MapR Technologies Volumes 14
  • 15. Exercise: Volumes © 2012 MapR Technologies Volumes 15
  • 16. Creating a Volume maprcli volume create -name name -mount 1 -path path Name and mount path can be different © 2012 MapR Technologies Volumes 16
  • 17. Setting Permissions maprcli acl edit -name <name> -type volume -user <user> Use acl edit not acl set © 2012 MapR Technologies Volumes 17
  • 18. Modifying a Volume maprcli volume modify -name name -quota quota © 2012 MapR Technologies Volumes 18
  • 19. Moving a Volume maprcli volume move -name name -topology path © 2012 MapR Technologies Volumes 19
  • 20. Snapshots © 2012 MapR Technologies Volumes 20
  • 21. What is a Snapshot?  Point-in-time image of a volume  Guards against error replication  No initial space penalty  In streaming data, no space penalty at all!  Nearly instantaneous Supports Recovery Point Objective (RPO) © 2012 MapR Technologies Volumes 21
  • 22. Snapshot Architecture Snapshot Current View of redirected New writes Snapshot Previous area the Data Original to write Continuing toVolume Writing © 2012 MapR Technologies Volumes 22
  • 23. Where Do Snapshots Go?  The .snapshot directory – Top level of every volume – Does not appear in ls (to prevent recursion) – Same path via NFS or hadoop shell © 2012 MapR Technologies Volumes 23
  • 24. Schedule  Collection of rules  Makes things happen  Applied to a volume: – Normal: snapshot – Mirror: sync  Specifies how long to keep snapshots © 2012 MapR Technologies Volumes 24
  • 25. Exercise: Snapshots & Schedules © 2012 MapR Technologies Volumes 25
  • 26. Creating a Snapshot maprcli volume snapshot create -volume name -snapshotname name © 2012 MapR Technologies Volumes 26
  • 27. Creating a Schedule maprcli schedule create -schedule '{"name":"Schedule-1", "rules": [{"frequency":"weekly", "date":"sun", "time":"7”, "retain":"2w"}]}' © 2012 MapR Technologies Volumes 27
  • 28. Applying a Schedule maprcli schedule list maprcli volume modify -name name -schedule ID © 2012 MapR Technologies Volumes 28
  • 29. Mirrors © 2012 MapR Technologies Volumes 29
  • 30. What is a Mirror?  A full, read-only copy of a volume  Use cases: – Remote backup – Research/production – Load balancing – Cascading mirrors to distribute data Can be mounted or not… © 2012 MapR Technologies Volumes 30
  • 31. Navigating Mirrors mapr.cluster.root // /’/’ Volume “a” at /a X/a/a a’ /a’ Mirror (mounted or not) /b /b b’ /b’ /c /c c’ /c’ X/d /d d’ /d’ d’’ /d’’ © 2012 MapR Technologies Volumes 31
  • 32. Writing to a Mirror Anyway  The .rw directory .rw – Top level of the cluster – Does not appear in ls (to prevent recursion) – Same path via NFS or hadoop shell © 2012 MapR Technologies Volumes 32
  • 33. Exercise: Mirrors & Schedules © 2012 MapR Technologies Volumes 33
  • 34. Creating a Mirror maprcli volume create -name name -source name -type 1 -schedule ID © 2012 MapR Technologies Volumes 34
  • 35. Syncing a Mirror Two ways to sync: maprcli volume mirror start -name mirror maprcli volume mirror push -name source …and of course you can start the mirror in the MapR Control System too. © 2012 MapR Technologies Volumes 35
  • 36. Best Practices © 2012 MapR Technologies Volumes 36
  • 37. Best Practices  Set up a /data topology and put all nodes and volumes in it  Set up a /decommisioning topology with nothing in it – Moving nodes to /decommissioning migrates data off when needed  If desired, set up a /cldb topology to isolate CLDB data  Set up a volume per user or project © 2012 MapR Technologies Volumes 37
  • 38. Questions © 2012 MapR Technologies Volumes 38