© 2016 IBM Corporation
IBM Spectrum Scale Object Metadata Search
An Overview
Agenda
• What is Object Metadata?
• What is Metadata Search?
• Use Cases
• Implementation Details
• Availability
What is Metadata?
• User-defined metadata
• Unique feature of object storage compared to other storage systems
• Swift and S3 metadata are compatible through Swift3 middleware
• Metadata is the structured data about the unstructured object
• Who, what, when, where and why of account, container, object
• Perfect for indexing and searching
Metadata Examples
• Biomedical: Age, Biomarkers, Developmental Stage, Cell Surface Markers, Cell Type/Cell Line, Disease State, Extract Molecule, Genetic Characteristics, Immunoprecipitation Antibody, Organism, Platform, Sex, Strain, Time Point, Tissue Type, Treatment Compound
• Astronomy & Astrophysics
• Geospatial
• Image
• Music
What Swift Metadata Exists and How Do I Use It?
• User metadata can be added to or removed from accounts, containers, and objects
  • E.g., X-Container-Meta-{name}, X-Remove-Container-Meta-{name}
• System metadata also exists; some of it can be set by the user
  • E.g., Content-Type, Last-Modified
• Semantics
  • PUT and POST metadata semantics
    • Account/Container – new user metadata is added to the existing metadata
    • Object – new user metadata overwrites all existing user metadata
  • COPY retains existing metadata unless new metadata is specified
  • HEAD returns metadata only (no object body)
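The POST rules on the Swift metadata slide can be illustrated with a toy in-memory model. This is only a sketch of the stated semantics, not Swift code: the function names are hypothetical, header names are lower-cased purely for convenience, and X-Object-Meta-{name} is Swift's object-level counterpart of the container header shown on the slide.

```python
# Toy model of the user-metadata update rules; no Swift cluster is involved.
CONTAINER_PREFIX = "x-container-meta-"
CONTAINER_REMOVE_PREFIX = "x-remove-container-meta-"
OBJECT_PREFIX = "x-object-meta-"


def post_container_metadata(existing, request_headers):
    """Container POST: new items are merged in, X-Remove-* items are dropped."""
    updated = dict(existing)
    for name, value in request_headers.items():
        lower = name.lower()
        if lower.startswith(CONTAINER_REMOVE_PREFIX):
            key = CONTAINER_PREFIX + lower[len(CONTAINER_REMOVE_PREFIX):]
            updated.pop(key, None)
        elif lower.startswith(CONTAINER_PREFIX):
            updated[lower] = value
    return updated


def post_object_metadata(existing, request_headers):
    """Object POST: the request's user metadata replaces all existing user metadata."""
    # `existing` is deliberately ignored; that is the whole point of the rule.
    return {
        name.lower(): value
        for name, value in request_headers.items()
        if name.lower().startswith(OBJECT_PREFIX)
    }


container = post_container_metadata(
    {"x-container-meta-project": "genomics"},
    {"X-Container-Meta-Owner": "alice"},
)
print(container)  # both 'project' and 'owner' are present

obj = post_object_metadata(
    {"x-object-meta-city": "Rome", "x-object-meta-time": "Day"},
    {"X-Object-Meta-City": "Haifa"},
)
print(obj)  # only 'city' survives; 'time' was overwritten away
```

The practical consequence is that a client updating one object attribute has to resend every user metadata item it wants to keep, whereas a container update can send only the changed item.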
What is Metadata Search?
• Automatically indexes and catalogs Swift user and system metadata
• Provides a REST API for searching for objects based on their metadata
Why is Metadata Search Valuable?
• Imagine the internet without Google
• Swiftly find needles in the OpenStack
• Helps users and administrators perform data analytics
• Metadata can reside on the highest tier (SSD) while data resides on a lower tier (disk/tape)
General Use Cases
• Data mining
• Data warehousing
• Selective data retrieval, data backup, data archival, data migration
• Management/reporting
Sample Use-Cases
Advanced Photo Album
photo1.jpg   City: Rome    Time: Day
photo2.jpg   City: Rome    Time: Night
photo3.jpg   City: Haifa   Time: Day
photo4.jpg   City: Tokyo   Time: Night
GET /MyPhotoSpace?query=city='Rome' AND Time='Day'
GET /MyPhotoSpace?query=time='Night'
* Schematic, not complete syntax
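The two schematic photo album queries can be mimicked with a small in-memory filter. This is a sketch only: the `search` helper is hypothetical, keys are lower-cased for simplicity, and the real query syntax is not given on the slide.

```python
# Photo metadata exactly as listed on the slide, with lower-cased keys.
PHOTOS = {
    "photo1.jpg": {"city": "Rome", "time": "Day"},
    "photo2.jpg": {"city": "Rome", "time": "Night"},
    "photo3.jpg": {"city": "Haifa", "time": "Day"},
    "photo4.jpg": {"city": "Tokyo", "time": "Night"},
}


def search(objects, **criteria):
    """Return names of objects whose metadata matches every criterion (AND)."""
    return sorted(
        name
        for name, meta in objects.items()
        if all(meta.get(key) == value for key, value in criteria.items())
    )


print(search(PHOTOS, city="Rome", time="Day"))  # ['photo1.jpg']
print(search(PHOTOS, time="Night"))             # ['photo2.jpg', 'photo4.jpg']
```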
Metadata Search – Media Use Case
Search query:
GET /MyPhotoSpace?query=tags ~ 'John' AND date > 2/12/2012 AND date < 3/12/2013 AND num_views > 10000
What did we search for?
• Date range search
• Free-text matching
• Integer comparison
* Schematic, not complete syntax
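Since the deck names Elasticsearch as the index backend, one plausible way to express the media query is as an Elasticsearch query body. This is a sketch, not the product's actual translation: the `build_media_query` helper is hypothetical, the field names are taken from the schematic query, and the `M/d/yyyy` date format is an assumption because the slide does not say how its dates should be read.

```python
import json


def build_media_query(tag_text, after, before, min_views, date_format="M/d/yyyy"):
    """Build a bool query: free-text match on tags plus two range filters."""
    return {
        "query": {
            "bool": {
                # Free-text matching: tags ~ 'John'
                "must": [{"match": {"tags": tag_text}}],
                "filter": [
                    # Date range search; `date_format` is an assumed pattern.
                    {"range": {"date": {"gt": after, "lt": before, "format": date_format}}},
                    # Integer comparison: num_views > 10000
                    {"range": {"num_views": {"gt": min_views}}},
                ],
            }
        }
    }


body = build_media_query("John", "2/12/2012", "3/12/2013", 10000)
print(json.dumps(body, indent=2))
```

Putting the range conditions under `filter` keeps them out of relevance scoring, which fits yes/no conditions such as a date window or a view-count threshold.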
What happens behind the scenes?
[Diagram: storage system input data path -> Indexer -> Queue -> Index/Search -> Index DB]
Indexing Objects' Metadata
[Diagram: Swift proxy pipeline (MD Indexer middleware) -> RabbitMQ -> Index/Search -> Elasticsearch]
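The indexing path in this diagram can be sketched as a WSGI middleware that publishes the metadata of successful writes to a queue, with a separate consumer feeding the index. Everything here is a stand-in under stated assumptions: `queue.Queue` replaces RabbitMQ, a plain dict replaces Elasticsearch, and the class and function names are hypothetical rather than the product's code.

```python
import queue

META_PREFIX = "HTTP_X_OBJECT_META_"  # how WSGI exposes X-Object-Meta-* headers


def extract_metadata(environ):
    """Collect user metadata from the WSGI environ, e.g. {'city': 'Rome'}."""
    return {
        key[len(META_PREFIX):].lower(): value
        for key, value in environ.items()
        if key.startswith(META_PREFIX)
    }


class MetadataIndexerMiddleware:
    """Hypothetical middleware: enqueue metadata after a successful PUT/POST."""

    def __init__(self, app, message_queue):
        self.app = app
        self.message_queue = message_queue

    def __call__(self, environ, start_response):
        def capturing_start_response(status, headers, exc_info=None):
            if environ["REQUEST_METHOD"] in ("PUT", "POST") and status.startswith("2"):
                self.message_queue.put(
                    {"path": environ["PATH_INFO"], "metadata": extract_metadata(environ)}
                )
            return start_response(status, headers, exc_info)

        return self.app(environ, capturing_start_response)


def drain_into_index(message_queue, index):
    """Stand-in for the indexer: move queued messages into the index."""
    while not message_queue.empty():
        message = message_queue.get()
        index[message["path"]] = message["metadata"]


def fake_object_server(environ, start_response):
    """Stand-in for the rest of the proxy pipeline; always reports success."""
    start_response("201 Created", [("Content-Length", "0")])
    return [b""]


mq, index = queue.Queue(), {}
app = MetadataIndexerMiddleware(fake_object_server, mq)
app(
    {
        "REQUEST_METHOD": "PUT",
        "PATH_INFO": "/v1/AUTH_demo/MyPhotoSpace/photo1.jpg",
        "HTTP_X_OBJECT_META_CITY": "Rome",
        "HTTP_X_OBJECT_META_TIME": "Day",
    },
    lambda status, headers, exc_info=None: None,
)
drain_into_index(mq, index)
print(index)
```

Placing a queue between the data path and the indexer means a slow or unavailable index only delays indexing and does not hold up object writes.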
Serving Search Requests
[Diagram: Swift proxy pipeline (MD Search middleware) -> Index/Search -> Elasticsearch DB]
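The search path can be sketched the same way: a WSGI middleware that answers GET requests carrying a `query` parameter from the index and passes everything else down the pipeline. This is a sketch under loud assumptions: the index is a plain dict standing in for Elasticsearch, only a single `key=value` equality term is understood, the JSON response shape is invented for illustration, and the class name is hypothetical.

```python
import json
from urllib.parse import parse_qs


class MetadataSearchMiddleware:
    """Hypothetical middleware: serve ?query=key=value from an in-memory index."""

    def __init__(self, app, index):
        self.app = app
        self.index = index  # maps object path -> metadata dict

    def __call__(self, environ, start_response):
        params = parse_qs(environ.get("QUERY_STRING", ""))
        if environ.get("REQUEST_METHOD") != "GET" or "query" not in params:
            # Not a search request: hand it to the rest of the pipeline.
            return self.app(environ, start_response)
        key, _, value = params["query"][0].partition("=")
        matches = sorted(
            path for path, meta in self.index.items() if meta.get(key) == value
        )
        body = json.dumps(matches).encode("utf-8")
        start_response(
            "200 OK",
            [("Content-Type", "application/json"), ("Content-Length", str(len(body)))],
        )
        return [body]


def passthrough(environ, start_response):
    """Stand-in for the rest of the proxy pipeline."""
    start_response("404 Not Found", [("Content-Length", "0")])
    return [b""]


search_index = {
    "/MyPhotoSpace/photo1.jpg": {"city": "Rome"},
    "/MyPhotoSpace/photo3.jpg": {"city": "Haifa"},
}
search_app = MetadataSearchMiddleware(passthrough, search_index)
result = search_app(
    {"REQUEST_METHOD": "GET", "QUERY_STRING": "query=city=Rome"},
    lambda status, headers: None,
)
print(result)  # [b'["/MyPhotoSpace/photo1.jpg"]']
```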
[Figure panels: Spectrum Scale Object Store; Spectrum Scale Object Store w/ Metadata Search]
Availability
• Available via the IBM Spectrum Scale Metadata Search Open Beta (link), which contains a roll-your-own solution:
  • A white paper, to be released, describing how to set up and configure it
  • A source tarball with an easy install tool
• Also available at the IBM Spectrum Scale Beta website (link)
IBM Confidential
