The Secret Life of a DataPhile
Evan Stein
Mark Wainwright
Georgiana Bogdan
Decibel Music Systems
PART 1
A Life in Search
Evan Stein
Decibel Music Systems
The Talk
 Evan Stein: Introduction – Reasons for Decibel as a product
 Mark Wainwright: Technical issues with music meta...
What is Decibel?
 Fact-based metadata system
 Social / buying-based recommendations (e.g., Amazon)
 Sound / mood-based ...
Decibel at work
A bit of history
 Musician
 Library of Congress
 Studies in Musicology
 Switch to computers, thanks to Fernando Pessoa...
Library work
 Classification
 Retrieval (by classification)
 LC / Dewey Decimal, metadata
 Knowledge of the domain is ...
Musicology
 Catalogues and classifications
 Works
 Instruments
 Eras, genres, styles
 Biography
 Ways of thinking ab...
Law enforcement
 Data for hypothesis-formation
 Unknown start and end
 Non-linear search
 Multiple languages, phonetic...
Finance
 Normalisation
 Language
 Workflow
 Currency
 Formulas
Why the British Library?
 Fact based systems are good for research
 You don’t know what the user wants to know until the...
Digitisation
 Information extracted from artefact (record, book, video, etc.)
 Cons
 Possible lack of context and backg...
PART 2
Asking a Lot
Mark Wainwright
Decibel Music Systems
Relational
Database
Graph
Database
Graph Database Features
 Polymorphism More detail without affecting performance
 Recursive Relationships Results are mor...
Album Artist Track
Album has one disc
Album is disc
Track
Sung
By a man
Married To
A woman
Whose Song
Is Performed
Relational Database: Organised by Type
Graph Database: Organised by Association
Graph Database
 Polymorphism More detail without affecting performance
 Recursive Relationships Results are more complet...
PART 3
Metadata Collection and Processing:
A Data Detective’s Investigation
Georgiana Bogdan
Decibel Music Systems
Why do we collect metadata?
KEEP
CALM
AND
JOIN THE
DIGITAL
MUSIC
REVOLUTION!
Source: IFPI Digital Music Report 2014
Why do we collect metadata?
Because few things matter more.
It is crucial for:
 Artists
 Music Listeners
 Music Provide...
Who do we collect metadata for?
Music streaming services
Copyright Collection Society
App Developers
Online radio services...
What metadata do we collect?
 Comprehensive data model
 Graph database for representing and storing data; API for delive...
What metadata do we collect?
Artist
Place
And
Dates of
Birth (and
Death)
Artist
Biogra
phy
Nationa
lity
Relation
ships
Alb...
How do we collect metadata?
 Online legal sources (and the magic of computer programming!)
 Research Team + Editorial Te...
How do we keep our metadata evolving?
 Keeping an eye on emerging music markets – data & content in local languages
 Bei...
http://www.decibel.net
@decibelnet
Evan Stein
evan.stein@decibel.net
Mark Wainwright
mark.wainwright@decibel.net
Georgiana...
The secret life of a dataphile - Keeping Tracks
The secret life of a dataphile - Keeping Tracks
The secret life of a dataphile - Keeping Tracks
The secret life of a dataphile - Keeping Tracks
The secret life of a dataphile - Keeping Tracks
Upcoming SlideShare
Loading in …5
×

The secret life of a dataphile - Keeping Tracks

2,828 views

Published on

A talk in three parts - metadata from a market, data and technical perspective.

Speakers : Decibel Music Systems

As part of the British Library's Keeping Tracks symposium on music and archives in the digital age.

21.03.14

http://britishlibrary.typepad.co.uk/music/2014/04/keeping-tracks.html

Published in: Technology, Education, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,828
On SlideShare
0
From Embeds
0
Number of Embeds
2,285
Actions
Shares
0
Downloads
3
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

The secret life of a dataphile - Keeping Tracks

  1. 1. The Secret Life of a DataPhile Evan Stein Mark Wainwright Georgiana Bogdan Decibel Music Systems
  2. 2. PART 1 A Life in Search Evan Stein Decibel Music Systems
  3. 3. The Talk  Evan Stein: Introduction – Reasons for Decibel as a product  Mark Wainwright: Technical issues with music metadata  Georgiana Bogdan: Metadata collection and processing
  4. 4. What is Decibel?  Fact-based metadata system  Social / buying-based recommendations (e.g., Amazon)  Sound / mood-based recommendations (e.g., EchoNest)  Fact-based navigation (e.g., MusicBrainz, Gracenote)  Data and search provided through an API  White-label services for customers’ products  Navigate collections through linked information  Furnish information, sleeve-note equivalent and file tagging  Repertoire, artist and recording normalisation  Insane level of detail
  5. 5. Decibel at work
  6. 6. A bit of history  Musician  Library of Congress  Studies in Musicology  Switch to computers, thanks to Fernando Pessoa  Manhattan DA  Standard & Poor’s  Decibel
  7. 7. Library work  Classification  Retrieval (by classification)  LC / Dewey Decimal, metadata  Knowledge of the domain is a key to good work  There are human databases walking about
  8. 8. Musicology  Catalogues and classifications  Works  Instruments  Eras, genres, styles  Biography  Ways of thinking about music  Repertoire, theory, performance practice  Sociology, anthropology, psychology, linguistics  Correlation with other art forms  Performers
  9. 9. Law enforcement  Data for hypothesis-formation  Unknown start and end  Non-linear search  Multiple languages, phonetics, semantics  Linkage
  10. 10. Finance  Normalisation  Language  Workflow  Currency  Formulas
  11. 11. Why the British Library?  Fact based systems are good for research  You don’t know what the user wants to know until they want to know it  Data-based thinking allows you to follow your train of thought  Good for navigating collections  Improvements in bandwidth and storage  Personal collections are getting larger  Stores and services are also collections  Library collections are being digitised, and physically smaller
  12. 12. Digitisation  Information extracted from artefact (record, book, video, etc.)  Cons  Possible lack of context and background  Ignores the artefact  Pros  “Good enough” for most uses  Can be consumed anywhere
  13. 13. PART 2 Asking a Lot Mark Wainwright Decibel Music Systems
  14. 14. Relational Database Graph Database
  15. 15. Graph Database Features  Polymorphism More detail without affecting performance  Recursive Relationships Results are more complete  Associative Structure More interesting questions
  16. 16. Album Artist Track
  17. 17. Album has one disc Album is disc
  18. 18. Track Sung By a man Married To A woman Whose Song Is Performed
  19. 19. Relational Database: Organised by Type Graph Database: Organised by Association
  20. 20. Graph Database  Polymorphism More detail without affecting performance  Recursive Relationships Results are more complete  Associative Structure More interesting questions
  21. 21. PART 3 Metadata Collection and Processing: A Data Detective’s Investigation Georgiana Bogdan Decibel Music Systems
  22. 22. Why do we collect metadata? KEEP CALM AND JOIN THE DIGITAL MUSIC REVOLUTION! Source: IFPI Digital Music Report 2014
  23. 23. Why do we collect metadata? Because few things matter more. It is crucial for:  Artists  Music Listeners  Music Providers  Copyright Holders  Music Libraries & Archives
  24. 24. Who do we collect metadata for? Music streaming services Copyright Collection Society App Developers Online radio services Record Labels Other music industry players Digital Music Stores Music distributors
  25. 25. What metadata do we collect?  Comprehensive data model  Graph database for representing and storing data; API for delivering it  Rich data fields; mix of internet sources, research and editorial content
  26. 26. What metadata do we collect? Artist Place And Dates of Birth (and Death) Artist Biogra phy Nationa lity Relation ships Album Release label Track Count Duration Album Contrib utionsRelease Date and Region Genre Release © and ℗ Cover Art Track Genre Mixing Venue and Date Publisher Writer Track ℗ details Participa nts/Artist sMastering Venue and Date Recording Venue and Date Number Performa nce Type Duration ISRC
  27. 27. How do we collect metadata?  Online legal sources (and the magic of computer programming!)  Research Team + Editorial Team. The Right People!  Data Partnerships
  28. 28. How do we keep our metadata evolving?  Keeping an eye on emerging music markets – data & content in local languages  Being aware of the music ecosystem; connecting with the industry players  Directly engaging with music industry professionals. Being social and sociable!
  29. 29. http://www.decibel.net @decibelnet Evan Stein evan.stein@decibel.net Mark Wainwright mark.wainwright@decibel.net Georgiana Bogdan georgiana.bogdan@decibel.net How to contact us?

×