SlideShare a Scribd company logo
1 of 18
Download to read offline
Large-Scale Capture of Producer-Defined Musical
Semantics
Ryan Stables
School of Digital Media Technology
Birmingham City University
Problem...
Problem Definition
Producer:
Audio effects parameters
usually refer to low-level
attributes.
Professionally produced audio
often requires extensive
training.
Researcher:
Lack of semantically annotated
music production datasets.
How can we map low-level
descriptors to perceived
muscial timbre?
Problem Definition
Descriptors need to represent
the views of music producers.
These may change with genre,
musical instruments, etc...
Various terms may be used to
define similar things (colour,
texture etc...)
Project Aims
1. Gather large amounts of semantics data during the music
creation/production process.
Develop a series of DAW plug-ins.
Extract information and anonymously upload it to a server.
2. Identify correlation and patterns in the semantics data.
3. Use the data to improve/aid music production tasks.
Model...
Project overview
Server
Descriptor name...
Save...Load...
Save...
Semantic Descriptor
Parameter Space
Feature Set
Pre/Post Gain
Analysis...
Natural Language
Processing
Dimensionality
Reduction
Etc...
(1)
(2)(3)
Figure : Schematic Overview of the Semantic Audio Feature Extraction Project.
(1) Plug-in interface
Parameters can be set
experimentally.
Semantic descriptors to be
stored in text field.
Descriptors can be loaded
through same interface.
Parameters are stored and/or
set.
Figure : Semantic Audio plug-in: Multi-band distortion
(2) Feature Extraction
Features are extracted from the
selected region.
The parameter space is stored.
Semantic descriptors are sent
as targets.
Additional metadata is sent, if
available.
Server
Descriptor name...
Save...Load...
Save...
Semantic Descriptor
Parameter Space
Feature Set
Pre/Post Gain
Analysis...
Natural Language
Processing
Dimensionality
Reduction
Etc...
Figure : Stored attributes.
(3) Mapping
NLP Algorithms to identify
semantic correlation.
Dimensionality reduction to
find correlation in
features/parameters.
Additional data partitions
based on metadata (Genre,
instrument, etc...)
Results sent back to user
plug-in.
Server
Descriptor name...
Save...Load...
Sav
Semantic
Paramete
Feature Se
Pre/
Analysis...
Natural Language
Processing
Dimensionality
Reduction
Etc...
Figure : Results processing
Design Constraints...
Architecture
Requirements:
Maximisation of user-base.
Transparency: Access to the processing chain.
Design decisions:
Stand-alone plug-ins.
MultiFX.
Plug-ins within a plug-in.
Analysis-only.
Other:
Free field vs. fixed word.
Before and after.
Metadata pane.
Analysis framework
LibXtract.
Hard-coded, C library.
Around 400 combined
audio features*.
[Bullock, 2007]
Vamp.
Plug-in within a plug-in.
Hosts LibXtract features,
amongst others.
[Cannam et al., 2006]
Mini-Project...
Mini-Project: Aims
Analyse the production requirements of musicians.
Birmingham Conservatoire
The Music Producers Guild
The Birmingham Music Network
Build a series of prototype systems for the collection of
musical semantics data.
Use these systems to collect data from a small group of
musicians during the production process.
Evaluate the results in order to identify a suitable system for
future research.
Demonstrate the feasibility of a wider research project in this
area.
Mini Project: Schematic
Plug-in
development
Interface design
Algorithm
Development
Server, network,
data distribution
User Testing Data
Aquisition
Results
Analysis
Figure : Schematic Overview of the Mini-Project.
Positions and Timescale
2 x PhD Students: 1 x C4DM (QMUL) & 1 x DMT (BCU).
3 x Advisory roles.
Timescale: 6-months from September 2013.
Future: collaborative grant application.
Thanks!
ryan.stables@bcu.ac.uk
References
Bullock, J. (2007).
Libxtract: A lightweight library for audio feature extraction.
In Proceedings of the International Computer Music
Conference, volume 43.
Cannam, C., Landone, C., Sandler, M. B., and Bello, J. P.
(2006).
The sonic visualiser: A visualisation platform for semantic
descriptors from musical signals.
In ISMIR, pages 324–327.

More Related Content

Similar to Large-Scale Capture of Producer-Defined Musical Semantics - Ryan Stables (Semantic Media @ The British Library, 23 September 2013)

Miniproject audioenhancement-100223094301-phpapp02
Miniproject audioenhancement-100223094301-phpapp02Miniproject audioenhancement-100223094301-phpapp02
Miniproject audioenhancement-100223094301-phpapp02
mohankota
 
Ig2 task 1 work sheet 12378
Ig2 task 1 work sheet 12378Ig2 task 1 work sheet 12378
Ig2 task 1 work sheet 12378
CallumDrakeCPFC
 
Ian definitions 3rd try 2
Ian definitions 3rd try 2Ian definitions 3rd try 2
Ian definitions 3rd try 2
thomasmcd6
 
Ian definitions 3rd try 2
Ian definitions 3rd try 2Ian definitions 3rd try 2
Ian definitions 3rd try 2
thomasmcd6
 
IG2 Task 1 Worksheet
IG2 Task 1 WorksheetIG2 Task 1 Worksheet
IG2 Task 1 Worksheet
SamDuxburyGDS
 
U-Boot community analysis
U-Boot community analysisU-Boot community analysis
U-Boot community analysis
xulioc
 
Ig2 task 1 work sheet
Ig2 task 1 work sheetIg2 task 1 work sheet
Ig2 task 1 work sheet
luisfvazquez1
 
Ig2 task 1 work sheet newi9979
Ig2 task 1 work sheet newi9979Ig2 task 1 work sheet newi9979
Ig2 task 1 work sheet newi9979
CallumDrakeCPFC
 

Similar to Large-Scale Capture of Producer-Defined Musical Semantics - Ryan Stables (Semantic Media @ The British Library, 23 September 2013) (20)

Mini Project- Audio Enhancement
Mini Project- Audio EnhancementMini Project- Audio Enhancement
Mini Project- Audio Enhancement
 
Miniproject audioenhancement-100223094301-phpapp02
Miniproject audioenhancement-100223094301-phpapp02Miniproject audioenhancement-100223094301-phpapp02
Miniproject audioenhancement-100223094301-phpapp02
 
Mini Project- Audio Enhancement
Mini Project-  Audio EnhancementMini Project-  Audio Enhancement
Mini Project- Audio Enhancement
 
Ig2 task 1 work sheet 12378
Ig2 task 1 work sheet 12378Ig2 task 1 work sheet 12378
Ig2 task 1 work sheet 12378
 
MIDP: Music and Sound
MIDP: Music and SoundMIDP: Music and Sound
MIDP: Music and Sound
 
Ian definitions 3rd try 2
Ian definitions 3rd try 2Ian definitions 3rd try 2
Ian definitions 3rd try 2
 
Multilevel Audio Descriptors @WWW09 develtrack
Multilevel Audio Descriptors @WWW09 develtrackMultilevel Audio Descriptors @WWW09 develtrack
Multilevel Audio Descriptors @WWW09 develtrack
 
Ian definitions 3rd try 2
Ian definitions 3rd try 2Ian definitions 3rd try 2
Ian definitions 3rd try 2
 
1– Introduction To Direct Show
1– Introduction To  Direct Show1– Introduction To  Direct Show
1– Introduction To Direct Show
 
IG2 Task 1 Worksheet
IG2 Task 1 WorksheetIG2 Task 1 Worksheet
IG2 Task 1 Worksheet
 
Automatic Music Generation Using Deep Learning
Automatic Music Generation Using Deep LearningAutomatic Music Generation Using Deep Learning
Automatic Music Generation Using Deep Learning
 
Sound Recording Glossary
Sound Recording GlossarySound Recording Glossary
Sound Recording Glossary
 
TAA eLearn Course - Presentation Week 3
TAA eLearn Course - Presentation Week 3TAA eLearn Course - Presentation Week 3
TAA eLearn Course - Presentation Week 3
 
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
 
U-Boot community analysis
U-Boot community analysisU-Boot community analysis
U-Boot community analysis
 
Ig2 task 1 work sheet
Ig2 task 1 work sheetIg2 task 1 work sheet
Ig2 task 1 work sheet
 
Ig2 task 1 work sheet newi9979
Ig2 task 1 work sheet newi9979Ig2 task 1 work sheet newi9979
Ig2 task 1 work sheet newi9979
 
Sound Recording Glossary Improved Version
Sound Recording Glossary   Improved VersionSound Recording Glossary   Improved Version
Sound Recording Glossary Improved Version
 
Mpeg 7 slides
Mpeg 7 slidesMpeg 7 slides
Mpeg 7 slides
 
Ig2 task 1 work sheet
Ig2 task 1 work sheetIg2 task 1 work sheet
Ig2 task 1 work sheet
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

Large-Scale Capture of Producer-Defined Musical Semantics - Ryan Stables (Semantic Media @ The British Library, 23 September 2013)

  • 1. Large-Scale Capture of Producer-Defined Musical Semantics Ryan Stables School of Digital Media Technology Birmingham City University
  • 3. Problem Definition Producer: Audio effects parameters usually refer to low-level attributes. Professionally produced audio often requires extensive training. Researcher: Lack of semantically annotated music production datasets. How can we map low-level descriptors to perceived muscial timbre?
  • 4. Problem Definition Descriptors need to represent the views of music producers. These may change with genre, musical instruments, etc... Various terms may be used to define similar things (colour, texture etc...)
  • 5. Project Aims 1. Gather large amounts of semantics data during the music creation/production process. Develop a series of DAW plug-ins. Extract information and anonymously upload it to a server. 2. Identify correlation and patterns in the semantics data. 3. Use the data to improve/aid music production tasks.
  • 7. Project overview Server Descriptor name... Save...Load... Save... Semantic Descriptor Parameter Space Feature Set Pre/Post Gain Analysis... Natural Language Processing Dimensionality Reduction Etc... (1) (2)(3) Figure : Schematic Overview of the Semantic Audio Feature Extraction Project.
  • 8. (1) Plug-in interface Parameters can be set experimentally. Semantic descriptors to be stored in text field. Descriptors can be loaded through same interface. Parameters are stored and/or set. Figure : Semantic Audio plug-in: Multi-band distortion
  • 9. (2) Feature Extraction Features are extracted from the selected region. The parameter space is stored. Semantic descriptors are sent as targets. Additional metadata is sent, if available. Server Descriptor name... Save...Load... Save... Semantic Descriptor Parameter Space Feature Set Pre/Post Gain Analysis... Natural Language Processing Dimensionality Reduction Etc... Figure : Stored attributes.
  • 10. (3) Mapping NLP Algorithms to identify semantic correlation. Dimensionality reduction to find correlation in features/parameters. Additional data partitions based on metadata (Genre, instrument, etc...) Results sent back to user plug-in. Server Descriptor name... Save...Load... Sav Semantic Paramete Feature Se Pre/ Analysis... Natural Language Processing Dimensionality Reduction Etc... Figure : Results processing
  • 12. Architecture Requirements: Maximisation of user-base. Transparency: Access to the processing chain. Design decisions: Stand-alone plug-ins. MultiFX. Plug-ins within a plug-in. Analysis-only. Other: Free field vs. fixed word. Before and after. Metadata pane.
  • 13. Analysis framework LibXtract. Hard-coded, C library. Around 400 combined audio features*. [Bullock, 2007] Vamp. Plug-in within a plug-in. Hosts LibXtract features, amongst others. [Cannam et al., 2006]
  • 15. Mini-Project: Aims Analyse the production requirements of musicians. Birmingham Conservatoire The Music Producers Guild The Birmingham Music Network Build a series of prototype systems for the collection of musical semantics data. Use these systems to collect data from a small group of musicians during the production process. Evaluate the results in order to identify a suitable system for future research. Demonstrate the feasibility of a wider research project in this area.
  • 16. Mini Project: Schematic Plug-in development Interface design Algorithm Development Server, network, data distribution User Testing Data Aquisition Results Analysis Figure : Schematic Overview of the Mini-Project.
  • 17. Positions and Timescale 2 x PhD Students: 1 x C4DM (QMUL) & 1 x DMT (BCU). 3 x Advisory roles. Timescale: 6-months from September 2013. Future: collaborative grant application. Thanks! ryan.stables@bcu.ac.uk
  • 18. References Bullock, J. (2007). Libxtract: A lightweight library for audio feature extraction. In Proceedings of the International Computer Music Conference, volume 43. Cannam, C., Landone, C., Sandler, M. B., and Bello, J. P. (2006). The sonic visualiser: A visualisation platform for semantic descriptors from musical signals. In ISMIR, pages 324–327.