SlideShare a Scribd company logo
1 of 27
Download to read offline
Decoding Digital Audio
Visualizing and Annotating
Linear Time-Based Media
Phil Desenne
Center	
  for	
  Hellenic	
  Studies,	
  	
  
Harvard	
  University
May 8, 2015
To decode or interpret audio is to
explain the meaning
or
understanding
of something about it
Relevant for Research, Teaching and Learning
across all disciplines
Decode –> Interpret
Curiosity - Discovery - Interpretation - Research
Amateurs - Learners - Educators - Scholars
Each one decodes audio in their own realm
the process transcends realms and roles
}
}
Songs
Music
Voice recordings
Field recordings
Lyrics / Transcription / Translation
Notes / Musicology / Ethnomusicology
Oral History / Languages / Speech Therapy
Bioacoustics Research / Anthropology
Audio Visualize and Annotate
{}
Listen Decode
In our day-to-day we are constantly, filtering, decoding
and attaching meaning in our brain to daily sound
bites that hit our ears
!
digital audio opens a broader spectrum of
decoding possibilities for
research, teaching and learning
Visualization and Annotation of Audio
Lorem ipsum dolor sit amet,
consectetur adipiscing elit.
Nam in augue sodales nisl
pharetra efficitur in in felis. Ut
malesuada justo nec libero
finibus placerat. Donec vitae
enim risus. Nunc eget purus
eget nunc bibendum tempus
hendrerit vel eros. Praesent
mollis diam augue, vel
convallis quam interdum eu.
Transcript
we now have digital tools that facilitate and
enhance the process of decoding, attaching
meaning and understanding
Audio Annotation & Visualizations in
Research
Acoustics / Bioacoustics / Hydroacoustics
Anthropology / Musicology / Ethnomusicology
Speech Therapy
Education / Teaching
Importance of Visualizing Sound:
Human ear does not hear or discern all sounds
www.cochlea.org739 × 252Search by image
Frequency hearing range in man and some common animal
very high-
frequency
sounds
very low-
frequency
sounds
}
visualization –> accessibility
Recording Natural Sounds
www.leaps.ms900 × 396Search by image
You can also use Raven Lite to slow down natural recordings so that the full complexity of a song may be heard. Listen to the complexity in the ending trill ...
Bioacoustics:
detection and interpretation of sounds in animals
Annotating the visual wave form of audio:
Amplitude and Frequency
BIRD SONGS AND CALLS WITH SPECTROGRAMS ( SONOGRAMS ) OF SOUTHERN ...
www.birdsongs.it489 × 396Search by image
Fig . 2 shows a 20 second fragment over 1-minute song sequence of a Cirl Bunting (Emberiza cirlus). Fig. 3
blows up one phrase of the same song and shows ...
Sonograms and Spectrograms of Bird Songs
The Mind's Machine - Chapter 15 A Step Further
www.mindsmachine.com800 × 619Search by image
(a) These sonograms show the typical adult song patterns of two sparrow
species. The songs illustrated in part (b) were produced by males reared ...
Species identification and animal behavior
WarblerWatch: Warbler Guy, where do I learn about "reading ...
warblerwatch.blogspot.com650 × 578Search by image
Ergo, you'll quickly have no problems identifying a song sparrow classic
song via its sonogram in comparison to a common yellowthroat's, and
so on.
Teaching and Learning with audio annotations
Examples at Harvard
Prof. Tom Kelly, First Nights course, Harvard College
Music Courses: Annotated Interactive Play-throughs
Learners explore music
Foreign Culture Courses: Listening Guides
Prof. Richard Wolfe, Foreign Cultures Course, Harvard College
Foreign Culture Courses: Listening Guides
Prof. Richard Wolfe, Foreign Cultures Course, Harvard College
Table text
XML text
Listening guide player
Harvard iSites CAT tool
Prof. Richard Wolfe, Harvard College
Research Teachingdrives
Research Teachingdrives
Prof. Richard Wolfe
Ethnomusicology research in
South, Central and West Asia
Courses in Ethnomusicology at
Harvard College
Audio
annotations
richardkwolf.com www.music.fas.harvard.
edu/faculty/rwolf.html
Repositories
Media and Annotation Metadata
How does it all connect ?
Annotation
Meta layer
Visualization layer
Audio layer (URI)
Annotation
DB
Media
Repositories
ideally under same repository entities
Media Search
Open Annotation
Data Model
Client tool !
Interoperability: Tying it all together
Persistent Annotation Meta-layer	

Open API Access
Stable Digital Repositories	

URNs resolving to URLs
Ephemeral Tools / Content / Learning 	

Management Systems
Open Annotation Model
Public
Archives
Other	

Institutions
Archives
Open
Source
Museums
HUL	

DRS
Research
Database
Catalyst,	

LibraryLab	

HILT	

...
Course
& Student
Content
Peer
Researchers
Personal
Research &
Archives
Subject
Experts
Incubator
Projects
Persistent	

Annotation	

Repositories
Individual 	

Repositories
External 	

Repositories
Internal Repositories
Ann
An
Open Annotation Federated Systems	

across all media
Future of audio annotation
• Searching: faceted, specific range target searches
• Semantic tagging: machine learning
• Automatic Annotations:
• transcription / translation
• acoustic detection,
• individual voice recognition
• bioacoustics species id
• AI -> Pairing crowdsourced data and automatic
annotation using semantic annotated data (OA)
• High definition audio and detailed audio analysis
• Collaboration and crowdsourcing tools
• Cross-referencing media annotations
ThankYou !	

Questions?	

!
desenne[ at ]fas[ dot ]harvard[ dot ]edu

More Related Content

What's hot

What's hot (11)

Sound recording glossary
Sound recording glossarySound recording glossary
Sound recording glossary
 
Draft
Draft Draft
Draft
 
Ig2 task 1 work sheet
Ig2 task 1 work sheetIg2 task 1 work sheet
Ig2 task 1 work sheet
 
Sound Recording Glossary
Sound Recording GlossarySound Recording Glossary
Sound Recording Glossary
 
Audio
AudioAudio
Audio
 
Audio media
Audio mediaAudio media
Audio media
 
Audio media resources
Audio media resourcesAudio media resources
Audio media resources
 
Can You Hear Me Now 2 25 09 Slideshare
Can You Hear Me Now 2 25 09 SlideshareCan You Hear Me Now 2 25 09 Slideshare
Can You Hear Me Now 2 25 09 Slideshare
 
Ig2 task 1 re edit version
Ig2 task 1 re edit versionIg2 task 1 re edit version
Ig2 task 1 re edit version
 
Victoria
VictoriaVictoria
Victoria
 
Chap69
Chap69Chap69
Chap69
 

Viewers also liked

Efficient spread spectrum communication without pre shared secrets
Efficient spread spectrum communication without pre shared secretsEfficient spread spectrum communication without pre shared secrets
Efficient spread spectrum communication without pre shared secretsambitlick
 
Sy tech rios ai mobile command mobile command with workstations
Sy tech rios ai mobile command   mobile command with workstationsSy tech rios ai mobile command   mobile command with workstations
Sy tech rios ai mobile command mobile command with workstationsSyTech Corporation
 
[DefCon 2016] I got 99 Problems, but 
Little Snitch ain’t one!
[DefCon 2016] I got 99 Problems, but 
Little Snitch ain’t one![DefCon 2016] I got 99 Problems, but 
Little Snitch ain’t one!
[DefCon 2016] I got 99 Problems, but 
Little Snitch ain’t one!Synack
 
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...Synack
 

Viewers also liked (7)

Efficient spread spectrum communication without pre shared secrets
Efficient spread spectrum communication without pre shared secretsEfficient spread spectrum communication without pre shared secrets
Efficient spread spectrum communication without pre shared secrets
 
Sy tech rios ai mobile command mobile command with workstations
Sy tech rios ai mobile command   mobile command with workstationsSy tech rios ai mobile command   mobile command with workstations
Sy tech rios ai mobile command mobile command with workstations
 
Jamming in Wireless Sensor Networks
Jamming in Wireless Sensor NetworksJamming in Wireless Sensor Networks
Jamming in Wireless Sensor Networks
 
Unit 4 bandwidth utilization
Unit 4 bandwidth utilizationUnit 4 bandwidth utilization
Unit 4 bandwidth utilization
 
[DefCon 2016] I got 99 Problems, but 
Little Snitch ain’t one!
[DefCon 2016] I got 99 Problems, but 
Little Snitch ain’t one![DefCon 2016] I got 99 Problems, but 
Little Snitch ain’t one!
[DefCon 2016] I got 99 Problems, but 
Little Snitch ain’t one!
 
Cdma by svr
Cdma by svrCdma by svr
Cdma by svr
 
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
DEF CON 23: Spread Spectrum Satcom Hacking: Attacking The GlobalStar Simplex ...
 

Similar to Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015

Human Perception and Recognition of Musical Instruments: A Review
Human Perception and Recognition of Musical Instruments: A ReviewHuman Perception and Recognition of Musical Instruments: A Review
Human Perception and Recognition of Musical Instruments: A ReviewEditor IJCATR
 
SodaBottles-licensing Copyright-Fix.pdf
SodaBottles-licensing Copyright-Fix.pdfSodaBottles-licensing Copyright-Fix.pdf
SodaBottles-licensing Copyright-Fix.pdfNga Trinh
 
The Importance Of Enjoying Hi-Res Audio
The Importance Of Enjoying Hi-Res AudioThe Importance Of Enjoying Hi-Res Audio
The Importance Of Enjoying Hi-Res AudioKendra Cote
 
Speech signal processing lizy
Speech signal processing lizySpeech signal processing lizy
Speech signal processing lizyLizy Abraham
 
Unit 1 speech processing
Unit 1 speech processingUnit 1 speech processing
Unit 1 speech processingazhagujaisudhan
 
A.primate communication
A.primate communicationA.primate communication
A.primate communicationRoya Shariati
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language ProcessingVikalp Mahendra
 
Difference Between Alphabet And International Phonetic Theory
Difference Between Alphabet And International Phonetic TheoryDifference Between Alphabet And International Phonetic Theory
Difference Between Alphabet And International Phonetic TheorySandy Harwell
 
Conquer the Code
Conquer the CodeConquer the Code
Conquer the CodeUSAteacher
 
Audio descriptive analysis of singer and musical instrument identification in...
Audio descriptive analysis of singer and musical instrument identification in...Audio descriptive analysis of singer and musical instrument identification in...
Audio descriptive analysis of singer and musical instrument identification in...eSAT Journals
 
DFH_UFA_Workshop2015_program
DFH_UFA_Workshop2015_programDFH_UFA_Workshop2015_program
DFH_UFA_Workshop2015_programL Roumazeilles
 
Class 06 emerson_phonetics_fall2014_intro_to_linguistics_clinical_phx
Class 06 emerson_phonetics_fall2014_intro_to_linguistics_clinical_phxClass 06 emerson_phonetics_fall2014_intro_to_linguistics_clinical_phx
Class 06 emerson_phonetics_fall2014_intro_to_linguistics_clinical_phxLisa Lavoie
 
Stop Looking and Start Listening
Stop Looking and Start ListeningStop Looking and Start Listening
Stop Looking and Start ListeningBecky Stewart
 
Creating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music VisualizationCreating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music Visualizationicchp2012
 
Engage Me! Summer 2014 Multisensory Literacy Strategies
Engage Me! Summer 2014  Multisensory Literacy StrategiesEngage Me! Summer 2014  Multisensory Literacy Strategies
Engage Me! Summer 2014 Multisensory Literacy StrategiesLisa Shaw
 
Russianmusicgenre
RussianmusicgenreRussianmusicgenre
Russianmusicgenrepengel1
 

Similar to Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015 (20)

Human Perception and Recognition of Musical Instruments: A Review
Human Perception and Recognition of Musical Instruments: A ReviewHuman Perception and Recognition of Musical Instruments: A Review
Human Perception and Recognition of Musical Instruments: A Review
 
SodaBottles-licensing Copyright-Fix.pdf
SodaBottles-licensing Copyright-Fix.pdfSodaBottles-licensing Copyright-Fix.pdf
SodaBottles-licensing Copyright-Fix.pdf
 
The Importance Of Enjoying Hi-Res Audio
The Importance Of Enjoying Hi-Res AudioThe Importance Of Enjoying Hi-Res Audio
The Importance Of Enjoying Hi-Res Audio
 
Speech signal processing lizy
Speech signal processing lizySpeech signal processing lizy
Speech signal processing lizy
 
Unit 1 speech processing
Unit 1 speech processingUnit 1 speech processing
Unit 1 speech processing
 
A.primate communication
A.primate communicationA.primate communication
A.primate communication
 
Phonics
PhonicsPhonics
Phonics
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language Processing
 
Difference Between Alphabet And International Phonetic Theory
Difference Between Alphabet And International Phonetic TheoryDifference Between Alphabet And International Phonetic Theory
Difference Between Alphabet And International Phonetic Theory
 
Conquer the Code
Conquer the CodeConquer the Code
Conquer the Code
 
B110512
B110512B110512
B110512
 
Audio descriptive analysis of singer and musical instrument identification in...
Audio descriptive analysis of singer and musical instrument identification in...Audio descriptive analysis of singer and musical instrument identification in...
Audio descriptive analysis of singer and musical instrument identification in...
 
DFH_UFA_Workshop2015_program
DFH_UFA_Workshop2015_programDFH_UFA_Workshop2015_program
DFH_UFA_Workshop2015_program
 
Class 06 emerson_phonetics_fall2014_intro_to_linguistics_clinical_phx
Class 06 emerson_phonetics_fall2014_intro_to_linguistics_clinical_phxClass 06 emerson_phonetics_fall2014_intro_to_linguistics_clinical_phx
Class 06 emerson_phonetics_fall2014_intro_to_linguistics_clinical_phx
 
T0 numtq0nzq=
T0 numtq0nzq=T0 numtq0nzq=
T0 numtq0nzq=
 
L2 Thinking
L2 ThinkingL2 Thinking
L2 Thinking
 
Stop Looking and Start Listening
Stop Looking and Start ListeningStop Looking and Start Listening
Stop Looking and Start Listening
 
Creating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music VisualizationCreating an Entertaining and Informative Music Visualization
Creating an Entertaining and Informative Music Visualization
 
Engage Me! Summer 2014 Multisensory Literacy Strategies
Engage Me! Summer 2014  Multisensory Literacy StrategiesEngage Me! Summer 2014  Multisensory Literacy Strategies
Engage Me! Summer 2014 Multisensory Literacy Strategies
 
Russianmusicgenre
RussianmusicgenreRussianmusicgenre
Russianmusicgenre
 

Recently uploaded

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 

Recently uploaded (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Decoding Digital Audio: Visualizing and Annotating Linear Time-Based Media 2015

  • 1. Decoding Digital Audio Visualizing and Annotating Linear Time-Based Media Phil Desenne Center  for  Hellenic  Studies,     Harvard  University May 8, 2015
  • 2. To decode or interpret audio is to explain the meaning or understanding of something about it Relevant for Research, Teaching and Learning across all disciplines Decode –> Interpret
  • 3. Curiosity - Discovery - Interpretation - Research Amateurs - Learners - Educators - Scholars Each one decodes audio in their own realm the process transcends realms and roles } }
  • 4. Songs Music Voice recordings Field recordings Lyrics / Transcription / Translation Notes / Musicology / Ethnomusicology Oral History / Languages / Speech Therapy Bioacoustics Research / Anthropology Audio Visualize and Annotate {} Listen Decode
  • 5. In our day-to-day we are constantly, filtering, decoding and attaching meaning in our brain to daily sound bites that hit our ears ! digital audio opens a broader spectrum of decoding possibilities for research, teaching and learning
  • 6. Visualization and Annotation of Audio Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nam in augue sodales nisl pharetra efficitur in in felis. Ut malesuada justo nec libero finibus placerat. Donec vitae enim risus. Nunc eget purus eget nunc bibendum tempus hendrerit vel eros. Praesent mollis diam augue, vel convallis quam interdum eu. Transcript we now have digital tools that facilitate and enhance the process of decoding, attaching meaning and understanding
  • 7. Audio Annotation & Visualizations in Research Acoustics / Bioacoustics / Hydroacoustics Anthropology / Musicology / Ethnomusicology Speech Therapy Education / Teaching
  • 8. Importance of Visualizing Sound: Human ear does not hear or discern all sounds www.cochlea.org739 × 252Search by image Frequency hearing range in man and some common animal very high- frequency sounds very low- frequency sounds } visualization –> accessibility
  • 9. Recording Natural Sounds www.leaps.ms900 × 396Search by image You can also use Raven Lite to slow down natural recordings so that the full complexity of a song may be heard. Listen to the complexity in the ending trill ... Bioacoustics: detection and interpretation of sounds in animals Annotating the visual wave form of audio: Amplitude and Frequency
  • 10. BIRD SONGS AND CALLS WITH SPECTROGRAMS ( SONOGRAMS ) OF SOUTHERN ... www.birdsongs.it489 × 396Search by image Fig . 2 shows a 20 second fragment over 1-minute song sequence of a Cirl Bunting (Emberiza cirlus). Fig. 3 blows up one phrase of the same song and shows ... Sonograms and Spectrograms of Bird Songs
  • 11. The Mind's Machine - Chapter 15 A Step Further www.mindsmachine.com800 × 619Search by image (a) These sonograms show the typical adult song patterns of two sparrow species. The songs illustrated in part (b) were produced by males reared ... Species identification and animal behavior
  • 12. WarblerWatch: Warbler Guy, where do I learn about "reading ... warblerwatch.blogspot.com650 × 578Search by image Ergo, you'll quickly have no problems identifying a song sparrow classic song via its sonogram in comparison to a common yellowthroat's, and so on.
  • 13. Teaching and Learning with audio annotations Examples at Harvard
  • 14. Prof. Tom Kelly, First Nights course, Harvard College Music Courses: Annotated Interactive Play-throughs Learners explore music
  • 15. Foreign Culture Courses: Listening Guides Prof. Richard Wolfe, Foreign Cultures Course, Harvard College
  • 16. Foreign Culture Courses: Listening Guides Prof. Richard Wolfe, Foreign Cultures Course, Harvard College
  • 18. Harvard iSites CAT tool Prof. Richard Wolfe, Harvard College
  • 20. Research Teachingdrives Prof. Richard Wolfe Ethnomusicology research in South, Central and West Asia Courses in Ethnomusicology at Harvard College Audio annotations richardkwolf.com www.music.fas.harvard. edu/faculty/rwolf.html
  • 21. Repositories Media and Annotation Metadata How does it all connect ?
  • 22. Annotation Meta layer Visualization layer Audio layer (URI) Annotation DB Media Repositories ideally under same repository entities Media Search Open Annotation Data Model Client tool !
  • 24. Persistent Annotation Meta-layer Open API Access Stable Digital Repositories URNs resolving to URLs Ephemeral Tools / Content / Learning Management Systems Open Annotation Model
  • 26. Future of audio annotation • Searching: faceted, specific range target searches • Semantic tagging: machine learning • Automatic Annotations: • transcription / translation • acoustic detection, • individual voice recognition • bioacoustics species id • AI -> Pairing crowdsourced data and automatic annotation using semantic annotated data (OA) • High definition audio and detailed audio analysis • Collaboration and crowdsourcing tools • Cross-referencing media annotations
  • 27. ThankYou ! Questions? ! desenne[ at ]fas[ dot ]harvard[ dot ]edu