SlideShare a Scribd company logo
1 of 22
Download to read offline
Introduction of my research history:
From instrument recognition
to support of amateurs' music
creation
Tetsuro Kitahara
Nihon University, Japan
kitahara@chs.nihon-u.ac.jp
http://www.kthrlab.jp/
twitter: @tetsurokitahara
Introduction of myself
● Name: Tetsuro Kitahara
● Living in: Tokyo, Japan
● Age: 37
● Position: Associate professor in Nihon University
● Favorites: Music, Drinking alchol, etc.
My research history
2000
2002
2004
2007
2010
2016
Audio signal processing
Pattern recognition
Content-based MIR
Automatic music generation
Probabilistic modeling
Computer-human interaction
StudentPostDocNow
Research when I was a student
● Instrument recognition for polyphonic music
● Content-based Music Information Retrieval
Instrument recognition
● Instrumentation is an important factor in MIR
● Not many attempts at polyphonic instrument
recognition at that time
● Typical framework:
Note detection -> feature extract. for each note
1:1:000
C4
B3
D4
E4
F4
G4
2:1:000 1:1:000
C4
B3
D4
E4
F4
G4
2:1:000
VN
VN
VN
PF
PF
(VN:Violin, PF: Piano)
For each note…
Pitch: C4
Start: 1:1:000
End: 1:3:000
Piano: 99.0%
Violin: 0.6%
……
A posteriori probabilitiesHarmonic structure
X1 = 0.124
X2 = 0.635
……
Feature vector
Instrument recognition
● Instrogram: Subsymbolic time-freq. representation
of instrument existence
● p(ωi; t, f ) : the probabilty that the sound of instru-
ment ωi with F0 of f exists at time t
Piano
Flute
Time [s]
Formulation of Instrogram
Non-specific IEP Conditional IEP
H
M
M
S E
…
S E
S E
HMM for each semitone
t
f
Instrument existence probability (IEP)
…
PreFEst [Goto, 2000]
(Estimate the weight for
tone model for each semitone)
= w110×
+ w660×
Observed spectrum
Tone model
for 110Hz
Tone model
for 660Hz
p(ωi; t, f ) = p(X; t, f ) p(ωi | X; t, f )
Demo VIdeo
Research when I was a PostDoc
● Chord voicing based on Bayesian network (skip)
● BayesianBand
● OrpheusBB
● CrestMuseXML and CrestMuse Toolkit (skip)
BayesianBand
A jam session system based on mutual prediction
of the user and system
C ? ? ?
Chord progression:
Predict
Determine
in real time
Musically
match
Pleasant
ChordMelody
Concept Melody
Each keystroke
Infer the next chord
Next chord
When changing
the measure
Get the latest result
Generate MIDI data
C
Dm
Em
F
G
Am
Bm(-5)
Likelihood
Chord Chord Chord
Melody
Note
Melody
Note
Melody
Note
Implementation
Demo VIdeo
OrpheusBB
Human-in-the-loop
Initial
input
Generate
music 
 Listen to
 the music
 Edit the
 melody
Regenerate
the backing
 Edit the
 backing
Regenerate  
the melody  
Finish
Model
(Collaboration with Univ. of Tokyo)
Demo
Allows users to edit outputs of the system
Automatically re-generate the remaining part
according to part edited by users
Input lyrics
Edit the
melody
The chord is auto-
matically changed
Research in the current university
● Four-part harmonization using Bayes nets (skip)
● Humming-based composition support (skip)
● Melody editing using melodic outline
● Smart loop sequencer
Melody editing using melodic outline
● Melody is represented as a continous curve
● User can edit the melody by redrawing this curve
Edited part
Edit by
user
Note sequence
Pitch trajectory
Melodic outline
Procedure of melody editing
Pitch trajectory
Fourier transform
Inverse Fourier
transform Save for later use
How to extract melodic outline
Extract low-
order coeffs.
High-order coeffs. of
original melody
Low-order coeffs. of
edited outline
Fourier transform
Inverse Fourier
transform
Do Mi Fa So Ti Ra So Do Re Mi
Hidden Markov model
How to generate a melody from the edited outline
Demo
Smart loop sequencer
● Automatically selects music loops from collection
● Uses the degree of excitement as an input
The degree of
excitement
Tuple of loop IDs for 5 parts at measure n ("0" for no loop)
Estimate the most likely
[s1, ..., sN] for given x
Observed signal
Fomulating with HMM
Simplify the formulation by independently considering
qn, i × s'n, i
Whether a loop is placed
at measure n in part i
If so, which loop is placed
there
P(xn | qn)
The more loops are inserted,
the higher xn is emitted
5 1
the higher deg. of excitement
is annotated in the loop,
the higher xn is emitted
P(xn | s'n)
High deg. of excitement
Drums
Drums
Sequence
Sequence
Low deg. of excitement
How to estimate the degree of excitement for each music loop
Demo
Discussions
● Generate music data
from users’ inputs
● Users’ inputs are
usually imcomplete
● Typically based on
probabilistic models
These works have two aspects:
Automatic music generation Human-computer interaction
● Allow users to input
their intent easily
● More abstract than
specific music data
● Details are hidden
● Tradeoff btwn details
and intuitiveness
Conclusion
● My research
– MIR-related subjects (-2007)
● Musical instrument recognition
● MIR based on instrumentation similarity
– Music generation (2007-)
● BayesianBand, Melodic outline, Smart loop sequencer, …
● 2 aspects in my recent works
– Automatic music generation
– human-computer interaction
● Research plan during this stay
– Improvement of melody generation model, ...

More Related Content

Similar to Introduction of my research histroy: From instrument recognition to support of amateurs' music creation

音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して
音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して
音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指してkthrlab
 
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)kthrlab
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mirYi-Hsuan Yang
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information RetrievalSease
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information RetrievalAndrea Gazzarini
 
Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier
 
machine learning x music
machine learning x musicmachine learning x music
machine learning x musicYi-Hsuan Yang
 
Thesis presentation on Music Information Retrieval
Thesis presentation on Music Information RetrievalThesis presentation on Music Information Retrieval
Thesis presentation on Music Information RetrievalGanesh Harugeri
 
Research at MAC Lab, Academia Sincia, in 2017
Research at MAC Lab, Academia Sincia, in 2017Research at MAC Lab, Academia Sincia, in 2017
Research at MAC Lab, Academia Sincia, in 2017Yi-Hsuan Yang
 
Computational models of symphonic music
Computational models of symphonic musicComputational models of symphonic music
Computational models of symphonic musicEmilia Gómez
 
Interval Hashing Based Ranking
Interval Hashing Based RankingInterval Hashing Based Ranking
Interval Hashing Based RankingAndrea Gazzarini
 
Musical Information Retrieval Take 2: Interval Hashing Based Ranking
Musical Information Retrieval Take 2: Interval Hashing Based RankingMusical Information Retrieval Take 2: Interval Hashing Based Ranking
Musical Information Retrieval Take 2: Interval Hashing Based RankingSease
 
Presentation on LMMS
Presentation on LMMSPresentation on LMMS
Presentation on LMMSbhattigurjot
 
Denktank 2010
Denktank 2010Denktank 2010
Denktank 2010ocor203
 
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...multimediaeval
 

Similar to Introduction of my research histroy: From instrument recognition to support of amateurs' music creation (20)

音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して
音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して
音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して
 
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information Retrieval
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information Retrieval
 
Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposal
 
machine learning x music
machine learning x musicmachine learning x music
machine learning x music
 
Thesis presentation on Music Information Retrieval
Thesis presentation on Music Information RetrievalThesis presentation on Music Information Retrieval
Thesis presentation on Music Information Retrieval
 
Research at MAC Lab, Academia Sincia, in 2017
Research at MAC Lab, Academia Sincia, in 2017Research at MAC Lab, Academia Sincia, in 2017
Research at MAC Lab, Academia Sincia, in 2017
 
AMT overview
AMT overviewAMT overview
AMT overview
 
Computational models of symphonic music
Computational models of symphonic musicComputational models of symphonic music
Computational models of symphonic music
 
BTP_MIDSEM_RNN.pptx
BTP_MIDSEM_RNN.pptxBTP_MIDSEM_RNN.pptx
BTP_MIDSEM_RNN.pptx
 
Interval Hashing Based Ranking
Interval Hashing Based RankingInterval Hashing Based Ranking
Interval Hashing Based Ranking
 
Musical Information Retrieval Take 2: Interval Hashing Based Ranking
Musical Information Retrieval Take 2: Interval Hashing Based RankingMusical Information Retrieval Take 2: Interval Hashing Based Ranking
Musical Information Retrieval Take 2: Interval Hashing Based Ranking
 
Presentation on LMMS
Presentation on LMMSPresentation on LMMS
Presentation on LMMS
 
Ism2011
Ism2011Ism2011
Ism2011
 
楊奕軒/音樂資料檢索
楊奕軒/音樂資料檢索楊奕軒/音樂資料檢索
楊奕軒/音樂資料檢索
 
Denktank 2010
Denktank 2010Denktank 2010
Denktank 2010
 
CHAPTER – 5 Audio
CHAPTER – 5     AudioCHAPTER – 5     Audio
CHAPTER – 5 Audio
 
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
 

More from kthrlab

ハイフレックス授業のススメ
ハイフレックス授業のススメハイフレックス授業のススメ
ハイフレックス授業のススメkthrlab
 
Generating Walking Bass Lines with HMM
Generating Walking Bass Lines with HMMGenerating Walking Bass Lines with HMM
Generating Walking Bass Lines with HMMkthrlab
 
JamSketch: Improvisation Support System with GA-based Melody Creation from Us...
JamSketch: Improvisation Support System with GA-based Melody Creation from Us...JamSketch: Improvisation Support System with GA-based Melody Creation from Us...
JamSketch: Improvisation Support System with GA-based Melody Creation from Us...kthrlab
 
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試みkthrlab
 
卒研発表
卒研発表卒研発表
卒研発表kthrlab
 
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試みGTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試みkthrlab
 
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試みkthrlab
 
卒論発表_oshita
卒論発表_oshita卒論発表_oshita
卒論発表_oshitakthrlab
 
多人数演奏楽譜から連弾譜への自動編曲
多人数演奏楽譜から連弾譜への自動編曲多人数演奏楽譜から連弾譜への自動編曲
多人数演奏楽譜から連弾譜への自動編曲kthrlab
 
振りのタイミングを評価するダンス練習システム
振りのタイミングを評価するダンス練習システム振りのタイミングを評価するダンス練習システム
振りのタイミングを評価するダンス練習システムkthrlab
 
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)kthrlab
 
音楽と情報科学
音楽と情報科学音楽と情報科学
音楽と情報科学kthrlab
 
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)kthrlab
 
音楽の知識表現:自動作編曲への応用
音楽の知識表現:自動作編曲への応用音楽の知識表現:自動作編曲への応用
音楽の知識表現:自動作編曲への応用kthrlab
 
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?kthrlab
 
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」kthrlab
 
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」kthrlab
 
音楽と情報科学
音楽と情報科学音楽と情報科学
音楽と情報科学kthrlab
 
A Machine Learning Approach to Support Music Creation by Musically Untrained ...
A Machine Learning Approach to Support Music Creation by Musically Untrained ...A Machine Learning Approach to Support Music Creation by Musically Untrained ...
A Machine Learning Approach to Support Music Creation by Musically Untrained ...kthrlab
 
Extracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
Extracting Melodic Contour Using Wavelet-based Multi-resolution AnalysisExtracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
Extracting Melodic Contour Using Wavelet-based Multi-resolution Analysiskthrlab
 

More from kthrlab (20)

ハイフレックス授業のススメ
ハイフレックス授業のススメハイフレックス授業のススメ
ハイフレックス授業のススメ
 
Generating Walking Bass Lines with HMM
Generating Walking Bass Lines with HMMGenerating Walking Bass Lines with HMM
Generating Walking Bass Lines with HMM
 
JamSketch: Improvisation Support System with GA-based Melody Creation from Us...
JamSketch: Improvisation Support System with GA-based Melody Creation from Us...JamSketch: Improvisation Support System with GA-based Melody Creation from Us...
JamSketch: Improvisation Support System with GA-based Melody Creation from Us...
 
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
 
卒研発表
卒研発表卒研発表
卒研発表
 
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試みGTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
 
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
 
卒論発表_oshita
卒論発表_oshita卒論発表_oshita
卒論発表_oshita
 
多人数演奏楽譜から連弾譜への自動編曲
多人数演奏楽譜から連弾譜への自動編曲多人数演奏楽譜から連弾譜への自動編曲
多人数演奏楽譜から連弾譜への自動編曲
 
振りのタイミングを評価するダンス練習システム
振りのタイミングを評価するダンス練習システム振りのタイミングを評価するダンス練習システム
振りのタイミングを評価するダンス練習システム
 
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
 
音楽と情報科学
音楽と情報科学音楽と情報科学
音楽と情報科学
 
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
 
音楽の知識表現:自動作編曲への応用
音楽の知識表現:自動作編曲への応用音楽の知識表現:自動作編曲への応用
音楽の知識表現:自動作編曲への応用
 
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
 
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
 
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
 
音楽と情報科学
音楽と情報科学音楽と情報科学
音楽と情報科学
 
A Machine Learning Approach to Support Music Creation by Musically Untrained ...
A Machine Learning Approach to Support Music Creation by Musically Untrained ...A Machine Learning Approach to Support Music Creation by Musically Untrained ...
A Machine Learning Approach to Support Music Creation by Musically Untrained ...
 
Extracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
Extracting Melodic Contour Using Wavelet-based Multi-resolution AnalysisExtracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
Extracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
 

Recently uploaded

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 

Recently uploaded (20)

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 

Introduction of my research histroy: From instrument recognition to support of amateurs' music creation

  • 1. Introduction of my research history: From instrument recognition to support of amateurs' music creation Tetsuro Kitahara Nihon University, Japan kitahara@chs.nihon-u.ac.jp http://www.kthrlab.jp/ twitter: @tetsurokitahara
  • 2. Introduction of myself ● Name: Tetsuro Kitahara ● Living in: Tokyo, Japan ● Age: 37 ● Position: Associate professor in Nihon University ● Favorites: Music, Drinking alchol, etc.
  • 3. My research history 2000 2002 2004 2007 2010 2016 Audio signal processing Pattern recognition Content-based MIR Automatic music generation Probabilistic modeling Computer-human interaction StudentPostDocNow
  • 4. Research when I was a student ● Instrument recognition for polyphonic music ● Content-based Music Information Retrieval
  • 5. Instrument recognition ● Instrumentation is an important factor in MIR ● Not many attempts at polyphonic instrument recognition at that time ● Typical framework: Note detection -> feature extract. for each note 1:1:000 C4 B3 D4 E4 F4 G4 2:1:000 1:1:000 C4 B3 D4 E4 F4 G4 2:1:000 VN VN VN PF PF (VN:Violin, PF: Piano) For each note… Pitch: C4 Start: 1:1:000 End: 1:3:000 Piano: 99.0% Violin: 0.6% …… A posteriori probabilitiesHarmonic structure X1 = 0.124 X2 = 0.635 …… Feature vector
  • 6. Instrument recognition ● Instrogram: Subsymbolic time-freq. representation of instrument existence ● p(ωi; t, f ) : the probabilty that the sound of instru- ment ωi with F0 of f exists at time t Piano Flute Time [s]
  • 7. Formulation of Instrogram Non-specific IEP Conditional IEP H M M S E … S E S E HMM for each semitone t f Instrument existence probability (IEP) … PreFEst [Goto, 2000] (Estimate the weight for tone model for each semitone) = w110× + w660× Observed spectrum Tone model for 110Hz Tone model for 660Hz p(ωi; t, f ) = p(X; t, f ) p(ωi | X; t, f ) Demo VIdeo
  • 8. Research when I was a PostDoc ● Chord voicing based on Bayesian network (skip) ● BayesianBand ● OrpheusBB ● CrestMuseXML and CrestMuse Toolkit (skip)
  • 9. BayesianBand A jam session system based on mutual prediction of the user and system C ? ? ? Chord progression: Predict Determine in real time Musically match Pleasant ChordMelody Concept Melody Each keystroke Infer the next chord Next chord When changing the measure Get the latest result Generate MIDI data C Dm Em F G Am Bm(-5) Likelihood Chord Chord Chord Melody Note Melody Note Melody Note Implementation Demo VIdeo
  • 10. OrpheusBB Human-in-the-loop Initial input Generate music   Listen to  the music  Edit the  melody Regenerate the backing  Edit the  backing Regenerate   the melody   Finish Model (Collaboration with Univ. of Tokyo) Demo Allows users to edit outputs of the system Automatically re-generate the remaining part according to part edited by users
  • 11. Input lyrics Edit the melody The chord is auto- matically changed
  • 12. Research in the current university ● Four-part harmonization using Bayes nets (skip) ● Humming-based composition support (skip) ● Melody editing using melodic outline ● Smart loop sequencer
  • 13. Melody editing using melodic outline ● Melody is represented as a continous curve ● User can edit the melody by redrawing this curve
  • 14. Edited part Edit by user Note sequence Pitch trajectory Melodic outline Procedure of melody editing
  • 15. Pitch trajectory Fourier transform Inverse Fourier transform Save for later use How to extract melodic outline Extract low- order coeffs.
  • 16. High-order coeffs. of original melody Low-order coeffs. of edited outline Fourier transform Inverse Fourier transform Do Mi Fa So Ti Ra So Do Re Mi Hidden Markov model How to generate a melody from the edited outline Demo
  • 17. Smart loop sequencer ● Automatically selects music loops from collection ● Uses the degree of excitement as an input The degree of excitement
  • 18. Tuple of loop IDs for 5 parts at measure n ("0" for no loop) Estimate the most likely [s1, ..., sN] for given x Observed signal Fomulating with HMM
  • 19. Simplify the formulation by independently considering qn, i × s'n, i Whether a loop is placed at measure n in part i If so, which loop is placed there P(xn | qn) The more loops are inserted, the higher xn is emitted 5 1 the higher deg. of excitement is annotated in the loop, the higher xn is emitted P(xn | s'n)
  • 20. High deg. of excitement Drums Drums Sequence Sequence Low deg. of excitement How to estimate the degree of excitement for each music loop Demo
  • 21. Discussions ● Generate music data from users’ inputs ● Users’ inputs are usually imcomplete ● Typically based on probabilistic models These works have two aspects: Automatic music generation Human-computer interaction ● Allow users to input their intent easily ● More abstract than specific music data ● Details are hidden ● Tradeoff btwn details and intuitiveness
  • 22. Conclusion ● My research – MIR-related subjects (-2007) ● Musical instrument recognition ● MIR based on instrumentation similarity – Music generation (2007-) ● BayesianBand, Melodic outline, Smart loop sequencer, … ● 2 aspects in my recent works – Automatic music generation – human-computer interaction ● Research plan during this stay – Improvement of melody generation model, ...