SlideShare a Scribd company logo
1 of 34
Download to read offline
音楽の非専門家が創作・演奏を通じて
音楽を楽しめる世界を目指して
北原 鉄朗 (日本大学文理学部)
Twitter: @tetsurokitahara
Feel free to take & distribute photos on Twitter etc!
Towards the World in which Musical Novices Enjoy Music
through Creation and Performance
Tetsuro Kitahara (Nihon University)
Introduction to myself
Introduction to myself
● Name:
きたはら  てつろう
北原 鉄朗 (Tetsuro Kitahara)
● Born in Sapporo, Japan
● Living in Tokyo, Japan
● Age: 39 (born in 1979)
● Position: Associate professor at Nihon University
● Favorites: Music, Drinking alcohol, etc.
● Studied music at Yamaha Junior Music School
My academic history
● 2002: BSc from Tokyo University of Science
– Musical instrument recognition
– (Virtually) supervised by Prof. Okuno and Dr. Goto
● 2007: PhD from Kyoto University
– Musical instument recognition and its appl. to MIR
– Supervised by Prof. Okuno and Dr. Goto
● 2007-2010: PostDoc at Kwansei Gakuin University
– Project head: Prof. Katayose
● 2010-2014: Assistant professor at Nihon University
● 2014-present: Associate professor at Nihon Univ.
– 2016-2017: Visiting reseacher at UPF (about 5 months)
born here
Tokyo Univ. of Sci.
(Chiba branch)
Nihon Univ.
Kyoto Univ.
Universitat Pompeu Fabra
Kwansei Gakuin Univ.
(super country-side)
Kitahara Lab. (Sound Computing Lab.)
Music-related topics
Automatic music generation
HCIs for assisting
non-musicians’ composition
C
G
E
C
Am
A
E
C
Music recommendation
Karaoke enhancement
Guitar transcription
Bass analysis etc.
Other topics
Analysis of cuteness of female voices
Human-agent interaction via sound etc.
Spectrum of topics in music computing
What to support
Appreciation Performance Composition
Whotosupport
ExpertsNon-experts
JamSketch: Improvisation support system
with GA-based melody creation
from user’s drawing
Special thanks to Rafael Ramirez and Sergio Giraldo
(Universitat Pompeu Fabra)
Our goal
Enable non-musicians to enjoy improvisation
Why it’s difficult
Players have to create melodies and
play an instrument at the same time
Technical issues
Human Interface
Non-musician-friendly,
easy-to-input, intuitive
& quickly inputable
Melody generation
Automatic melody generation
based on the user’s abstract,
incomplete input in real time
Our idea: JamSketch
Human Interface
User draws a melodic outline on the pianoroll
Music generation
Once the melodic outline is drawn, a melody is
immediately generated with a genetic algorithm
System overview
time
During the playback of the accompaniment
0th
bar 1st
bar 2nd
bar
Draws a melodic outline at least one bar in advance
Determines the rhythm of the melody
User
If the mouse cursor moves to the next bar
or the mouse button is released,
Determines the pitch of each note
Determines expression parameters
Plays the melody
Determine the rhythm
● High gradient → short notes
● Low gradient → long notes
Basic idea
3
Details of algorithm
0 Candidates
3
33
3 3
33
1 Tentative rhythm
Threshold
the gradient
2 Best-fit rhythm
Search the
closest one
from 0
3
Determine the pitches
Basic idea
closeness to
the outline
Search a sequence of notes satisfying
both
3
C A F
and musical
likeliness
using a genetic algorithm (GA)
Details of formulation
Chromosome N = (n0
, n1
, …, nL-1
) (ni
: note number)
Fitness function
F(N) = w0
sim(N) + w1
seq1
(N) + w2
seq2
(N)
+ w3
harm(N) + w4
ent(N)
Similarity
to outline
Bigram
P(ni
| ni-1
)
Delta bigram
P(ni
-ni-1
| ni-1
-ni-2
)
P(notes | chord) Entropy
Details of fitness function
Similarity to outline
sim(N) = - Σ(ni
- y(ti
))2
Pitch bigram probability
seq1
(N) = Σ log P(ni
| ni-1
)
(y(ti
): pitch at time ti
in outline)
Delta (pitch-motion) bigram probability
seq2
(N) = Σ log P(ni
– ni-1
| ni-1
– ni-2
)
Conditional probability for given chords
harm(N) = Σ log P(ni
| ci
, bi
)
(ci
: chord at time ti
)
(bi
: metric position at ti
,
{head, on-beat, off-beat})Entropy
ent(N) = -(H(N) – Hmean
– ε)2
Determine expression parameters
Use a model presented at [Giraldo & Ramirez 2016]
C
Feature extraction
for each note
k-NN, multilayer perceptron, ...
onset deviation
duration ratio
energy ratio
(Duration, onset, prev. duration,
next duration, prev. interval,
next interval, namour, chord, etc.)
Implementation
Java
Groovy
Octave
MIDI Toolbox
WekaJamSketch
CrestMuse
Toolkit
Apache
Commons Math
Java
Octave
Estimation of
expression parameters
GA
Feature extraction
Main program
Training data
53 melodies with the Tonality-type of Blues from
Weimar Jazz Database
Experiment
Purpose
Confirm the quality of generated melodies
Method
1. Participants play improvisation with our system and
a normal MIDI keyboard
2. The performances with our system and the keyboard
are listed without distinction
3. Other participants evaluate them
If there are no clear differences between evaluations
for our system and the keyboard, our system can be
considered to have human-like composition capability
Participants & performances
Participants
Experience Use
Play Impr. KB Syst.
P1 Yes Yes Yes No
P2 Yes No Yes Yes
P3 Yes No No Yes
P4 No No No Yes
P5 Yes No Yes Yes
P6 Yes Yes Yes No
Performances
?
?
?
?
?
?
?
?
STOP
System
System
KB (P2)
KB (P1)
System
KB (P5)
System
KB (P6)
s2
s1
u2
u1
s4
u3
s3
u4
Listening test
● We asked 12 researchers / students in musical
fields to evaluate them on a scale of 0 to 10
● Criteria
– Overall quality
– Pitch appropriateness
– Rhythmic appropriateness
– Melodic variation
● http://www.kthrlab.jp/proj/jamsketch/listen/
– Blues-likeness
– Skillfulness
– Human-likeness
Please participate in the listening test!
Result: Overall quality
{u4} > {u1, u2} > {s1, s2, s3, s4} ≈ {u3}
Result: Pitch appropriateness
{u1, u2, u4} > {s1, s2, s3, s4} > {u3}
Result: Rhythmic appropriateness
{u2, u4} > {u1, u3} > {s1, s2, s3, s4}
Result: Melodic variation
{u4} >> {s1, s2, s4} ≈ {u1, u2, u3} > {s3} ≈ {u3}
Result: Blues-likeness
{u4} > {u2} > {s1, s2, s3, s4} ≈ {u1} >> {u3}
Result: Skillfulness
{u4} >> {u2} > {s1, s2, s3, s4} ≈ {u1, u3}
Result: Human-likeness
{u4} > {u1, u2} > {u3} >> {s1, s2, s3, s4}
Discussions
● Our system is inferior to experienced players
● But it is superior (or close) to inexperienced players
● Human-likeness is lower than both-class players
● Possible reasons:
– Out-of-scale pitches appear at the ends of phrases
● The system does not consider boundaries of phrases
– Out-of-scale pitches appear at long notes
● Rhythm and pitches are independently determined
– Rhythm is too monotonous
● Rhythm needs more variations
● We obtained positive opinions from participants
On-going next plans
● Improvement of rhythm determination
– Data-driven
● Input with eye tracking
– Focusing on disabled people
● Input with smartphones
– Approach 1: moving them up and down
– Approach 2: capture the hand’s motion with the camera
(collaboration with Shiramatsu Lab., NITech)
● Performance by multiple players
(collaboration with Shiramatsu Lab., NITech)
● Incremental melody generation
General discussions
(just my personal opinions)
What is the state-of-the-art of
automatic music composition?
● Recent systems are capable to create musically
appropriate pieces
● In particular, some systems imitate existing
composers’ features well
– e.g. DeepBach (F. Pachet)
https://www.youtube.com/watch?v=QiBM7-5hA6o
● Systems using deep neural nets (CNN, RNN) are
increasing
● Large-scale DBs are still lacking
Are there better human interfaces for
non-experts to express musical ideas?
● Melodic outlines are non-musician-friendly,
easy-to-input, intuitive, and quickly inputable
● Basic concept: “Macro structure by the user,
micro structure by the system”
● Other possibilities:
– verbal expressions (impression words)
– melodic examples (existing melodies)
● With melodic outlines, users cannot specify
rhythmic features
What is the best relationship
between humans and systems?
● Relation of a producer and a composer?
● Ideally, systems should be ghostwriters?
– Human ghostwriters can compose but cannot
improvise according to another person’s instructions
● Truly original part should be created by humans
● When humans give musically low-quality ideas,
how can systems suggest better ideas?
● What can systems do if the user’s final goal is to
be able to compose music without any support?
Conclusion
Conclusion
● To enable non-experts to create music, systems
need to have
– an intuitive user interface
– a technique for generating music
● JamSketch: an improvisation support system with
– melodic outlines
– GA-based melody generator
https://github.com/kthrlab/JamSketch/tree/jamsketch-0.1.0-lite
Let’s try it! (on Windows or other Java-compatible platforms)
Please participate in listening test!
http://www.kthrlab.jp/proj/jamsketch/listen/

More Related Content

Similar to 音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して

The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...multimediaeval
 
saito22research_talk_at_NUS
saito22research_talk_at_NUSsaito22research_talk_at_NUS
saito22research_talk_at_NUSYuki Saito
 
AI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDAI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDJaideep Ghosh
 
Distinguishing Violinists and Pianists Based on Their Brain Signals
Distinguishing Violinists and Pianists Based on Their Brain SignalsDistinguishing Violinists and Pianists Based on Their Brain Signals
Distinguishing Violinists and Pianists Based on Their Brain SignalsGianpaolo Coro
 
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)kthrlab
 
Audio-Only Augmented Reality System for Social Interaction
Audio-Only Augmented Reality System for Social InteractionAudio-Only Augmented Reality System for Social Interaction
Audio-Only Augmented Reality System for Social InteractionTom Gurion
 
AMIS 2016 2, Music Math Coding
AMIS 2016 2, Music Math CodingAMIS 2016 2, Music Math Coding
AMIS 2016 2, Music Math CodingSamuel Wright
 
Graphical visualization of musical emotions
Graphical visualization of musical emotionsGraphical visualization of musical emotions
Graphical visualization of musical emotionsPranay Prasoon
 
Ict Introduction and investigation
Ict Introduction and investigationIct Introduction and investigation
Ict Introduction and investigationdiena996
 
Lorenzo Porcaro PhD Defense
Lorenzo Porcaro PhD Defense Lorenzo Porcaro PhD Defense
Lorenzo Porcaro PhD Defense Lorenzo Porcaro
 
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...multimediaeval
 
Analyzing DJ Scratching and its Acoustics
Analyzing DJ Scratching and its Acoustics Analyzing DJ Scratching and its Acoustics
Analyzing DJ Scratching and its Acoustics Flyboost
 
IRJET - EMO-MUSIC(Emotion based Music Player)
IRJET - EMO-MUSIC(Emotion based Music Player)IRJET - EMO-MUSIC(Emotion based Music Player)
IRJET - EMO-MUSIC(Emotion based Music Player)IRJET Journal
 
Systems Thinking Lesson Plan for Classical Music
Systems Thinking Lesson Plan for Classical MusicSystems Thinking Lesson Plan for Classical Music
Systems Thinking Lesson Plan for Classical Musicronalynpiano
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mirYi-Hsuan Yang
 
2010 i son barrass-schaffert-v0.2
2010 i son barrass-schaffert-v0.22010 i son barrass-schaffert-v0.2
2010 i son barrass-schaffert-v0.2umwebejo
 
Deep Learning Meetup #5
Deep Learning Meetup #5Deep Learning Meetup #5
Deep Learning Meetup #5Aloïs Gruson
 
The Wonder of Digitizing Artifacts Left Behind by Our Kindred Dead
The Wonder of Digitizing Artifacts Left Behind by Our Kindred DeadThe Wonder of Digitizing Artifacts Left Behind by Our Kindred Dead
The Wonder of Digitizing Artifacts Left Behind by Our Kindred DeadRsMom
 

Similar to 音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して (20)

Emofy
Emofy Emofy
Emofy
 
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
The MediaEval 2017 AcousticBrainz Genre Task: Content-based Music Genre Recog...
 
saito22research_talk_at_NUS
saito22research_talk_at_NUSsaito22research_talk_at_NUS
saito22research_talk_at_NUS
 
AI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUNDAI THROUGH THE EYES OF ORGANISE SOUND
AI THROUGH THE EYES OF ORGANISE SOUND
 
Distinguishing Violinists and Pianists Based on Their Brain Signals
Distinguishing Violinists and Pianists Based on Their Brain SignalsDistinguishing Violinists and Pianists Based on Their Brain Signals
Distinguishing Violinists and Pianists Based on Their Brain Signals
 
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
北原研究室の研究事例紹介:ベーシストの旋律分析とイコライザーの印象分析(Music×Analytics Meetup vol.5 ロングトーク)
 
Audio-Only Augmented Reality System for Social Interaction
Audio-Only Augmented Reality System for Social InteractionAudio-Only Augmented Reality System for Social Interaction
Audio-Only Augmented Reality System for Social Interaction
 
AMIS 2016 2, Music Math Coding
AMIS 2016 2, Music Math CodingAMIS 2016 2, Music Math Coding
AMIS 2016 2, Music Math Coding
 
Graphical visualization of musical emotions
Graphical visualization of musical emotionsGraphical visualization of musical emotions
Graphical visualization of musical emotions
 
Ict Introduction and investigation
Ict Introduction and investigationIct Introduction and investigation
Ict Introduction and investigation
 
Lorenzo Porcaro PhD Defense
Lorenzo Porcaro PhD Defense Lorenzo Porcaro PhD Defense
Lorenzo Porcaro PhD Defense
 
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...
MediaEval 2018: AcousticBrainz Genre Task: Content-based Music Genre Recognit...
 
Analyzing DJ Scratching and its Acoustics
Analyzing DJ Scratching and its Acoustics Analyzing DJ Scratching and its Acoustics
Analyzing DJ Scratching and its Acoustics
 
MULHER@AVI2012
MULHER@AVI2012MULHER@AVI2012
MULHER@AVI2012
 
IRJET - EMO-MUSIC(Emotion based Music Player)
IRJET - EMO-MUSIC(Emotion based Music Player)IRJET - EMO-MUSIC(Emotion based Music Player)
IRJET - EMO-MUSIC(Emotion based Music Player)
 
Systems Thinking Lesson Plan for Classical Music
Systems Thinking Lesson Plan for Classical MusicSystems Thinking Lesson Plan for Classical Music
Systems Thinking Lesson Plan for Classical Music
 
20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir20211026 taicca 1 intro to mir
20211026 taicca 1 intro to mir
 
2010 i son barrass-schaffert-v0.2
2010 i son barrass-schaffert-v0.22010 i son barrass-schaffert-v0.2
2010 i son barrass-schaffert-v0.2
 
Deep Learning Meetup #5
Deep Learning Meetup #5Deep Learning Meetup #5
Deep Learning Meetup #5
 
The Wonder of Digitizing Artifacts Left Behind by Our Kindred Dead
The Wonder of Digitizing Artifacts Left Behind by Our Kindred DeadThe Wonder of Digitizing Artifacts Left Behind by Our Kindred Dead
The Wonder of Digitizing Artifacts Left Behind by Our Kindred Dead
 

More from kthrlab

ハイフレックス授業のススメ
ハイフレックス授業のススメハイフレックス授業のススメ
ハイフレックス授業のススメkthrlab
 
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試みkthrlab
 
卒研発表
卒研発表卒研発表
卒研発表kthrlab
 
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試みGTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試みkthrlab
 
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試みkthrlab
 
卒論発表_oshita
卒論発表_oshita卒論発表_oshita
卒論発表_oshitakthrlab
 
多人数演奏楽譜から連弾譜への自動編曲
多人数演奏楽譜から連弾譜への自動編曲多人数演奏楽譜から連弾譜への自動編曲
多人数演奏楽譜から連弾譜への自動編曲kthrlab
 
振りのタイミングを評価するダンス練習システム
振りのタイミングを評価するダンス練習システム振りのタイミングを評価するダンス練習システム
振りのタイミングを評価するダンス練習システムkthrlab
 
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)kthrlab
 
音楽と情報科学
音楽と情報科学音楽と情報科学
音楽と情報科学kthrlab
 
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)kthrlab
 
音楽の知識表現:自動作編曲への応用
音楽の知識表現:自動作編曲への応用音楽の知識表現:自動作編曲への応用
音楽の知識表現:自動作編曲への応用kthrlab
 
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?kthrlab
 
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」kthrlab
 
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」kthrlab
 
音楽と情報科学
音楽と情報科学音楽と情報科学
音楽と情報科学kthrlab
 
A Machine Learning Approach to Support Music Creation by Musically Untrained ...
A Machine Learning Approach to Support Music Creation by Musically Untrained ...A Machine Learning Approach to Support Music Creation by Musically Untrained ...
A Machine Learning Approach to Support Music Creation by Musically Untrained ...kthrlab
 
Extracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
Extracting Melodic Contour Using Wavelet-based Multi-resolution AnalysisExtracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
Extracting Melodic Contour Using Wavelet-based Multi-resolution Analysiskthrlab
 
Music Synchronizer with Runner's Pace for Supporting Steady Pace Jogging
Music Synchronizer with Runner's Pace for Supporting Steady Pace JoggingMusic Synchronizer with Runner's Pace for Supporting Steady Pace Jogging
Music Synchronizer with Runner's Pace for Supporting Steady Pace Joggingkthrlab
 
音声認識の仕組みを知ろう
音声認識の仕組みを知ろう音声認識の仕組みを知ろう
音声認識の仕組みを知ろうkthrlab
 

More from kthrlab (20)

ハイフレックス授業のススメ
ハイフレックス授業のススメハイフレックス授業のススメ
ハイフレックス授業のススメ
 
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
 
卒研発表
卒研発表卒研発表
卒研発表
 
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試みGTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
GTTM分析器「PyGTTM」の試作,及び,ディープラーニングを用いた精度向上の試み
 
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
盛り上がり度に基づくループシーケンサにおけるユーザ適応の試み
 
卒論発表_oshita
卒論発表_oshita卒論発表_oshita
卒論発表_oshita
 
多人数演奏楽譜から連弾譜への自動編曲
多人数演奏楽譜から連弾譜への自動編曲多人数演奏楽譜から連弾譜への自動編曲
多人数演奏楽譜から連弾譜への自動編曲
 
振りのタイミングを評価するダンス練習システム
振りのタイミングを評価するダンス練習システム振りのタイミングを評価するダンス練習システム
振りのタイミングを評価するダンス練習システム
 
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
誰もが創作を通じて音楽を楽しめる世界を目指して(音学シンポジウム2021 招待講演)
 
音楽と情報科学
音楽と情報科学音楽と情報科学
音楽と情報科学
 
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
土浦日大高校向け模擬授業「音楽と情報科学」(日本大学文理学部)
 
音楽の知識表現:自動作編曲への応用
音楽の知識表現:自動作編曲への応用音楽の知識表現:自動作編曲への応用
音楽の知識表現:自動作編曲への応用
 
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
平成30年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?
 
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
平成29年度 日本大学文理学部オープンキャンパス 「人工知能ってどんな仕組み?」
 
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
バルセロナ在外研究報告「統計モデルによるメロディのモデル化とその自動作曲・自動編曲への応用」
 
音楽と情報科学
音楽と情報科学音楽と情報科学
音楽と情報科学
 
A Machine Learning Approach to Support Music Creation by Musically Untrained ...
A Machine Learning Approach to Support Music Creation by Musically Untrained ...A Machine Learning Approach to Support Music Creation by Musically Untrained ...
A Machine Learning Approach to Support Music Creation by Musically Untrained ...
 
Extracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
Extracting Melodic Contour Using Wavelet-based Multi-resolution AnalysisExtracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
Extracting Melodic Contour Using Wavelet-based Multi-resolution Analysis
 
Music Synchronizer with Runner's Pace for Supporting Steady Pace Jogging
Music Synchronizer with Runner's Pace for Supporting Steady Pace JoggingMusic Synchronizer with Runner's Pace for Supporting Steady Pace Jogging
Music Synchronizer with Runner's Pace for Supporting Steady Pace Jogging
 
音声認識の仕組みを知ろう
音声認識の仕組みを知ろう音声認識の仕組みを知ろう
音声認識の仕組みを知ろう
 

Recently uploaded

module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Mohammad Khajehpour
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxBhagirath Gogikar
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Joonhun Lee
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONrouseeyyy
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicinesherlingomez2
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 

Recently uploaded (20)

module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 

音楽の非専門家が演奏・創作を通じて音楽を楽しめる世界を目指して

  • 1. 音楽の非専門家が創作・演奏を通じて 音楽を楽しめる世界を目指して 北原 鉄朗 (日本大学文理学部) Twitter: @tetsurokitahara Feel free to take & distribute photos on Twitter etc! Towards the World in which Musical Novices Enjoy Music through Creation and Performance Tetsuro Kitahara (Nihon University)
  • 3. Introduction to myself ● Name: きたはら  てつろう 北原 鉄朗 (Tetsuro Kitahara) ● Born in Sapporo, Japan ● Living in Tokyo, Japan ● Age: 39 (born in 1979) ● Position: Associate professor at Nihon University ● Favorites: Music, Drinking alcohol, etc. ● Studied music at Yamaha Junior Music School
  • 4. My academic history ● 2002: BSc from Tokyo University of Science – Musical instrument recognition – (Virtually) supervised by Prof. Okuno and Dr. Goto ● 2007: PhD from Kyoto University – Musical instument recognition and its appl. to MIR – Supervised by Prof. Okuno and Dr. Goto ● 2007-2010: PostDoc at Kwansei Gakuin University – Project head: Prof. Katayose ● 2010-2014: Assistant professor at Nihon University ● 2014-present: Associate professor at Nihon Univ. – 2016-2017: Visiting reseacher at UPF (about 5 months)
  • 5. born here Tokyo Univ. of Sci. (Chiba branch) Nihon Univ. Kyoto Univ. Universitat Pompeu Fabra Kwansei Gakuin Univ. (super country-side)
  • 6. Kitahara Lab. (Sound Computing Lab.) Music-related topics Automatic music generation HCIs for assisting non-musicians’ composition C G E C Am A E C Music recommendation Karaoke enhancement Guitar transcription Bass analysis etc. Other topics Analysis of cuteness of female voices Human-agent interaction via sound etc.
  • 7. Spectrum of topics in music computing What to support Appreciation Performance Composition Whotosupport ExpertsNon-experts
  • 8. JamSketch: Improvisation support system with GA-based melody creation from user’s drawing Special thanks to Rafael Ramirez and Sergio Giraldo (Universitat Pompeu Fabra)
  • 9. Our goal Enable non-musicians to enjoy improvisation Why it’s difficult Players have to create melodies and play an instrument at the same time Technical issues Human Interface Non-musician-friendly, easy-to-input, intuitive & quickly inputable Melody generation Automatic melody generation based on the user’s abstract, incomplete input in real time
  • 10. Our idea: JamSketch Human Interface User draws a melodic outline on the pianoroll Music generation Once the melodic outline is drawn, a melody is immediately generated with a genetic algorithm
  • 11. System overview time During the playback of the accompaniment 0th bar 1st bar 2nd bar Draws a melodic outline at least one bar in advance Determines the rhythm of the melody User If the mouse cursor moves to the next bar or the mouse button is released, Determines the pitch of each note Determines expression parameters Plays the melody
  • 12. Determine the rhythm ● High gradient → short notes ● Low gradient → long notes Basic idea 3 Details of algorithm 0 Candidates 3 33 3 3 33 1 Tentative rhythm Threshold the gradient 2 Best-fit rhythm Search the closest one from 0 3
  • 13. Determine the pitches Basic idea closeness to the outline Search a sequence of notes satisfying both 3 C A F and musical likeliness using a genetic algorithm (GA) Details of formulation Chromosome N = (n0 , n1 , …, nL-1 ) (ni : note number) Fitness function F(N) = w0 sim(N) + w1 seq1 (N) + w2 seq2 (N) + w3 harm(N) + w4 ent(N) Similarity to outline Bigram P(ni | ni-1 ) Delta bigram P(ni -ni-1 | ni-1 -ni-2 ) P(notes | chord) Entropy
  • 14. Details of fitness function Similarity to outline sim(N) = - Σ(ni - y(ti ))2 Pitch bigram probability seq1 (N) = Σ log P(ni | ni-1 ) (y(ti ): pitch at time ti in outline) Delta (pitch-motion) bigram probability seq2 (N) = Σ log P(ni – ni-1 | ni-1 – ni-2 ) Conditional probability for given chords harm(N) = Σ log P(ni | ci , bi ) (ci : chord at time ti ) (bi : metric position at ti , {head, on-beat, off-beat})Entropy ent(N) = -(H(N) – Hmean – ε)2
  • 15. Determine expression parameters Use a model presented at [Giraldo & Ramirez 2016] C Feature extraction for each note k-NN, multilayer perceptron, ... onset deviation duration ratio energy ratio (Duration, onset, prev. duration, next duration, prev. interval, next interval, namour, chord, etc.)
  • 16. Implementation Java Groovy Octave MIDI Toolbox WekaJamSketch CrestMuse Toolkit Apache Commons Math Java Octave Estimation of expression parameters GA Feature extraction Main program Training data 53 melodies with the Tonality-type of Blues from Weimar Jazz Database
  • 17. Experiment Purpose Confirm the quality of generated melodies Method 1. Participants play improvisation with our system and a normal MIDI keyboard 2. The performances with our system and the keyboard are listed without distinction 3. Other participants evaluate them If there are no clear differences between evaluations for our system and the keyboard, our system can be considered to have human-like composition capability
  • 18. Participants & performances Participants Experience Use Play Impr. KB Syst. P1 Yes Yes Yes No P2 Yes No Yes Yes P3 Yes No No Yes P4 No No No Yes P5 Yes No Yes Yes P6 Yes Yes Yes No Performances ? ? ? ? ? ? ? ? STOP System System KB (P2) KB (P1) System KB (P5) System KB (P6) s2 s1 u2 u1 s4 u3 s3 u4
  • 19. Listening test ● We asked 12 researchers / students in musical fields to evaluate them on a scale of 0 to 10 ● Criteria – Overall quality – Pitch appropriateness – Rhythmic appropriateness – Melodic variation ● http://www.kthrlab.jp/proj/jamsketch/listen/ – Blues-likeness – Skillfulness – Human-likeness Please participate in the listening test!
  • 20. Result: Overall quality {u4} > {u1, u2} > {s1, s2, s3, s4} ≈ {u3}
  • 21. Result: Pitch appropriateness {u1, u2, u4} > {s1, s2, s3, s4} > {u3}
  • 22. Result: Rhythmic appropriateness {u2, u4} > {u1, u3} > {s1, s2, s3, s4}
  • 23. Result: Melodic variation {u4} >> {s1, s2, s4} ≈ {u1, u2, u3} > {s3} ≈ {u3}
  • 24. Result: Blues-likeness {u4} > {u2} > {s1, s2, s3, s4} ≈ {u1} >> {u3}
  • 25. Result: Skillfulness {u4} >> {u2} > {s1, s2, s3, s4} ≈ {u1, u3}
  • 26. Result: Human-likeness {u4} > {u1, u2} > {u3} >> {s1, s2, s3, s4}
  • 27. Discussions ● Our system is inferior to experienced players ● But it is superior (or close) to inexperienced players ● Human-likeness is lower than both-class players ● Possible reasons: – Out-of-scale pitches appear at the ends of phrases ● The system does not consider boundaries of phrases – Out-of-scale pitches appear at long notes ● Rhythm and pitches are independently determined – Rhythm is too monotonous ● Rhythm needs more variations ● We obtained positive opinions from participants
  • 28. On-going next plans ● Improvement of rhythm determination – Data-driven ● Input with eye tracking – Focusing on disabled people ● Input with smartphones – Approach 1: moving them up and down – Approach 2: capture the hand’s motion with the camera (collaboration with Shiramatsu Lab., NITech) ● Performance by multiple players (collaboration with Shiramatsu Lab., NITech) ● Incremental melody generation
  • 29. General discussions (just my personal opinions)
  • 30. What is the state-of-the-art of automatic music composition? ● Recent systems are capable to create musically appropriate pieces ● In particular, some systems imitate existing composers’ features well – e.g. DeepBach (F. Pachet) https://www.youtube.com/watch?v=QiBM7-5hA6o ● Systems using deep neural nets (CNN, RNN) are increasing ● Large-scale DBs are still lacking
  • 31. Are there better human interfaces for non-experts to express musical ideas? ● Melodic outlines are non-musician-friendly, easy-to-input, intuitive, and quickly inputable ● Basic concept: “Macro structure by the user, micro structure by the system” ● Other possibilities: – verbal expressions (impression words) – melodic examples (existing melodies) ● With melodic outlines, users cannot specify rhythmic features
  • 32. What is the best relationship between humans and systems? ● Relation of a producer and a composer? ● Ideally, systems should be ghostwriters? – Human ghostwriters can compose but cannot improvise according to another person’s instructions ● Truly original part should be created by humans ● When humans give musically low-quality ideas, how can systems suggest better ideas? ● What can systems do if the user’s final goal is to be able to compose music without any support?
  • 34. Conclusion ● To enable non-experts to create music, systems need to have – an intuitive user interface – a technique for generating music ● JamSketch: an improvisation support system with – melodic outlines – GA-based melody generator https://github.com/kthrlab/JamSketch/tree/jamsketch-0.1.0-lite Let’s try it! (on Windows or other Java-compatible platforms) Please participate in listening test! http://www.kthrlab.jp/proj/jamsketch/listen/