Prediction of Time Varying Musical Mood Distributions Using
Kalman Filtering
Researched by
Erik M. Schmidt and Youngmoo E....
Objective

Enabling more robust emotion-based music recommendation considering how emotion in music varies over
time.
Keywords

Emotion recognition,
Audio features,
Regression,

Linear dynamical systems,
Kalman filtering
Emotion Recognition
Block diagram of modeling and predicting emotion in music through machine learning
Audio Feature Extraction
GROUND TRUTH DATA COLLECTION
Participants use a graphical interface in Moodswing game to indicate a dynamic position withi...
MoodSwings Lite Corpus
Developed a reduced dataset consisting of 15-second music clips from 240 songs, selected using the
...
Data Preprocessing
Time-varying emotion distribution regression results for three example 15-second music clips (markers b...
Experiment result
Experiment Analysis
The Kalman mixture provides the best result of any system, using only four clusters they achieve an av...
Conclusion
Analysis of state of musical emotion in terms of mathematical representation along with the help of A-V
repsent...
Thank You
Upcoming SlideShare
Loading in …5
×

0907008

253 views

Published on

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
253
On SlideShare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
1
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

0907008

  1. 1. Prediction of Time Varying Musical Mood Distributions Using Kalman Filtering Researched by Erik M. Schmidt and Youngmoo E. Kim Music and Entertainment Technology Laboratory (MET-lab) Presented by Sanjoy Dutta Roll: 0907008 Department of Computer Science & Engineering Khulna University of Engineering & Technology
  2. 2. Objective Enabling more robust emotion-based music recommendation considering how emotion in music varies over time.
  3. 3. Keywords Emotion recognition, Audio features, Regression, Linear dynamical systems, Kalman filtering
  4. 4. Emotion Recognition
  5. 5. Block diagram of modeling and predicting emotion in music through machine learning
  6. 6. Audio Feature Extraction
  7. 7. GROUND TRUTH DATA COLLECTION Participants use a graphical interface in Moodswing game to indicate a dynamic position within the A-V space to annotate five 30-second music clips. Each subject provides a check against the other, reducing the probability of nonsense labels.
  8. 8. MoodSwings Lite Corpus Developed a reduced dataset consisting of 15-second music clips from 240 songs, selected using the original label set, to approximate an even distribution across the four primary quadrants of the A-V space. These clips were subjected to intense focus within the game in order to form a corpus, referred to here as MoodSwings Lite, with significantly more labels per song clip, which is used in this analysis.
  9. 9. Data Preprocessing Time-varying emotion distribution regression results for three example 15-second music clips (markers become darker as time advances): Second-by-second labels per song (gray bullet), Standard deviation of the collected labels over 1-second intervals (red ellipse), and Standard deviation of the distribution projected from acoustic features in 1-second intervals (blue ellipse). Heavy amount of noise in the covariance ellipses Apply preprocessing using a Kalman/Rauch-Tung-Striebel (RTS) smoother.
  10. 10. Experiment result
  11. 11. Experiment Analysis The Kalman mixture provides the best result of any system, using only four clusters they achieve an average KL of 2.881, which is a significant improvement MLR system at 4.576 and the MLR mixture at 3.179. In terms of mean error, however, Kalman and MLR mixtures produce nearly identical results, with normalized distances of about 0.109. not over both the
  12. 12. Conclusion Analysis of state of musical emotion in terms of mathematical representation along with the help of A-V repsentation. They proposed that they can develop the most accurate representation of their ground truth using a distribution. Using a Kalman filtering approach, have been able to form robust estimates of their distribution and how it evolves over time.
  13. 13. Thank You

×