3. Waveform and magnitude Fourier transform of a tone C4 (261.6 Hz) played by different instruments (see also
Figure 1.23). (a) Piano. (b) Trumpet. (c) Violin. (d) Flute.
https://www.researchgate.net/publication/290440767_2015_Mueller_FundamentalsMusicProcessing_Springer_Section2-1_SamplePages
악기 특성
7. Wavenet
가정1. 임의의 샘플 xt는 앞선 샘플들 x1, …, xt-1에 의해 결정된다.
RNN을 사용하자!
단점 : training 시간이 엄청나게 김, vanishing gradient problem
사운드 데이터는 1초에 통상 16,000-48,000 samples
https://medium.com/@florijan.stamenkovic_99541/
rnn-language-modelling-with-pytorch-packed-batching-and-tied-weights-9d8952db35a9
11. Domain Adaptation
Training Set (labeled) Test Set (unlabeled)
1. Training Set과 Test Set이 상이할 경우 성능에 문제가 생긴다.
2. Latent space에서 두 set간의 distribution을 일치시키는 방향으로 해결!
12. Training Set 파랑색은 잘 분류된데 반해
Test Set 빨강색은 제대로 분류되지 못함
Latent space에서 Test Set의 distribution을
Training Set에 맞춤
Domain Adaptation
https://github.com/pumpikano/tf-dann
17. References
[1] Noam Mor, Lior Wolf, Adam Polyak, Yaniv Taigman : A Universal Music Translation Network
[2] Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François
Laviolette, Mario Marchand, Victor Lempitsky : Domain-Adversarial Training of Neural Networks
[3] Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves,
Nal Kalchbrenner, Andrew Senior, Koray Kavukcuoglu : WaveNet: A Generative Model for Raw
Audio