A Universal Music
Translation Network
https://arxiv.org/abs/1805.07848

Noam Mor, Lior Wolf, Adam Polyak, Yaniv Taigman

박수철
Sample video: https://www.youtube.com/watch?v=vdxCqNWTpUs
Waveform and magnitude Fourier transform of a tone C4 (261.6 Hz) played by different instruments (see also
Figure 1.23). (a) Piano. (b) Trumpet. (c) Violin. (d) Flute.
https://www.researchgate.net/publication/290440767_2015_Mueller_FundamentalsMusicProcessing_Springer_Section2-1_SamplePages
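Instrument characteristics: harmonics, envelope
Shared attributes: pitch (C4), velocity (0.5)
Individual attributes: instrument characteristics
Shared attributes + individual attributes → waveform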
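KEY IDEAS
Encoder → Z → Decoder
1. Build an encoder that extracts only the shared attributes and embeds them in a latent space Z.
2. Build a decoder that reproduces each individual instrument's characteristics from the information embedded in Z.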
How?
Embed the shared attributes with a Domain-Adversarial Neural Network!
Build the encoder and decoder with WaveNet!
WaveNet
Assumption 1. Any sample x_t is determined by the preceding samples x_1, …, x_{t-1}.
Let's use an RNN!
Drawbacks: extremely long training time, vanishing gradient problem
Audio data typically has 16,000-48,000 samples per second
https://medium.com/@florijan.stamenkovic_99541/rnn-language-modelling-with-pytorch-packed-batching-and-tied-weights-9d8952db35a9
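The autoregressive assumption above can be written as a factorization of the joint probability of a waveform, following the form used in the WaveNet paper [3]. The symbol T (the number of samples in the clip) is introduced here for illustration:

```latex
\[
p(x) \;=\; \prod_{t=1}^{T} p\bigl(x_t \mid x_1, \dots, x_{t-1}\bigr)
\]
```

At 16,000 samples per second, a one-second clip already gives T = 16,000 factors, which is why a model that must step through the sequence one sample at a time during training becomes so slow.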
WaveNet
Assumption 2. Samples before a certain point in time do not affect x_t.
Let's use a causal CNN!
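A minimal sketch of the causal-convolution idea, in plain Python: each output sample depends only on the current and earlier input samples, and stacking layers with growing dilation widens the window of past samples that can influence x_t. The function names, the kernel size of 2, and the dilation schedule 1, 2, 4, 8 are illustrative assumptions, not values taken from the slides.

```python
def causal_conv1d(x, weights, dilation=1):
    """y[t] = sum_k weights[k] * x[t - k*dilation]; samples before t=0 count as 0."""
    y = []
    for t in range(len(x)):
        acc = 0.0
        for k, w in enumerate(weights):
            idx = t - k * dilation
            if idx >= 0:  # never look at future samples, never index before the start
                acc += w * x[idx]
        y.append(acc)
    return y


def dilated_stack(x, weights, dilations):
    """Apply one causal convolution per dilation, feeding each output to the next layer."""
    out = list(x)
    for d in dilations:
        out = causal_conv1d(out, weights, dilation=d)
    return out


def receptive_field(kernel_size, dilations):
    """Number of input samples that can influence a single output sample."""
    return 1 + (kernel_size - 1) * sum(dilations)


# Hypothetical settings for illustration only.
dilations = [1, 2, 4, 8]
weights = [1.0, 1.0]

impulse = [0.0] * 32
impulse[0] = 1.0  # a single non-zero sample at t = 0

response = dilated_stack(impulse, weights, dilations)
rf = receptive_field(len(weights), dilations)
print(rf, sum(1 for v in response if v != 0.0))
```

The impulse spreads over exactly as many output samples as the receptive field, so samples older than that window cannot affect x_t, which is the content of Assumption 2.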
WaveNet
Source (own implementation)
Domain Adaptation
Training Set (labeled) Test Set (unlabeled)
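1. Performance suffers when the training set and the test set differ.
2. Solve this by aligning the distributions of the two sets in the latent space!
The training set (blue) is classified well, whereas the test set (red) is not classified correctly.
In the latent space, the test-set distribution is matched to that of the training set.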
Domain Adaptation
https://github.com/pumpikano/tf-dann
Domain-Adversarial Training of Neural Networks
https://arxiv.org/abs/1505.07818
https://github.com/pumpikano/tf-dann
Gradient Reversal Layer
Domain-Adversarial Training of Neural Networks
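A minimal sketch of what a gradient reversal layer does, written in plain Python rather than a deep learning framework: the forward pass is the identity, and the backward pass multiplies the incoming gradient by -lambda, so the encoder is pushed to make the domain classifier's job harder. The class name, the `lam` parameter, and the example numbers are hypothetical; in practice this would typically be a custom autograd operation.

```python
class GradientReversal:
    """Identity in the forward pass; scales gradients by -lam in the backward pass."""

    def __init__(self, lam=1.0):
        self.lam = lam  # trade-off weight for the adversarial signal (assumed name)

    def forward(self, features):
        # Latent features pass through unchanged on the way to the domain classifier.
        return list(features)

    def backward(self, grad_output):
        # The gradient coming back from the domain classifier is flipped and scaled
        # before it reaches the encoder.
        return [-self.lam * g for g in grad_output]


grl = GradientReversal(lam=0.5)

z = [0.2, -1.0, 3.0]                            # latent features from the encoder
to_classifier = grl.forward(z)                  # same values as z
grad_from_domain_classifier = [1.0, -2.0, 4.0]  # made-up gradient for illustration
grad_to_encoder = grl.backward(grad_from_domain_classifier)
print(to_classifier, grad_to_encoder)
```

Because only the sign of the gradient changes, the domain classifier still learns to tell domains apart while the encoder learns features that hide the domain.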
Overall architecture
Source (piano) → Latent → Target (violin)
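The overall architecture can be summarized by a pair of training objectives. The following is a simplified sketch in the spirit of [1] and [2], not a verbatim reproduction: E is the shared encoder, D^j the decoder for instrument domain j, C the domain classifier on the latent code, s^j a training sample from domain j, L a classification-style loss, and λ a trade-off weight. The data augmentation used in [1] is omitted here, and the notation is an assumption for illustration.

```latex
% Encoder and decoders: reconstruct the input while confusing the domain classifier
\[
\min_{E,\,D^1,\dots,D^k}\;
  \sum_{j}\sum_{s^j} \mathcal{L}\bigl(D^j(E(s^j)),\, s^j\bigr)
  \;-\; \lambda \sum_{j}\sum_{s^j} \mathcal{L}\bigl(C(E(s^j)),\, j\bigr)
\]

% Domain classifier: predict which domain a latent code came from
\[
\min_{C}\; \sum_{j}\sum_{s^j} \mathcal{L}\bigl(C(E(s^j)),\, j\bigr)
\]
```

Translation then amounts to encoding a source sample (piano) with E and decoding the latent code with the target domain's decoder (violin).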
Results (own implementation)
References
[1] Noam Mor, Lior Wolf, Adam Polyak, Yaniv Taigman: A Universal Music Translation Network
[2] Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, Victor Lempitsky: Domain-Adversarial Training of Neural Networks
[3] Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, Koray Kavukcuoglu: WaveNet: A Generative Model for Raw Audio
