This document summarizes Chin-Yun Yu's presentation on developing a Non-negative Matrix Factorization (NMF) library in PyTorch. It introduces NMF and PyTorch, describes how Yu built a PyTorch implementation called torchnmf with features such as convolutional variants, and discusses potential applications of combining NMF with deep learning. The document also gives an overview of Yu's background and the presentation outline.
2. About me
● Background
○ Bachelor of CS, NCTU
○ Research Assistant in Music and Culture Technology Lab, IIS, AS
○ Engineer in Vive R&D/Multi Media, HTC (current position)
○ MIR research/open source project enthusiast
○ Music producer
○ Guitarist of Catalyst (Taipei)
3. Outline
● What is NMF?
● What is PyTorch?
● How I developed PyTorch NMF
● What NMF can do with PyTorch (particularly in the era of deep learning)
4. What is NMF?
● Non-negative Matrix Factorization
● V ≈ WH, where V, W, H ≥ 0
● Useful to analyze audio spectrogram
5. Parameter update
● Févotte, Cédric, and Jérôme Idier. "Algorithms for nonnegative matrix
factorization with the β-divergence." Neural computation 23.9 (2011):
2421-2456.
● Multiplicative update rules
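The multiplicative updates from Févotte & Idier keep the factors non-negative by construction, because every term in the update ratio is non-negative. A minimal sketch of the Euclidean special case (β = 2); all shapes and variable names here are illustrative, not the library's API:

```python
import torch

torch.manual_seed(0)

# Toy non-negative data matrix V (e.g. a magnitude spectrogram), modeled as V ~ W @ H
n_freq, n_time, K = 64, 100, 4        # frequency bins, time frames, components
V = torch.rand(n_freq, n_time)

W = torch.rand(n_freq, K)             # spectral templates
H = torch.rand(K, n_time)             # temporal activations

err_init = torch.dist(V, W @ H)

# Multiplicative updates for the Euclidean case (beta = 2); non-negativity
# is preserved because every factor in each update is non-negative.
for _ in range(200):
    H *= (W.T @ V) / (W.T @ W @ H).clamp_min(1e-8)
    W *= (V @ H.T) / (W @ H @ H.T).clamp_min(1e-8)

err_final = torch.dist(V, W @ H)
```

The β-divergence family in the cited paper generalizes this ratio form to KL (β = 1) and Itakura-Saito (β = 0) losses with the same multiplicative structure.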
6. Drums Transcription
● Wu, Chih-Wei, et al. "A review of automatic drum transcription." IEEE/ACM
Transactions on Audio, Speech, and Language Processing 26.9 (2018):
1457-1483.
7. Extend to convolutional case
● Smaragdis, Paris. "Non-negative matrix factor deconvolution; extraction of
multiple sound sources from monophonic inputs." International Conference on
Independent Component Analysis and Signal Separation. Springer, Berlin,
Heidelberg, 2004.
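Smaragdis's convolutive model (NMFD) replaces the product WH with a sum of time-shifted templates. A sketch of just the reconstruction step using `torch.nn.functional.conv1d`; shapes are chosen for illustration:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
n_freq, n_time, K, T = 64, 100, 3, 8    # bins, frames, components, template length

W = torch.rand(n_freq, K, T)            # a short time-frequency template per component
H = torch.rand(K, n_time)               # onset activations

# NMFD model: V_hat[f, t] = sum_k sum_tau W[f, k, tau] * H[k, t - tau].
# conv1d computes cross-correlation, so flip the templates along time and
# left-pad the activations by T - 1 frames to make the sum causal.
H_pad = F.pad(H.unsqueeze(0), (T - 1, 0))        # (1, K, n_time + T - 1)
V_hat = F.conv1d(H_pad, W.flip(-1)).squeeze(0)   # (n_freq, n_time)
```

Each column of V_hat is then a superposition of templates triggered at the onsets encoded in H, which is why NMFD suits drum transcription.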
8. Drums Transcription (with NMFD)
● Wu, Chih-Wei, et al. "A review of automatic drum transcription." IEEE/ACM
Transactions on Audio, Speech, and Language Processing 26.9 (2018):
1457-1483.
9. Drums Transcription (with NMFD)
● Dittmar, Christian. "Source Separation and Restoration of Drum Sounds in
Music Recordings." (2018).
10. Reverse Engineering the Amen Break
● Dittmar, Christian. "Source Separation and Restoration of Drum Sounds in
Music Recordings." (2018).
11. Music Structure Analyze
● López-Serrano, Patricio, et al. "NMF TOOLBOX: MUSIC PROCESSING
APPLICATIONS OF NONNEGATIVE MATRIX FACTORIZATION." (2019).
12. 2D deconvolutional case
● Schmidt, Mikkel N., and Morten Mørup. "Nonnegative matrix factor 2-D
deconvolution for blind single channel source separation." International
Conference on Independent Component Analysis and Signal Separation.
Springer, Berlin, Heidelberg, 2006.
16. Let it Bee - replacing sound sources
● Driedger, Jonathan, Thomas Prätzlich, and Meinard Müller. "Let it Bee -
Towards NMF-Inspired Audio Mosaicing." ISMIR. 2015.
https://www.audiolabs-erlangen.de/resources/MIR/2015-ISMIR-LetItBee
17. PyTorch
● One of the most well-known deep learning frameworks
● Launched in 2016 by Facebook
● Features
○ Easy-to-use Python API
○ Dynamic computation graphs
○ GPU acceleration
○ Automatic gradient calculation
○ Easy prototyping
● Has been quickly adopted by researchers across many fields
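The dynamic-graph and autograd features above can be seen in a few lines: the graph is built as ordinary Python executes, and reverse-mode differentiation is one call.

```python
import torch

# The computation graph is recorded on the fly as operations run.
x = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
loss = (x ** 2).sum()

# Reverse-mode automatic differentiation fills in x.grad.
loss.backward()
print(x.grad)   # d(sum x^2)/dx = 2x -> tensor([2., 4., 6.])
```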
24. Regular NMF vs. PyTorch NMF
➔ Weight update in regular NMF
a. Compute the positive component
b. Compute the negative component
c. Derive the multiplicative update coefficients
d. Multiply
➔ Weight update in PyTorch NMF
a. Compute the loss value
b. Derive gradients via backpropagation
c. Compute the positive component
d. Compute the negative component by subtraction
e. Derive the multiplicative update coefficients
f. Multiply
➔ Cuts the code almost in half -> easier to maintain
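The autograd-based update exploits the fact that for β-divergence losses the gradient splits as grad = pos − neg with both parts non-negative, so only the positive part needs hand-coding. A sketch of this idea for the Euclidean loss; this is an illustration of the technique, not torchnmf's actual code:

```python
import torch

torch.manual_seed(0)
V = torch.rand(64, 100)                       # non-negative target matrix
W = torch.rand(64, 4, requires_grad=True)     # templates
H = torch.rand(4, 100, requires_grad=True)    # activations

def mu_update(param, pos):
    # grad = pos - neg with pos, neg >= 0, so the negative part falls out
    # by subtraction and the multiplicative update is param * neg / pos.
    neg = pos - param.grad
    with torch.no_grad():
        param *= neg / pos.clamp_min(1e-8)
    param.grad = None

losses = []
for _ in range(200):
    WH = W @ H
    loss = ((V - WH) ** 2).sum()              # Euclidean loss (beta = 2)
    losses.append(loss.item())
    loss.backward()                           # autograd derives the full gradient
    with torch.no_grad():
        pos_W = 2 * WH @ H.t()                # positive part of dL/dW
        pos_H = 2 * W.t() @ WH                # positive part of dL/dH
    mu_update(W, pos_W)
    mu_update(H, pos_H)
```

Only the loss and the positive term are written per divergence; the negative term comes from backpropagation for free, which is where the line-count saving comes from.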
25. Extended to convolutional cases
● Uses torch.nn.functional.conv1d/2d/3d and inherits from the NMF base class
● Class hierarchy: NMF -> NMFD, NMF2D, NMF3D
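The inheritance idea can be sketched as follows: the base class owns the factors and the update machinery, and each convolutional variant only overrides the reconstruction. These class bodies are a hypothetical illustration, not torchnmf's actual definitions:

```python
import torch
import torch.nn.functional as F

class NMF(torch.nn.Module):
    """Plain NMF: V_hat = W @ H."""
    def __init__(self, W, H):
        super().__init__()
        self.W = torch.nn.Parameter(W)
        self.H = torch.nn.Parameter(H)

    def reconstruct(self):
        return self.W @ self.H

class NMFD(NMF):
    """Convolutive NMF: inherit everything, override only the reconstruction."""
    def reconstruct(self):
        # W: (freq, K, T) templates, H: (K, time) activations
        T = self.W.shape[-1]
        H = F.pad(self.H.unsqueeze(0), (T - 1, 0))       # causal left-padding
        return F.conv1d(H, self.W.flip(-1)).squeeze(0)   # (freq, time)
```

NMF2D and NMF3D would follow the same pattern with conv2d and conv3d, which is why the 3-D case comes almost for free.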
34. What if we combine DL with traditional methods?
● Ravanelli, Mirco, and Yoshua Bengio. "Speaker recognition from raw
waveform with SincNet." 2018 IEEE Spoken Language Technology
Workshop (SLT). IEEE, 2018.
● Uses band-passed signals as input features
● Fewer parameters to learn, more robust, faster convergence, lower error
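SincNet's parameter saving comes from parameterizing each first-layer filter by just two cutoff frequencies instead of learning every tap. A sketch of such a windowed sinc band-pass kernel; the function name and defaults are illustrative:

```python
import torch

def sinc_bandpass(f_low, f_high, length=101, sr=16000):
    # Ideal band-pass FIR kernel built as the difference of two windowed
    # low-pass sinc filters. SincNet learns only f_low and f_high per
    # filter, so a filterbank needs far fewer parameters than a free
    # convolution layer with `length` taps per filter.
    t = torch.arange(length) - (length - 1) / 2

    def lowpass(fc):
        return 2 * fc / sr * torch.sinc(2 * fc / sr * t)

    h = lowpass(f_high) - lowpass(f_low)
    return h * torch.hamming_window(length, periodic=False)

h = sinc_bandpass(300.0, 3000.0)   # a 300-3000 Hz band-pass kernel
```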
36. Conclusion of torchnmf
● Advantages compared to other implementations
○ Easier to maintain
○ Better support for convolutional cases (especially 3D)
○ Can run on the GPU for faster convergence
● News
○ Features such as batching and sparsity control are on the way!
● Next steps
○ Full autograd support so it can be integrated into other DL models
○ Documentation
○ Upload to PyPI
● Feel free to create PRs or issues!
name = "torchnmf"
__version__ = '0.2'
__maintainer__ = 'Chin-Yun Yu'
__email__ = 'ya70201@gmail.com'