音源分離における音響モデリング(Acoustic modeling in audio source separation)Daichi Kitamura
北村大地, "音源分離における音響モデリング," 日本音響学会 サマーセミナー 招待講演, September 11th, 2017.
Daichi Kitamura, "Acoustic modeling in audio source separation," The Acoustical Society of Japan, Summer Seminar Invited Talk, September 11th, 2017.
ICASSP 2019音声&音響論文読み会(https://connpass.com/event/128527/)での発表資料です。
AASP (Audio and Acoustic Signal Processing) 分野の紹介と、ICASSP 2019での動向を紹介しています。#icassp2019jp
Presentation slide for AI seminar at Artificial Intelligence Research Center, The National Institute of Advanced Industrial Science and Technology, Japan.
URL (in Japanese): https://www.airc.aist.go.jp/seminar_detail/seminar_046.html
音源分離における音響モデリング(Acoustic modeling in audio source separation)Daichi Kitamura
北村大地, "音源分離における音響モデリング," 日本音響学会 サマーセミナー 招待講演, September 11th, 2017.
Daichi Kitamura, "Acoustic modeling in audio source separation," The Acoustical Society of Japan, Summer Seminar Invited Talk, September 11th, 2017.
ICASSP 2019音声&音響論文読み会(https://connpass.com/event/128527/)での発表資料です。
AASP (Audio and Acoustic Signal Processing) 分野の紹介と、ICASSP 2019での動向を紹介しています。#icassp2019jp
Presentation slide for AI seminar at Artificial Intelligence Research Center, The National Institute of Advanced Industrial Science and Technology, Japan.
URL (in Japanese): https://www.airc.aist.go.jp/seminar_detail/seminar_046.html
Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...Kitamura Laboratory
Shoya Kawaguchi and Daichi Kitamura,
"Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and loudness using deep neural networks,"
Proceedings of RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP 2023), pp. 225–228, Honolulu, USA, March 2023.
Heart rate estimation of car driver using radar sensors and blind source sepa...Kitamura Laboratory
Keito Murata, Daichi Kitamura, Ryo Saito, and Daichi Ueki,
"Heart rate estimation of car driver using radar sensors and blind source separation,"
Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2022), pp. 1157–1164, Chiang Mai, Thailand, November 2022.
DNN-based frequency-domain permutation solver for multichannel audio source s...Kitamura Laboratory
Fumiya Hasuike, Daichi Kitamura, and Rui Watanabe,"DNN-based frequency-domain permutation solver for multichannel audio source separation," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2022), pp. 872–877, Chiang Mai, Thailand, November 2022.
Linear multichannel blind source separation based on time-frequency mask obta...Kitamura Laboratory
Soichiro Oyabu, Daichi Kitamura, and Kohei Yatabe, "Linear multichannel blind source separation based on time-frequency mask obtained by harmonic/percussive sound separation," Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), pp. 201–205, Toronto, Canada, June 2021.
Prior distribution design for music bleeding-sound reduction based on nonnega...Kitamura Laboratory
Yusaku Mizobuchi, Daichi Kitamura, Tomohiko Nakamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo, "Prior distribution design for music bleeding-sound reduction based on nonnegative matrix factorization," Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2021), pp. 651–658, Tokyo, Japan, December 2021.
Blind audio source separation based on time-frequency structure modelsKitamura Laboratory
Daichi Kitamura, "Blind audio source separation based on time-frequency structure models," Invited Overview Session in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2021), Tokyo, Japan, December 2021.
6. 6
入力特徴量
• メル周波数ケプストラム係数(mel-frequency cepstral coefficient: MFCC)
–音の高さと音量を可能な限り除去した純粋な音色の特徴量
• ラウドネス
–振幅スペクトログラムの時間ごとの振幅の総和
MFCC
振幅スペクトログラム
振幅スペクトログラム ラウドネス
Time [s]
Frequency
[kHz]
Time [s]
Coefficient
Time [s] Time [s]
Frequency
[kHz]
Volume