Music Theory-Inspired Acoustic Representation for Speech Emotion Recognition. Xingfeng Li 0001, Xiaohan Shi, Desheng Hu, Yongwei Li, Qingchen Zhang, Zhengxia Wang, Masashi Unoki, Masato Akagi
IEEE/ACM Transactions on Audio, Speech and Language Processing, 31, 2534-2547, 2023
A Discriminative Feature Representation Method Based on Cascaded Attention Network With Adversarial Strategy for Speech Emotion Recognition. Yang Liu, Haoqin Sun, Wenbo Guan, Yuqi Xia, Yongwei Li, Masashi Unoki, Zhen Zhao 0006
IEEE/ACM Transactions on Audio, Speech and Language Processing, 31, 1063-1074, 2023
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection Kai Li, Sheng Li, Xugang Lu, Masato Akagi, Meng Liu, Lin Zhang, Chang Zeng, Longbiao Wang, Jianwu Dang, Masashi Unoki
Interspeech 2022, 664-668, 2022
ISO/TC 43・ISO/TC 43/SC 1・ISO/TC 43/SC 2・ISO/TC 43/SC 3総会――音響に関する国際規格の審議状況:2021パリ会議(オンライン開催)―― 鈴木 陽一, 倉片 憲治, 今泉 博之, 佐藤 洋, 赤松 友成, 山崎 隆志, 藤坂 洋一, 内田 匠, 鵜木 祐史, 桑野 園子, 山田 一郎, 大島 俊也, 高橋 幸雄, 下田 康平, 白橋 良宏, 杉江 聡, 小林 知尋, 永幡 幸司, 森長 誠, 白木 秀児, 平光 厚雄, 古賀 貴士, 平川 侑, 澤田 浩一
日本音響学会誌, 78, 4, 203-208, 2022
Enhancement of speech intelligibility under noisy reverberant conditions based on modulation spectrum concept
Thuan Van Ngo, Tuan Vu Ho, Masashi Unoki, Rieko Kubo, Masato Akagi
2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings, 753-758, 2020
How the temporal amplitude envelope of speech contributes to urgency perception
Masashi Unoki, Miho Kawamura, Maori Kobayashi, Shunsuke Kidani, Masato Akagi
Proceedings of 23rd International Congress on Acoustics (ICA 2019), 1739-1744, 2019
Method of Estimating Direction of Arrival of Sound Source for Monaural Hearing Based on Temporal Modulation Perception. Nguyen Khanh Bui, Daisuke Morikawa, Masashi Unoki
2018 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2018, Calgary, AB, Canada, April 15-20, 2018, 5014-5018, 2018
Speech Watermarking Based on Robust Principal Component Analysis and Formant Manipulations. Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki
2018 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2018, Calgary, AB, Canada, April 15-20, 2018, 2082-2086, 2018
Speech Emotion Recognition Using MPCRNN based on Gammatone auditory Filterbank
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
APSIPA2017, -, 2017
Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank. Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 1750-1755, 2017
Study on method for protecting speech privacy by actively controlling speech transmission index in simulated room. Masashi Unoki, Yuta Kashihara, Maori Kobayashi, Masato Akagi
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 1199-1204, 2017
F0 estimation using empirical mode decomposition and complex cepstrum analysis in reverberant environments. Surasak Boonkla, Masashi Unoki, Chai Wutiwiwatchai, Stanislav, S. Makhanov
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 980-986, 2017
Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification. Jessada Karnjana, Kasorn Galajit, Pakinee Aimmanee, Chai Wutiwiwatchai, Masashi Unoki
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 193-202, 2017
Preliminary Study on Blind Estimation of Room Acoustic Parameters in Noisy Reverberant Environments
Unoki, M, Morita, S, Miyazaki, A, Akagi, M
Proc. WESPAC2015, Singapore, 428-435-, 2015
Watermarking of speech signals based on formant enhancement
Shengbei Wang, Masashi Unoki
European Signal Processing Conference, 1257-1261, 2014
A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model Phung, T. N, Unoki, M, Akagi, M
Journal of Signal Processing, 16, 5, 5, 409-417-417, 2012
Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals
Yanao, Y, Miyauchi, R, Unoki, M, Akagi, M
Proc. NCSP2011, 231-234-, 2011
A study on the IMTF-based filtering on the modulation spectrum of reverberant signal
Shota Morita, Masashi Unoki, Masato Akagi
Journal of Signal Processing, 14, 4, 269-272, 2010
A study on the MTF-based inverse filtering for the modulation spectrum of reverberant speech
Morita, S, Unoki, M, Akagi, M
Proc. NCSP10, -, 2010
An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech
Kinugasa, K, Unoki, M, Akagi, M
Proc. NCSP'09, 105-108-, 2009
Effects from spatial cues on detectability of alarm signals in car environments
N. Kuroda, J. Li, Y. Iwaya, M. Unoki, M. Akagi
Proc. NCSP’09, pp. 45-48-, 2009
A psychoacoustically-motivated conceptual model for automatic speech recognition
Haniu, A, Unoki, M, Akagi, M
Proc. WESPAC2009, -, 2009
TEMPORAL CONTRAST NORMALIZATION AND EDGE-PRESERVED SMOOTHING ON TEMPORAL MODULATION STRUCTURE FOR ROBUST SPEECH RECOGNITION X. Lu, S. Matsuda, M. Unoki, T. Shimizu, S. Nakamura
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 4573-4576, 2009
MTF-based power envelope restoration in noisy reverberant environments. Masashi Unoki, Yutaka Yamasaki, Masato Akagi
17th European Signal Processing Conference, EUSIPCO 2009, Glasgow, Scotland, UK, August 24-28, 2009, 228-232, 2009
The Construction of Large-scale Bone-conducted and Air-conducted Speech Databases for Speech Intelligibility Tests
Vu, T. T, Unoki, M, Akagi, M
Proc. Oriental COCOSDA2007, 88-91-, 2007
Vocal conversion from speaking voice to singing voice using STRAIGHT. Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi
INTERSPEECH 2007, 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007, 4005-4006, 2007
A study on a speech recognition method based on the selective sound segregation in noisy environment
Haniu, A, Unoki, M, Akagi, M
Proc. NCSP05, 403-406-, 2005
A model for selective segregation of a target instrument sound from the mixed sound of various instruments. Masashi Unoki, Masaaki Kubo, Atsushi Haniu, Masato Akagi
INTERSPEECH 2005 - Eurospeech, 9th European Conference on Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005, 2097-2100, 2005
Temporal decomposition of speech and its application to speech coding and modification
Akagi, M, Nguyen, P. C, Saitou, T, Tsuji, N, Unoki, M
Proc. KEST2004, 280-288-, 2004
Analysis of acoustic features affecting "singing-ness" and its application to singing-voice synthesis from speaking-voice. Takeshi Saitou, Naoya Tsuji, Masashi Unoki, Masato Akagi
INTERSPEECH 2004 - ICSLP, 8th International Conference on Spoken Language Processing, Jeju Island, Korea, October 4-8, 2004, 1929-1932, 2004
A speech dereverberation method based on the MTF concept using adaptive time-frequency divisions. Masashi Unoki, Masato Toi, Masato Akagi
2004 12th European Signal Processing Conference, Vienna, Austria, September 6-10, 2004, 34, 5, 1689-1692, 2004
Development of the F0 control method for singing-voices synthesis
Saitou, T, Unoki, M, Akagi, M
Proc. SP2004, 491-49-, 2003
A method based on the MTF concept for dereverberating the power envelope from the reverberant signal M Unoki, M Furukawa, K Sakata, M Akagi
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, vol. Ipp. 840-943, 888-891, 2003
A method for recovering the power envelope from reverberant speech
Unoki, M, Furukawa, M, Akagi, M
Forum Acousticum Sevilla 2002, SPA-Gen-002-, 2002
A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency. Yuichi Ishimoto, Masashi Unoki, Masato Akagi
EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001, 2439-2442, 2001
Eurospeech99, IEEE MMSP99 会議報告 中村 哲, 大川 茂樹, 伊藤 彰則, 田本 真詞, 水野 秀之, 鵜木 祐史, 徳田 恵一, 鏑木 時彦, 畑岡 信夫
情報処理学会研究報告. SLP, 音声言語情報処理, 28, 91, 21-28, 1999
Signal extraction from noisy signal based on auditory scene analysis. Masashi Unoki, Masato Akagi
The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998, 98, 1-29, 1998