TOP  >  Personal Info.  >  Published Papers
AKAGI, Masato Professor
School of Information Science,Human Life Design Area

Published Papers

186 items
Unsupervised Singing Voice Separation Based on Robust Principal Component Analysis Exploiting Rank-1 Constraint
Feng Li and Masato Akagi
Proc. EUSIPCO2018, Rome, Italy, 1934-1938-, 2018/09/06
A Three-Layer Emotion Perception Model for Valence and Arousal-Based Detection from Multilingual Speech
Xingfeng, Li and Masato Akagi
Proc. InterSpeech2018, Hyderabad, India, 3643-3647-, 2018/09/06
Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space
Yawen Xue, Yasuhiro Hamada, and Masato Akagi
Speech Communication, 102, 54-67-, 2018/09/01
Contributions of the glottal source and vocal tract cues to emotional vowel perception in the valence-arousal space
Yongwei Li, Junfeng Li, and Masato Akagi
J. Acoust. Soc. Am., 144, 2, 908-916-, 2018/08/01
Non-parallel Dictionary-based Voice Conversion using Variational Autoencoder with Modulation Spectrum-constrained Training
Ho-Tuan Vu and Akagi Masato
Journal of Signal Processing, 22, 4, 189-192-, 2018/08/01
Auditory-Inspired End-to-End Speech Emotion Recognition using 3D Convolutional Recurrent Neural Networks based on Spectral Temporal Representation
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, and Masato Akagi
Proc. ICME2018, San Diego, USA, -, 2018/07/26
Speech Emotion Recognition Using MPCRNN based on Gammatone auditory Filterbank
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, and Masato Akagi
APSIPA2017, -, 2017/12/15
Study on Method for Protecting Speech Privacy by Actively Controlling Speech Transmission Index in Simulated Room
Masashi Unoki, Yuta Kashihara, Maori Kobayashi, and Masato Akagi
APSIPA2017, -, 2017/12/14
Feature Selection Method for Real-time Speech Emotion Recognition
Reda Elbarougy and Masato Akagi
O-COCOSDA2017, 86-91-, 2017/11/01
Method of Estimating Signal-to-Noise Ratio Based on Optimal Design for Sub-band Voice Activity Detection
Shota Morita, Xugang Lu, Masashi Unoki and Masato Akagi
Journal of Information Hiding and Multimedia Signal Processing, International, 8, 6, 1446-1459-, 2017/11/01
Method of Blindly Estimating Speech Transmission Index in Noisy Reverberant Environments
Masashi Unoki, Akikazu Miyazaki, Shota Morita, and Masato Akagi
Journal of Information Hiding and Multimedia Signal Processing, International, 8, 6, 1430-1445-, 2017/11/01
Commonalities of glottal sources and vocal tract shapes among speakers in emotional speech
Li, Y., Sakakibara, K-I., Morikawa, D., and Akagi, M.
ISSP2017, -, 2017/10/17
Acoustical analyses of tendencies of intelligibility in Lombard speech with different background noise levels
Ngo, T. V., Kubo, R., Morikawa, D., and Akagi, M.
Journal of Signal Processing, 21, 4, 171-174-, 2017/07/01
Voice Conversion to Emotional Speech based on Three-layered Model in Dimensional Approach and Parameterization of Dynamic Features in Prosody
Xue, Y., Hamada, Y., and Akagi, M.
Proc. APSIPA2016, Cheju, Korea, -, 2016/12/15
Quality Improvement of Vietnamese HMM-Based Speech Synthesis System Based on Decomposition of Naturalness and Intelligibility Using Non-negative Matrix Factorization
Dinh, A. T., Phan, T. S., and Akagi, M.
International Conference on Advances in Information and Communication Technology 2016, Thai Nguyen, Vietnam, 490-499-, 2016/12/12
Voice conversion system to emotional speech in multiple languages based on three-layered model for dimensional space
Xue, Y., Hamada, Y., Elbarougy, R., and Akagi, M.
O-COCOSDA2016, Bali, Indonesia, 122-127-, 2016/10/27
Quality Improvement of HMM-based Synthesized Speech Based on Decomposition of Naturalness and Intelligibility using Non-Negative Matrix Factorization
Dinh, A. T. and Akagi, M.
O-COCOSDA2016, Bali, Indonesia, 62-67-, 2016/10/26
Optimizing Fuzzy Inference Systems for Improving Speech Emotion Recognition
Elbarougy, R. and Akagi, M.
The 2nd International Conference on Advanced Intelligent Systems and Informatics (AISI2016), Cairo, Egypt, 85-95-, 2016/10/24
Multilingual Speech Emotion Recognition System Based on a Three-Layer Model
Li, X. and Akagi, M.
Proc. InterSpeech2016, San Francisco, 3608-3612-, 2016/09/12
Effects of speaker's and listener's acoustic environments on speech intelligibility and annoyance
Kubo, R, Morikawa, D., and Akagi, M.
Proc. Inter-Noise2016, Hamburg, Germany, 171-176-, 2016/08/22
Study on quality improvement of HMM-based synthesized voices using asymmetric bilinear model
Dinh, T. A, Morikawa, D., and Akagi, M.
Journal of Signal Processing, 20, 4, 205-208-, 2016/07/01
A study on applying target prediction model to parameterize power envelope of emotional speech
Xue, Y. and Akagi, M.
Proc. NCSP2016, Honolulu, HW, USA, 157-160-, 2016/03/07
Automatic Speech Emotion Recognition in Chinese Using a Three-layered Model in Dimensional Approach
Li, X. and Akagi, M.
Proc. NCSP2016, Honolulu, HW, USA, 17-20-, 2016/03/07
A study on quality improvement of HMM-based synthesized voices using asymmetric bilinear model
Dinh, T. A, Morikawa, D., and Akagi, M.
Proc. NCSP2016, Honolulu, HW, USA, 13-16-, 2016/03/07
Preliminary Study on Blind Estimation of Room Acoustic Parameters in Noisy Reverberant Environments
Unoki, M., Morita, S., Miyazaki, A., and Akagi, M.
Proc. WESPAC2015, Singapore, 428-435-, 2015/12/07
Toward Improving Estimation Accuracy of Emotion Dimensions in Bilingual Scenario Based on Three-layered Model
Li, X. and Akagi, M.
Proc. O-COCOSDA2015, Shanghai, 21-26-, 2015/10/28
Dependence on age of interference with phoneme perception by first- and second-language speech maskers
Kubo, R., Akagi, M., and Akahane-Yamada, R.
Acoustical Science and Technology, 36, 5, 397-407, 36, 5, 397 - 407-, 2015/09/01
A study on perception of emotional states in multiple languages on Valence-Activation approach
Xiao Han, Reda Elbarougy, Masato Akagi, Junfeng Li, Thi Duyen Ngo, and The Duy Bui
Proc NCSP2015, Kuala Lumpur, Malaysia, -, 2015/02/28
Improving the naturalness of concatenative Vietnamese speech synthesis under limited data conditions
Phung Trung Nghia, Luong Chi Mai, Masato Akagi
Journal of Computer Science and Cybernetics, V.31, N.1, 1-16, 31, 1, 1-16-, 2015/01/01
Emotional speech synthesis system based on a three-layered model using a dimensional approach
Xue, Y., Hamada, Y., and Akagi, M.
Proc. APSIPA2015, Hong Kong, 505-514-, 2015
Study on method to control fundamental frequency contour related to a position on Valence-Activation space
Hamada, Y., Elbarougy, R., Xue, Y., and Akagi, M.
Proc. WESPAC2015, Singapore, 519-522-, 2015
Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments
Morita, S., Unoki, M., Lu, X., and Akagi, M.
Journal of Signal Processing Systems, DOI 10.1007/s11265-015-1014-4, -, 2015
A Method for Emotional Speech Synthesis Based on the Position of Emotional State in Valence-Activation Space
Yasuhiro Hamada, Reda Elbarougy and Masato Akagi
Proc. APSIPA2014, Siem Reap, Cambodia, -, 2014/12/12
Toward Affective Speech-to-Speech Translation: Strategy for Emotional Speech Recognition and Synthesis in Multiple Languages
Masato AKAGI, Xiao HAN, Reda ELBAROUGY, Yasuhiro HAMADA, and Junfeng LI
Proc. APSIPA2014, Siem Reap, Cambodia, -, 2014/12/10
Investigation of objective measures for intelligibility prediction of noise-reduced speech for Chinese, Japanese, and English
Junfeng Li, Risheng Xia, Dongwen Ying, Yonghong Yan, and Masato Akagi
Journal of Acoustical Society of America, 136, 6, 3301-3312-, 2014/12/05
Binaural sound source localization in noisy reverberant environments based on Equalization-Cancellation Theory
Chau, D. T., Li, J., and Akagi, M.
IEICE Trans. Fundamental, E97-A, 10, 2011-2020-, 2014/10/01
Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments
Morita, S., Unoki, M., Lu, X., and Akagi, M.
Proc. ISCSLP2014, Singapore, 108-112-, 2014/09/13
Toward relaying an affective speech-to-speech translator: Cross-language perception of emotional state represented by emotion dimensions
Elbarougy. R., Han.X., Akagi, M., and Li, J.
Proc. O-COCOSDA2014, Phuket, Thailand, 48-53-, 2014/09/10
Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System
Akagi, M., Han, X., El-Barougy, R., Hamada, Y. and Li, J.
Proc. IIHMSP2014, Kitakyushu, Japan, 574-577-, 2014/08/27
Toward Relaying Emotional State for Speech-To-Speech Translator: Estimation of Emotional State for Synthesizing Speech with Emotion
Akagi, M. and Elbarougy, R.
Proc. ICSV2014. Beijing, -, 2014/07/16
Perception of second language phoneme masked by first- or second-language speech in 20 – 60 years old listeners
Kubo, R., Akagi, M., and Akahane-Yamada, R.
167th ASA, Providence, RI, -, 2014/05/09
Glottal source analysis of emotional speech
Li, Y. and Akagi, M.
Proc. NCSP2014, Hawaii, USA, 513-516-, 2014/03/02
Speech recognition in noisy conditions based on speech separation using Non-negative Matrix Factorization
Du, Y. and Akagi, M.
Proc. NCSP2014, Hawaii, USA, 429-432-, 2014/03/02
Study on Analyzing Individuality of Instrument Sounds Using Non-negative Matrix Factorization
Kobayashi, K., Morikawa, D., and Akagi, M.
Proc. NCSP2014, Hawaii, USA, 37-40-, 2014/03/01
Improving Speech Emotion Dimensions Estimation Using a Three-Layer Model for Human Perception
Elbarougy, R. and Akagi, M.
Acoustical Science and Technology, 35, 2, 86-98-, 2014/03/01
Cross-lingual speech emotion recognition system based on a three-layer model for human perception
Elbarougy, R. and Akagi, M.
Proc. APSIPA2013, Kaohsiung, Taiwan, -, 2013/11/01
Improving naturalness of HMM-based TTS trained with limited data by temporal decomposition
Phung, T. N., Phan, T. S., Vu, T. T., Loung, M. C., and Akagi, M.
IEICE Trans. Inf. & Syst., E96-D, 11, 2417-2426-, 2013/11/01
Admissible range for individualization of head-related transfer function in median plane
Akagi, M. and Hisatsune, H.
Proc. IIHMSP2013, Beijing, -, 2013/10/17
A Hybrid TTS between Unit Selection and HMM-based TTS under limited data conditions
Phung, T. N., Luong, M. C., and Akagi, M.
Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain 281-284, -, 2013/09/02
Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese
Li, J, Chen, F., Akagi, M., and Yan, Y.
Proc. InterSpeech2013, Lyon, 1184-1187-, 2013/08/27
Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio
Chau, D. T., Li, J., and Akagi, M.
Proc. ChinaSIP2013, Beijing, 322-326-, 2013/07/08
Exploring auditory aging can exclusively explain Japanese adults′ age-related decrease in training effects of American English /r/-/l/
Kubo, R. and Akagi, M.
Proc. ICA2013, 2aSC34, Montreal, -, 2013/06/04
A Study on individualization of Head-Related Transfer Function in the median plane
Hisatsune, H. and Akagi, M.
Proc. NCSP2013, Hawaii, USA, 161-164-, 2013/03/05
A singing voices synthesis system to characterize vocal registers using ARX-LF model
Motoda, H. and Akagi, M.
Proc. NCSP2013, Hawaii, USA, 93-96-, 2013/03/05
Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages
Phung, T. N., Luong, M. C., and Akagi, M.
Proc. O-COCOSDA2012, Macau, 129-134-, 2012/12/12
A concatenative speech synthesis for monosyllabic languages with limited data
Phung, T. N., Luong, M. C., and Akagi, M.
Proc. APSIPA2012, Hollywood, USA, -, 2012/12/06
Speech Emotion Recognition System Based on a Dimensional Approach Using a Three-Layered Model
Elbarougy, R. and Akagi, M.
Proc. APSIPA2012, Hollywood, USA, -, 2012/12/04
A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model
Phung, T. N., Unoki, M., and Akagi, M.
Journal of Signal Processing, 16, 5, 409-417-, 2012/09/01
Privacy protection for speech based on concepts of auditory scene analysis
Akagi, M. and Irie, Y.
Proc. INTERNOISE2012, -, 2012/08/22
Evaluation of objective intelligibility prediction measures for noise-reduced signals in Mandarin
Xia, R., Li, J., Akagi, M., and Yan, Y.
Proc. ICASSP2012, 4465-4468-, 2012/03/28
Study on hearing impression of speaker identification focusing on dynamic features
Izumida, T. and Akagi, M.
Proc. NCSP2012, Honolulu, 401-404-, 2012/03/05
Speech enhancement technique in noisy reverberant environment using two microphone arrays
Sasaki, Y. and Akagi, M.
Proc. NCSP2012, 333-336-, 2012/03/05
Voice activity detection in MTF-based power envelope restoration
Unoki, M., Lu, X., Petrick, R., Morita, S., Akagi, M., and Hoffmann R.
Proc. INTERSPEECH 2011, Florence, Italy, 2609-2612-, 2011/08/30
Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y.
Speech Communication, 53, 677-689-, 2011/06
Comparative intelligibility investigation of single-channel noise-reduction algorithms for Chinese, Japanese, and English
Li, J., Yang, L., Zhang, J., Yan, Y., Hu, Y., Akagi, M., and Loizou, P. C.
J. Acoust. Soc. Am., 129, 3291-3301-, 2011/05
Influences of transformed auditory feedback with first three formant frequencies
Shih, T, Suemitsu, A., and Akagi, M.
Proc. NCSP2011, 340-343-, 2011/03/03
Towards an intelligent binaural speech enhancement system by integrating meaningful signal extraction
Chau, D. T., Li, J., and Akagi, M.
Proc. NCSP2011, 344-347-, 2011/03/03
A binaural model accounting for spatial masking release
Mizukawa, S. and Akagi, M.
Proc. NCSP2011, 179-182-, 2011/03/02
Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals
Yanao, Y., Miyauchi, R., Unoki, M., and Akagi, M.
Proc. NCSP2011, 231-234-, 2011/03/02
Study on blind estimation of Speech Transmission Index in room acoustics
Ikeda, T., Unoki, M., and Akagi, M.
Proc. NCSP2011, 235-238-, 2011/03/02
Study on suitable-architecture of IIR all-pass filter for digital-audio watermarking technique based on cochlear-delay characteristics
Kosugi, T., Haniu, A., Miyauchi, R., Unoki, M., and Akagi, M.
Proc. NCSP2011, 135-138-, 2011/03/01
An investigation on speech perception over coarticulation
Trung-Nghia Phung, Mai Chi Luong, and Masato Akagi
Proc. ICSAP2011, 507-511-, 2011/02
An investigation on perceptual line spectral frequency (PLP-LSF) target stability against the vowel neutralization phenomenon
Trung-Nghia Phung, Mai Chi Luong, and Masato Akagi
Proc. ICSAP2011, 512-514-, 2011/02
Towards intelligent binaural speech enhancement by meaningful sound extraction
Chau, D. T., Li, J., and Akagi, M.
Journal of Signal Processing, 15, 4, 291-294-, 2011
Study on MTF-based power envelope restoration in noisy reverberant environments
Morita, S., Lu, X., Unoki, M., and Akagi, M.
Proc. NCSP2011, 247-250-, 2011
Pitch perception of complex sounds with varied fundamental frequency and spectral tilt
Ishida, M. and Akagi, M.
Proc. NCSP10, -, 2010/03/04
A study on brain activities elicited by synthesized emotional voices controlled with prosodic features
Hamada, Y., Kitamura, T., and Akagi, M.
Proc. NCSP10, -, 2010/03/04
Experimental evaluations of TS-BASE/WF in reverberant conditions
Li, J. Sasaki, Y., Akagi, M. and Yan, Y.
Proc. NCSP10, -, 2010/03/04
A study on the MTF-based inverse filtering for the modulation spectrum of reverberant speech
Morita, S., Unoki, M., and Akagi, M.
Proc. NCSP10, -, 2010/03/04
Effects of spatial cues on detectability of alarm signals in noisy environments
Naoki Kuroda, Junfeng Li, Yukio Iwaya, Masashi Unoki, and Masato Akagi
Proc. IWPASH2009, -, 2009/11
Two-stage binaural speech enhancement with Wiener filter based on equalization-cancellation model
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi and Yôiti Suzuki
Proc. WASPPA2009 --- IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, -, 2009/10
Physiologically-inspired feature extraction for emotion recognition
Yu Zhou, Yanqing Sun, Junfeng Li, Jianping Zhang, Yonghong Yan
Proc. Interspeech2009, -, 2009/09
Advancement of two-stage binaural speech enhancement (TS-BASE) for high-quality speech communication
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi and Yôiti Suzuki
Proc. The 10th Western Pacific Acoustics Conference, -, 2009/09
Speech-to-Singing Synthesis System: Vocal Conversion from Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices
Saitou, T., Goto, M., Unoki, M., and Akagi, M.
NCMMSC2009, -, 2009/08/15
A flexible spectral modification method based on temporal decomposition and Gaussian mixture model
Nguyen, B. P. and Akagi, M.
Acoustical Science and Techechnology, 30, 3, 170-179-, 2009/05/01
Psychoacoustically-motivated adaptive -order generalized spectral subtraction for cochlear implant patients
Junfeng Li, Qian-Jie Fu, Hui Jiang, Masato Akagi
Proc. ICASSP2009, pp. 4665-4668-, 2009/04
Effects from Spatial Cues on Detectability of Alarm Signals in Car Environments
Kuroda, N., Li, J., Iwaya, Y., Unoki, M., and Akagi, M.
Proc. NCSP'09, 45-48-, 2009/03/01
An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech
Kinugasa, K., Unoki, M., and Akagi, M.
Proc. NCSP'09, 105-108-, 2009/03/01
An emotional speech recognition system based on multi-layer emotional speech perception model
Aoki, Y., Huang, C-F., and Akagi, M.
Proc. NCSP'09, 133-136-, 2009/03/01
Effects from spatial cues on detectability of alarm signals in car environments
N. Kuroda, J. Li, Y. Iwaya, M. Unoki and M. Akagi
Proc. NCSP’09, pp. 45-48-, 2009/03
Analysis of production and perception characteristics of non-linguistic information in speech and its application to inter-language communications
Akagi, M.
Proc. APSIPA2009, -, 2009
Comparison of emotion perception among different cultures
Dang, J., Li, A., Erickson, D., Suemitsu, A., Akagi, M., Sakuraba, K., Minematsu, N., and Hirose, K.
Proc. APSIPA2009, -, 2009
Advancement of two-stage binaural speech enhancement (TS-BASE) for high-quality speech communication
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y.
Proc. WESPAC2009, -, 2009
A psychoacoustically-motivated conceptual model for automatic speech recognition
Haniu, A., Unoki, M., and Akagi, M.
Proc. WESPAC2009, -, 2009
Efficient modeling of temporal structure of speech for applications in voice transformation
Nguyen B. P. and Akagi M.
Proc. InterSpeech2009, -, 2009
The improved TS-BASE approaches with interference compensation and their evaluations for speech enhancement
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi and Yo-iti Suzuki
in Proc. ISCSLP2008-The 6th International Symposium on Chinese Spoken Language Processing, -, 2008/12
Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems
Lu, X., Unoki, M., and Akagi, M.
Acoustical Science and Technology, 29, 6, 351-361-, 2008/11/01
Adaptive Beta-order Generalized Spectral Subtraction for Speech Enhancement
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi and Yo-iti Suzuki
Signal Processing, 88, 11, pp. 2764-2776-, 2008/11
A three-layered model for expressive speech perception
Huang, C-F. and Akagi, M.
Speech Communication, 50, 810-828-, 2008/10
High-quality analysis/synthesis method based on Temporal decomposition for speech modification
Nguyen, B. P., Shibata, T., and Akagi, M.
Proc. InterSpeech2008, Brisbane, 662-665-, 2008/09/24
Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimization
Li, J., Jiang, H., and Akagi, M.
Proc. InterSpeech2008, Brisbane, 171-174-, 2008/09/23
Robust front end processing for speech recognition in reverberant environments: Utilization of speech characteristics
Petric, R., Lu, X., Unoki, M., Akagi, M., and Hoffmann, R.
Proc. InterSpeech2008, Brisbane, 658-661-, 2008/09/23
Psychoacoustically-motivated adaptive beta-order generalized spectral subtraction based on data-driven optimization
Junfeng Li, Hui Jiang and Masato Akagi
in Proc. Interspeech2008, -, 2008/09
Improved two-stage binaural speech enhancement based on accurate interference estimation for hearing aids
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y.
IHCON2008, -, 2008/08/16
Experimental evaluation of the two-stage binaural speech enhancement with Wiener filter for speech enhancement and sound localization
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi and Yôiti Suzuki
Proc. ISAAR2009, -, 2008/08
Improved two-stage binaural speech enhancement based on accurate interference estimation for hearing aids
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi and Yôiti Suzuki
in Proc. International Hearing Aid Research Conference, -, 2008/08
An MTF-based blind restoration for temporal power envelopes as a front-end processor for automatic speech recognition systems in reverberant environments
Lu, X., Unoki, M., and Akagi, M.
Acoustics2008, Paris, 1419-1424-, 2008/07/01
Comparison of Japanese expressive speech perception by Japanese and Taiwanese listeners
Huang, C. F., Erickson, D., and Akagi, M.
Acoustics2008, Paris, 2317-2322-, 2008/07/01
Extension of the two-microphone noise reduction method for binaural hearing aids
Junfeng Li, Masato Akagi and Yo-iti Suzuki
in Proc. International Conference on Audio, Language and Image Processing 2008, pp. 97-101-, 2008/07
A two-stage binaural speech enhancement approach for hearing aids with preserving binaural benefits in noisy environments
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y.
Acoustics2008, Paris, 723-727-, 2008/06/30
Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture mode
Nguyen B. P. and Akagi M.
Proc. ICCE2008, 224-229-, 2008/06/06
An LP-based blind model for restoring bone-conducted speech
Vu, T. T. Unoki, M. and Akagi, M.
Proc. ICCE2008, 212-217-, 2008/06/05
A hybrid microphone array post-filter in a diffuse noise field
Junfeng Li and Masato Akagi
Applied Acoustics, 69, 6, pp. 546-557-, 2008/06
A Two-Microphone Noise Reduction Method in Highly Non-Stationary Multiple-Noise-Source Environments
Junfeng Li, Masato Akagi and Yo-iti Suzuki
IEICE Trans. on Fundamentals of Electronics, Communications and Computer Science, E91-A, 6, pp. 1337-1346-, 2008/06
A two-stage binaural speech enhancement approach for hearing aids with preserving binaural benefits in noisy environments
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi and Yo-iti Suzuki
in Proc. Forum Acousticum 2008, pp. 723-727-, 2008/06
Two-input two-output speech enhancement using adaptive filter and soft decision mask filter
Ai Sasaki, Shuichi Sakamoto, Satoshi Hongo, Junfeng Li and Yo-iti Suzuki
in Proc. the 3rd International Symposium on Medical, Bio- and Nano-Electronics, -, 2008/03
A study on nonlinguistic feature in singing and speaking voices by brain activity measurement
Nakamura, T., Kitamura, T. and Akagi, M.
Proc. NCSP'09, 217-220-, 2008
Adaptive β-order generalized spectral subtraction for speech enhancement
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y.
Signal Processing, 88, 11, 2764-2776-, 2008
A two-microphone noise reduction method in highly non-stationary multiple-noise-source environments
Li, J., Akagi, M., and Suzuki, Y.
IEICE Trans. Fundamentals, E91-A, 6, 1337-1346-, 2008
A hybrid microphone array post-filter in a diffuse noise field
Li, J. and Akagi, M.
Applied Acoustics (Elsevier), 69, 546-557-, 2008
The Construction of Large-scale Bone-conducted and Air-conducted Speech Databases for Speech Intelligibility Tests
Vu, T. T. Unoki, M. and Akagi, M.
Proc. Oriental COCOSDA2007, 88-91-, 2007
Speech-to-singing synthesis: converting speaking voices to singing voices by controlling acoustic features unique to singing voices
Saitou, T., Goto, M., Unoku, M., and Akagi, M.
Proc. WASPAA2007, 215-218-, 2007
Improvement in detectability of alarm signals in noisy environments by utilizing spatial cues
Uchiyama, H., Unoku, M., and Akagi, M.
Proc. WASPAA2007, 74-77-, 2007
Method of LP-based blind restoration for improving intelligibility of bone-conducted speech
Vu, T. T., Seide, G., Unoki, M., and Akagi, M.
Proc. Interspeech2007, 966-969-, 2007
Vocal conversion from speaking voice to singing voice using STRAIGHT
Saitou, T., Goto, M., Unoki, M., and Akagi, M.
Proc. Interspeech2007, Singing Challenge, -, 2007
Noise reduction based on adaptive β-order generalized spectral subtraction for speech enhancement
Li, J., Sakamoto, S., Hongo, S., Akagi, M., and Suzuki, Y.
Proc. Interspeech2007, 802-805-, 2007
A rule-based speech morphing for verifying an expressive speech perception model
Huang, C. F. and Akagi, M.
Proc. Interspeech2007, 2661-2664-, 2007
A flexible spectral modification method based on temporal decomposition and Gaussian mixture model
Nguyen B. P. and Akagi M.
Proc. Interspeech2007, 538-541-, 2007
Common factors in emotion perception among different cultures
Sawamura K., Dang J., Akagi M., Erickson D., Li, A., Sakuraba, K., Minematsu, N., and Hirose, K.
Proc. ICPhS2007, 2113-2116-, 2007
Voice conversion to add non-linguistic information into speaking voices
Akagi, M., Saitou, T., and Huang, C-F.
Proc. JCA2007, -, 2007
Limited error based event localizing temporal decomposition and its application to variable-rate speech coding
Nguyen, P. C., Akagi, M., and Nguyen, P. B.
Speech Communication, 49, 292-304-, 2007
Spectral Modification for Voice Gender Conversion using Temporal Decomposition
Nguyen B. P. and Akagi M.
Journal of Signal Processing,, 11, 4, 333-336-, 2007
A Model-Concept of the Selective Sound Segregation: ?A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments
Unoki, M., Kubo, M., Haniu, A., and Akagi, M.
Journal of Signal Processing, 10, 6, 419-431-, 2006/11
A Study on Restoration of Bone-Conducted Speech with MTF-Based and LP-based Models
Vu, T., Unoki, M., and Akagi, M.
Journal of Signal Processing, 10, 6, 407-417-, 2006/11
A study on an LP-based model for restoring bone-conducted speech
Vu, T., Unoki, M., and Akagi, M.
Proc. HUT-ICCE2006, Hanoi, -, 2006/10
A robust feature extraction based on the MTF concept for speech recognition in reverberant environment
Lu, X., Unoki, M., and Akagi, M.
Proc. ICSLP2006, Pittsburgh, USA, 2546-2549, -, 2006/09
Improved hybrid microphone array post-filter by integrating a robust speech absence probability estimator for speech enhancement
Li, J, Akagi, M., and Suzuki, Y.
Proc. ICSLP2006, Pittsburgh, USA, 2130-2133, -, 2006/09
Effects of complicated vocal tract shapes on vocal tract transfer functions
Nishimoto, H. and Akagi, M.
Journal of Signal Processing, 10, 4, 267-270-, 2006/07
Effect of ITD and component frequencies on perception of alarm signals in noisy environments
Nakanishi, J., Unoki, M., and Akagi, M.
Journal of Signal Processing, 10, 4, 231-234-, 2006/07
Noise reduction method based on generalized subtractive beamformer
Li, J. and Akagi, M.
Acoust. Sci. & Tech., The journal of the Acoustical Society of Japan, 27, 4, 206-215-, 2006/07
Effects of complicated vocal tract shapes on vocal tract transfer functions
Nishimoto, H. and Akagi, M.
Proc. NCSP2006, 114-117-, 2006
Effect of ITD and component frequencies on perception of alarm signals in noisy environments
Nakanishi, J., Unoki, M., and Akagi, M.
Proc. NCSP2006, 37-40-, 2006
A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments
Li, J. and Akagi, M.
Speech Communication, 48, 111-126-, 2006
Comparative analysis of the two-step reaction catalyzed by prokaryotic and eukaryotic phytochelatin synthase by an ion-pair liquid chromatography assay.
Naoki Tsuji, Shingo Nishikori, Sachiko Matsumoto, Osamu Iwabe, Kentaro Shiraki, Hitoshi Miyasaka, Masahiro Takagi, Kazumasa Hirata, and Kazuhisa Miyamoto.
Planta, 222, 181-191 (2005), -, 2005
A hybrid microphone array post-filter in a diffuse noise field
Li, J. and Akagi, M.
Proc. EuroSpeech2005, 2313-2316-, 2005
A model for selective segregation of a target instrument sound from the mixed sound of various instruments
Unoki, M., Kubo, M., Haniu, A., and Akagi, M.
Proc. EuroSpeech2005, 2097-2100-, 2005
A Multi-Layer fuzzy logical model for emotional speechPerception
Huang, C. F. and Akagi, M.
Proc. EuroSpeech2005, 417-420-, 2005
A noise reduction system in arbitrary noise environments and its application to speech enhancement and speech recognition
Li, J., Lu, X., and Akagi, M.
Proc. ICASSP2005, III-277-280-, 2005
A study on a speech recognition method based on the selective sound segregation in noisy environment
Haniu, A., Unoki, M. and Akagi, M.
Proc. NCSP05, 403-406-, 2005
Toward a rule-based synthesis of emotional speech on linguistic description of perception
Huang, C. F. and Akagi, M.
Affective Computing and Intelligent Interaction, Springer LNCS 3784, 366-373-, 2005
Study on improving regularity of neural phase locking in single neurons of AVCN via a computational model
Ito, K. and Akagi, M.
Auditory Signal Processing, Springer, 91-99-, 2005
A computational model of cochlear nucleus neurons
Maki, K. and Akagi, M.
Auditory Signal Processing, Springer, 84-90-, 2005
Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis
Saitou, T., Unoki, M. and Akagi, M.
Speech Communication, 46, 405-417-, 2005
Temporal decomposition of speech and its application to speech coding and modification
Akagi, M. and Nguyen, P. C.
Proc. Special Workshop in MAUI (SWIM), 1,4-, 2004/01
Fundamental frequency estimation for noisy speech using entropy-weighted periodic and harmonic features
Ishimoto, Y. and Akagi, M.
IEICE Trans. Inf. & Syst., E87-D, 1, 205-214-, 2004/01
Noise reduction using hybrid noise estimation technique and post-filtering
Li, J. and Akagi, M.
Proc. ICSLP2004, -, 2004
Analysis of acoustic features affecting “singing-ness” and its application to singing-voice synthesis from speaking-voice
Saitou, T., Tsuji, N., Unoki, M. and Akagi, M.
Proc. ICSLP2004,, -, 2004
Temporal decomposition of speech and its application to speech coding and modification
Akagi, M., Nguyen, P. C., Saitou, T., Tsuji, N., and Unoki, M.
Proc. KEST2004, 280-288-, 2004
A model for selective segregation of a target instrument sound from the mixed sound of various instruments
Unoki, M., Kubo, M., and Akagi, M.
Proc. ICMC2003, 295-298-, 2003/10
A speech dereverberation method based on the MTF concept
Unoki, M., Sakata, K. and Akagi, M.
Proc. EUROSPEECH200, 1417-1420-, 2003/09
Efficient quantization of speech excitation parameters using temporal decomposition
Nguyen, P. C. and Akagi, M.
Proc. EUROSPEECH2003, 449-452-, 2003/09
Study on improving regularity of neural phase locking in single neuron of AVCN via computational model
Ito, K. and Akagi, M.
Proc. ISH2003, 77-83-, 2003/08
A computational model of cochlear nucleus neurons
Maki, K. and Akagi, M.
Proc. ISH2003, 70-76-, 2003/08
Temporal decomposition: A promising approach to VQ-based speaker identification
Nguyen, P. C., Akagi, M., and Ho, T. B.
Proc. ICME2003, III, 617-620-, 2003/07
A method based on the MTF concept for dereverberating the power envelope from the reverberant signal
Unoki, M., Furukawa, M., Sakata, K., and Akagi, M.
Proc. ICASSP2003, I, 840-843-, 2003/05
Temporal decomposition: A promising approach to VQ-based speaker identification
Nguyen, P. C., Akagi, M., and Ho, T. B.
Proc. ICASSP2003, I, 184-187-, 2003/05
Modified Restricted Temporal Decomposition and its Application of Low Rate Speech Coding
Nguyen, P. C., Ochi, T., and Akagi, M.
IEICE Trans. Inf. & Syst., E86-D, 3, 397-405-, 2003/03
Development of the F0 control method for singing-voices synthesis
Saitou, T., Unoki, M., and Akagi, M.
Proc. SP2004, 491-49-, 2003
A method for recovering the power envelope from reverberant speech
Unoki, M., Furukawa, M., and Akagi, M.
Forum Acousticum Sevilla 2002, SPA-Gen-002-, 2002/09
Perception of fundamental frequency fluctuation
Akagi, M.
Forum Acousticum Sevilla 2002 (Invited), HEA-02-003-IP-, 2002/09
Coding speech at very low rates using STRAIGHT and temporal decomposition
Nguyen, P. C. and Akagi, M.
Proc. ICSLP2002, 1849-1852-, 2002/08
Limited error based event localizing temporal decomposition
Nguyen, P. C. and Akagi, M.
Proc. EUSIPCO2002, 90, -, 2002/08
Extraction of F0 dynamic characteristics and development of F0 control model in singing voice
Saitou, T., Unoki, M., and Akagi, M.
Proc. ICAD2002, 275-278-, 2002/07
Improvement of the restricted temporal decomposition method for line spectral frequency parameters
Nguyen, P. C. and Akagi, M.
Proc. ICASSP2002, I, 265-268-, 2002/05
Noise reduction using a small-scale microphone array in multi noise source environment
Akagi, M. and Kago, T.
Proc. ICASSP2002, I, 909-912-, 2002/05
Speech enhancement and segregation based on human auditory mechanisms
Akagi, M., Mizumachi, M., Ishimoto, Y., and Unoki, M.
Enabling Society with Information Technology, Q. Jin, J. Li, N. Zhang, J. Cheng, C. Yu, and S. Noguchi (Eds.), Springer Tokyo, pp.186-196-, 2002
A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency
Ishimoto, Y., Unoki, M., and Akagi, M.
Proc. EUROSPEECH2001, pp.2439-2442-, 2001
Noisiness estimation of machine working noise using human auditory model
Akagi, M., Kakehi, M., Kawaguchi, M., Nishinuma, M., and Ishigami, A.
Proc. Internoise2001, pp.2451-2454-, 2001
A computational model of co-modulation masking release
Unoki, M. and Akagi, M.
Computational Models of Auditory Function, (Eds. Greenberg, S. and Slaney, M.), NATO ASI Series, IOS Press, Amsterdam, pp.221-232-, 2001
A computational model of auditory sound localization
Itoh, K. and Akagi, M.
Computational Models of Auditory Function (Eds. Greenberg, S. and Slaney, M.), NATO ASI Series, IOS Press, Amsterdam, pp.97-111-, 2001
Perception of Lateral Misarticulation and Its Physical Correlates
Akagi, M., Suzuki, N., Hayashi, K., Saito, H., and Michi, K.
Folia Phoniatrica et Logopaedica, Vol.53, No.6, pp.291-307-, 2001
Spectral stability based event localizing temporal decomposition
A. C. R. Nandasena, P. C. Nguyen, and M. Akagi
Computer Speech & Language, Vol.15, No.4, pp.381-401-, 2001
Effect of the basilar membrane nonlinearities on rate-place representation of vowel in the cochlear nucleus: A modeling approach
Maki, K., Akagi, M. and Hirota, K.
Recent Developments in Auditory Mechanics, World Scientific Publishing, 490-496-, 2000
A computational model of auditory sound localization based on ITD
Ito, K. and Akagi, M.
Recent Developments in Auditory Mechanics, World Scientific Publishing, 483-489-, 2000
The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises
Mizumachi, M. and Akagi, M.
J. Acoust. Soc. Jpn. (E), 21, 5, 251-258-, 2000
A method of signal extraction from noisy signal based on auditory scene
Unoki Masashi, Akagi Masato
analysis Speech Communication, 27, 3-4, 261-279-, 1999/04