AKAGI, Masato Professor
School of Information Science, Human Life Design Area, School of Information Science
◆Degrees
PhD Tokyo Institute of Technology
PhD Tokyo Institute of Technology
◆Professional Experience
: Associate Professor of School of Information Science at JAIST (1992-1999)
: NTT Basic Research Laboratories (1984), ATR Auditory and Visual Perception Research Laboratories (1986-1990)
◆Specialties
Perceptual information processing
◆Research Keywords
音波形の信号処理
◆Research Interests
Non-linguistic information in speech
1. Non-linguistic Information 1-1 Singing Voice 1-2 Speaker Indivisuality 1-3 Emotional Speech 1-4 Voice Conversion 1-5 Speech Coding 備考
Noise reduction in speech
2. Noise Reduction 2-1 Microphone Array 2-2 F0 Extraction 2-3 De-reverberation 2-4 Bone-conducted Speech 2-5 Speech Recognition 2-6 DOA
Modeling of "Cocktail-party effect"
3. Cocktail-party Effect Modeling 3-1 Sound Segregation 3-2 Privacy Protection 3-3 Noisy Sound Perception
Modeling of human ear based on psychoacoustics
4. Psychoacoustics 4-1 Auditory Model 4-2 Contextual Effect 4-3 Auditory Filter 4-4 Phase Perception 4-5 Vowel Perception 4-6 Noise Evaluation
Physiological Auditory Modeling
5. Physiological Auditory Modeling
Abnormal speech
6. Abnormal Speech 6-1 Abnormal Speech Perception 6-2 3D Vocal Tract Modeling
Interaction between Perception and Production
7. Interaction between Perception and Production

■Publications

◆Published Papers
Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks with Auditory Front-Ends
Zhichao Peng, Xingfeng Li, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
IEEE Access, 8, 16560-16572, 2020
A Three-Layer Emotion Perception Model for Valence and Arousal-Based Detection from Multilingual Speech
Xingfeng, Li, Masato Akagi
Proc. InterSpeech2018, Hyderabad, India, 3643-3647-, 2018
Unsupervised Singing Voice Separation Based on Robust Principal Component Analysis Exploiting Rank-1 Constraint
Feng Li, Masato Akagi
Proc. EUSIPCO2018, Rome, Italy, 1934-1938-, 2018
Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space
Yawen Xue, Yasuhiro Hamada, Masato Akagi
Speech Communication, 102, 54-67-, 2018
Non-parallel Dictionary-based Voice Conversion using Variational Autoencoder with Modulation Spectrum-constrained Training
Ho-Tuan Vu, Akagi Masato
Journal of Signal Processing, 22, 4, 189-192-, 2018
◆Misc
Study on modeling of room impulse response and its room acoustic characteristics
鵜木 祐史, 石川 大介, 柏原 佑太, 小林 まおり, 赤木 正人
電子情報通信学会技術研究報告 = IEICE technical report : 信学技報, 116, 302, 79-84, 2016
A modulation-transfer-function-based method for restores sub-band power envelope from noisy reverberant speech
S. Morita, X. Lu, M. Unoki, M. Akagi, R. Hoffmann
The Acoustics 2012 Hong Kong Conference and exhibition, -, 2012
A study on the IMTF-based filtering on the modulation spectrum of reverberant signal
S. Morita, M. Unoki, M. Akagi
2010 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP’2010, 265-268, 2010
音声生成における軟口蓋の働きのモデル化に関する研究(音声・聴覚,一般)
朴永男, 党建武, 中井考芳, 赤木正人
電子情報通信学会技術研究報告. SP, 音声, 106, 178, 37-42, 2006
Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency
Yuichi Ishimoto, Masashi Unoki, Masato Akagi
JAIST Research report, 2005, 1-31, 2005
◆Books
音響学入門,第2章「音を聞く仕組み」
1-244, コロナ社, 2011
Effects of spatial cues on detectability of alarm signals in noisy environments, In Principles and applications of spatial hearing (Eds. Suzuki, Y., Brungart, D., Iwaya, Y., Iida, K., Cabrera, D., and Kato, H.)
484-493, World Scientific, 2011
Noise Reduction Based on Microphone Array and Post-Filtering
ISBN-NR.:978-3-639-20483-4, VDM Publishing House Ltd., 2009
脳科学大事典
共著, 朝倉書店, 2000
音のなんでも小辞典
共著, 講談社ブルーバックス, 1996
◆Conference Activities & Talks
Toward Affective Speech-to-Speech Translation
International Conference on Advances in Information and Communication Technology 2016, DOI 10.1007/978-3-319-49073-1 3, Thai Nguyen, Vietnam, 2016
表現豊かな音声の認識・合成とAffective Speech-to-Speech Translationへの応用
2015音学シンポジウム,情報処理学会研究報告,2015-MUS-107, 6, 電気通信大学, 2015
カクテルパーティ効果とスピーチプライバシー保護
日本音響学会平成24年春季研究発表会,2-2-3, 神奈川大学, 2012
音情景理解を応用した音声プライバシー保護
電子情報通信学会技術報告,EMM2011-59, 機械振興会館(東京), 2011
聴覚と音研究
音響学会聴覚研究会資料,41, 7, H-2011-104, 2011

■Teaching Experience

Speech Signal Processing, Statistics for Data Analytics, 音声情報処理特論, データ分析のための情報統計学

■Contributions to  Society

◆Academic Society Affiliations
信号処理学会, The Institute of Electronics, Information and Communication Engineers, The Acousitcal Society of Japan

■Academic  Awards

・ 日本音響学会佐藤論文賞 , 日本音響学会 , 2011
・ インタラクション2009,インタラクティブ発表賞 , 情報処理学会 , 2009
・ 信号処理学会 Best Paper Award , 信号処理学会 , 2009