AKAGI, Masato Professor
School of Information Science, Human Life Design Area
◆Degrees
B.E.from Nagoya Institute of Technology (1979), M.E and Ph.D.from Tokyo Institute of Technology (1981,1984)
◆Professional Experience
NTT Basic Research Laboratories (1984), ATR Auditory and Visual Perception Research Laboratories (1986-1990), Associate Professor of School of Information Science at JAIST (1992-1999)
◆Specialties
Speech Signal Processing, Modeling of Speech Perception Mechanism of Humans
◆Research Interests
Non-linguistic information in speech
1. Non-linguistic Information 1-1 Singing Voice 1-2 Speaker Indivisuality 1-3 Emotional Speech 1-4 Voice Conversion 1-5 Speech Coding
Noise reduction in speech
2. Noise Reduction 2-1 Microphone Array 2-2 F0 Extraction 2-3 De-reverberation 2-4 Bone-conducted Speech 2-5 Speech Recognition 2-6 DOA
Modeling of "Cocktail-party effect"
3. Cocktail-party Effect Modeling 3-1 Sound Segregation 3-2 Privacy Protection 3-3 Noisy Sound Perception
Modeling of human ear based on psychoacoustics
4. Psychoacoustics 4-1 Auditory Model 4-2 Contextual Effect 4-3 Auditory Filter 4-4 Phase Perception 4-5 Vowel Perception 4-6 Noise Evaluation
Physiological Auditory Modeling
5. Physiological Auditory Modeling
Abnormal speech
6. Abnormal Speech 6-1 Abnormal Speech Perception 6-2 3D Vocal Tract Modeling
Interaction between Perception and Production
7. Interaction between Perception and Production

■Publications

◆Published Papers
・ Non-parallel Dictionary-based Voice Conversion using Variational Autoencoder with Modulation Spectrum-constrained Training , Ho-Tuan Vu and Akagi Masato , Journal of Signal Processing , 22 , 4 , 189-192 , 2018/08/01
・ Contributions of the glottal source and vocal tract cues to emotional vowel perception in the valence-arousal space , Yongwei Li, Junfeng Li, and Masato Akagi , J. Acoust. Soc. Am. , 144 , 2 , 908-916 , 2018/08/01
・ Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space , Yawen Xue, Yasuhiro Hamada, and Masato Akagi , Speech Communication , 102 , 54-67 , 2018/09/01
・ Perceptual grouping with prosodic features in Japanese dialects , Ling Zhang and Masato Akagi , Proc. NCSP2018, Honolulu, USA , 184-187 , 2018/03/05
・ Study on differences between perceptions of Japanese and Chinese emotional speech by Japanese and Chinese listeners , Chenyi Zhang and Masato Akagi , Proc. NCSP2018, Honolulu, USA , 359-362 , 2018/03/06
◆Books
・ 音響学入門,第2章「音を聞く仕組み」 , 鈴木陽一,赤木正人,伊藤彰則,佐藤洋,苣木禎史,中村健太郎 , コロナ社 , 2011
・ Effects of spatial cues on detectability of alarm signals in noisy environments, In Principles and applications of spatial hearing (Eds. Suzuki, Y., Brungart, D., Iwaya, Y., Iida, K., Cabrera, D., and Kato, H.) , Kuroda, N., Li, J., Iwaya, Y., Unoki, M., and Akagi, M. , World Scientific , 2011
・ Noise Reduction Based on Microphone Array and Post-Filtering , Junfeng Li, Masato Akagi , VDM Publishing House Ltd. , 2009
・ 音のなんでも小辞典 , 日本音響学会編 , 講談社ブルーバックス , 1996
・ 脳科学大事典 , 甘利、外山編 , 朝倉書店 , 2000
◆Conference Activities & Talks
・ 避難呼びかけ音声の心理的評価 ~ 30~60代を対象とした調査 ~ , 小林まおり,赤木正人 , 音響学会聴覚研究会資料,H-2018-31 , 那覇IT創造館 , 2018/03/03
・ 雑音・残響環境下での緊迫感がある音声の知覚 , 小林まおり,赤木正人 , 日本音響学会騒音振動研究会資料,N-2018-17 , 金沢しいのき迎賓館 , 2018/03/09
・ 残響下の了解度と発話変形との関係-フォルマント分析による検討- , 久保理恵子,赤木正人 , 日本音響学会騒音振動研究会資料,N-2018-18 , 金沢しいのき迎賓館 , 2018/03/09
・ 発話時の残響時間によるフォルマント周波数の変化と残響下における了解度 , 久保理恵子,赤木正人 , 日本音響学会電気音響研究会資料,EA-2017-108 , 石垣 , 2018/03/19
・ 音声の緊迫感に関与する音響特徴の検討 , 小林 まおり 濱田 康弘 赤木 正人 (). “”,. , 音響学会聴覚研究会資料,H-2018-71 , 北海道大学 , 2018/07/24

■Contributions to  Society

◆Social Contribution
・ InterSpeech
・ INt. Conf. Acoustic, Speech, Signal Processing
・ 3rd International conference on Spoken Language Processing , Member of Technical programing comittee(1994)

■Academic  Awards

・ The 1st place in Singing Synthesis Challenge, InterSpeech2007 , 2007