UNOKI, Masashi Professor, Director of Research Center for Biological Function and Sensory Information, Director of Human Information Science Research Area
Information Science, Human Information Science, Research Center for Biological Function and Sensory Information
◆Degrees
M.S. and Ph.D. from Japan Advanced Institute of Science and Technology (1996,1999) 北陸先端科学技術大学院大学
◆Professional Experience
2000 - 2001 : 日本学術振興会特別研究員(DC2)(1998),ATR人間情報通信研究所 客員研究員(1999),ケンブリッジ大学CNBH客員研究員(2000-2001),日本学術振興会特別研究員(PD,北陸先端科学技術大学院大学 情報科学研究科)(1999-2001)
2000 - 2001 : JSPS Research Fellow (DC2)(1998), Visiting researcher, ATR Human Information Processing Laboratories (1999), Visiting Associate, CNBH, Univ. of Cambridge (2000-2001), JSPS Research Fellow (PD)(1999-2001)
◆Specialties
Intelligent robotics, Perceptual information processing, Intelligent informatics
◆Research Keywords
Audio Information Hiding, 音声信号処理, 聴覚情景解析, 聴覚モデル, Speech dereverberation, Computational Auditory Scene Analysis, Auditory filterbank
◆Research Interests
Construction of the auditory filterbank
The aim of this work is to construct the auditory filterbank that can account psycho-acoustical data and physiological data for frequency selectivity of the auditory system.
Computational Auditory Scene Analysis
The study of computational theory of the auditory system tries to answer the following questions:
Extraction of the fundamental frequency of speech in real environments
Extraction of the fundamental frequency (F0) of target speech is an important problem not only in speech analysis/synthesis but also in various speech signal processings such as speech segregation. Various F0 estimation methods have been proposed, but the most of these methods have the drawbacks for estimating accurate F0s of target speech in real environments. My approach is to construct an estimation model based on computational auditory scene analysis (CASA).
A study on the speech dereverberation method
To dereverberate the original signal from a reverberant signal is an important issue concerning speech signal processing such as preprocessing for speech recognition systems. Most of the inverse filtering methods have to measure the impulse response of the room acoustics to determine its inverse filter before the dereverberation. Moreover, the impulse response temporally varies with various environmental factors (temperature etc.), so the room acoustics have to be measured each time these methods are used. In this work, it has been trying to model a speech dereverberation based on the Modulation Transfer Function, without measuring the impulse response of room acoustics.

■Publications

◆Published Papers
Effects of delayed auditory feedback on consonant, vowel, and mora timing in Japanese speech
Yasufumi Uezu, Yosuke Himekomatsu, Masato Akagi, Masashi Unoki
Acoustical Science and Technology, 47, 1, 66-70, 2026
Lightweight Speech Intelligibility Prediction with Spectro-Temporal Modulation for Hearing-Impaired Listeners
Xiajie Zhou, Candy Olivia Mawalim, Huy Quoc Nguyen, Masashi Unoki
The 6th Clarity Workshop on Improving Speech-in-Noise for Hearing Devices (Clarity-2025), 1-3, 2025
Integrating Linguistic and Acoustic Cues for Machine Learning-Based Speech Intelligibility Prediction in Hearing Impairment
Candy Olivia Mawalim, Xiajie Zhou, Huy Quoc Nguyen, Masashi Unoki
The 6th Clarity Workshop on Improving Speech-in-Noise for Hearing Devices (Clarity-2025), 22-24, 2025
Important Modulation Frequency Components of Temporal Amplitude Envelope Contributing to Vocal Emotion Perception
Taiyang Guo, Shunsuke Kidani, Takuto Isoyama, Peter Birkholz, Masato Akagi, Masashi Unoki
Journal of Speech, Language, and Hearing Research, 1-15, 2025
Robust Multilingual Audio Deepfake Detection Through Hybrid Modeling
Candy Olivia Mawalim, Yutong Wang, Aulia Adila, Shogo Okada, Masashi Unoki
Proceedings of the ACM Workshop on Information Hiding and Multimedia Security, 181-192, 2025
◆Misc
ISO/TC 43・ISO/TC 43/SC 1・ISO/TC 43/SC 2・ISO/TC 43/SC 3総会 : 音響に関する国際規格の審議状況 : 2024ベルリン会議(ハイブリッド開催)—ISO/TC 43, ISO/TC 43/SC 1, ISO/TC 43/SC 2 and ISO/TC 43/SC 3 Plenary Meetings : Progress Report of International Standardization on Acoustics : 2024 Berlin Meetings (Held in Hybrid Format)
今泉 博之, 佐藤 洋, 高橋 弘宜, 倉片 憲治, 赤松 友成, 鈴木 陽一, 藤坂 洋一, 山崎 隆志, 鵜木 祐史, 桑野 園子, 山田 一郎, 高橋 幸雄, 杉江 聡, 吉村 純一, 横田 考俊, 小林 知尋, 下田 康平, 君塚 郁夫, 和田 将行, 白橋 良宏, 岡田 恭明, 大島 俊也, 森長 誠, 永幡 幸司, 須田 直樹, 平川 侑, 平光 厚雄, 佐藤 逸人, 澤田 浩一
騒音制御 = The journal of the INCE of Japan, 49, 3, 144-151, 2025
Study on urgency perception of noise-vocoded speech with controlled instantaneous modulation components of temporal amplitude envelope
房野早希, GUO Taiyang, 磯山拓都, 木谷俊介, 鵜木祐史
日本音響学会研究発表会講演論文集(CD-ROM), 2025, -, 2025
Study on speech watermarking method with time-stretching and compression process.
磯山拓都, 鵜木祐史
日本音響学会研究発表会講演論文集(CD-ROM), 2025, -, 2025
Changes in urgency perception by temporal stretching and compression of temporal amplitude envelope of speech
房野早希, GUO Taiyang, 磯山拓都, 木谷俊介, 鵜木祐史
電子情報通信学会技術研究報告(Web), 124, 271(EA2024 43-63), -, 2024
Study on evaluations of sensory pleasantness using sound quality metrics
谷口亮太郎, 磯山拓都, 上江洲安史, 木谷俊介, 鵜木祐史
電子情報通信学会技術研究報告(Web), 124, 94(EA2024 10-30), -, 2024
◆Books
「マスキング」,音響キーワードブック 日本音響学会編
コロナ社ISBN:978-4-339-00880-7, 2016
Method of Digital-Audio Watermarking Based on Cochlear Delay Characteristics, Multimedia Information Hiding Technologies and Methodologies for Controlling Data, Ed. Kazuhiko Kondo, Chapter 2
pp. 42-70, IGI Global, 2012
聴覚モデル
コロナ社, 2011
Effects of spatial cues on detectability of alarm signals in noisy environments, PRINCIPLES AND APPLICATIONS OF SPATIAL HEARING, Edited by Yoiti Suzuki, Douglas Brungart, Hiroaki Kato, Kazuhiro Iida, Densil Cabrera, & Yukio Iwaya
484-493, World Scientific,, 2011
◆Conference Activities & Talks
緊迫感知覚に寄与する音声の振幅包絡線情報の検討
日本音響学会騒音振動研究会, 2019
楽音を模した調波複合音の雑音駆動合成音のピッチ知覚の検討
日本音響学会聴覚研究会, 2019
Study on cochlear-delay based audio information hiding by linear time-variant IIR filter
日本音響学会2019年度秋季研究発表会, 2019
骨導音声の外耳道内放射特性の推定
日本音響学秋季研究発表会, 2019
雑音残響環境における雑音駆動音声の非言語情報知覚の検討
日本音響学会聴覚研究会, 2019

■Contributions to  Society

◆Academic Society Affiliations
信号処理学会, International Speech Communication Association, 電子情報通信学会, 日本音響学会, Research Institute of Signal Processing, Japan, Information and Communication Engineers, Institute of Electronics, The Acoustical Society of Japan, Institute of Electrical and Electronics Engineers, Acoustical Society of America
◆Academic Contribution
2017 International Workshop on Nonlinear Circuits and Signal Processing (NCSP17) Committee member (General Chair) , JAIST, Prof. Unoki Masashi , 2017 - 2017 , Guam, USA
2016 International Workshop on Nonlinear Circuits and Signal Processing (NCSP16) Committee member (General Vice Chair) , RISP , 2016 - 2016 , Honolulu, Hawaii, USA
The 14th IWDW, International Workshop on Digital-forensics and Watermarking (IWDW 2015) , Organizaing committee, , 2015 - 2015 , Tokyo University of Sciences

■Academic  Awards

・ Best paper award , Masashi Unoki , 11th International Conference on Social Computing and Social Media (SCSM 2019) , 2019
・ Best paper award , Masashi Unoki , 14th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (II , 2018
・ 平成30年度支部学会活動貢献賞 , 日本音響学会北陸支部 , 2018