AKAGI, Masato Professor
School of Information Science, Human Life Design Area, School of Information Science

Published Papers

270 items
Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model
Yongwei Li, Ken-Ichi Sakakibara, Masato Akagi
Journal of Signal Processing Systems, 92, 8, 831-838, 2020
A Two-Stage Phase-Aware Approach for Monaural Multi-Talker Speech Separation
Lu Yin, Junfeng Li, Yonghong Yan, Masato Akagi
IEICE Trans. Information and Systems, E103-D, 7, 1732-1743, 2020
The Effect of Silence Feature in Dimensional Speech Emotion Recognition
Bagus Tris Atmaja, Masato Akagi
10th International Conference on Speech Prosody 2020, -, 2020
Mimicking Lombard effect: An analysis and reconstruction
Thuan Van Ngo, Rieko Kubo, Masato Akagi
IEICE Trans. Information and Systems, E103-D, 5, 1108-1117, 2020
Multitask Learning and Multistage Fusion for Dimensional Audiovisual Emotion Recognition
Bagus Tris Atmaja, Masato Akagi
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4482-4486, 2020
Effect of articulatory and acoustic features on the intelligibility of speech in noise: An articulatory synthesis study
Thuanvan Ngo, Masato Akagi, Peter Birkholz
Speech Communication, 117, 13-20, 2020
Dimensional speech emotion recognition from speech features and word embeddings by using multitask learning
Bagus Tris Atmaja, Masato Akagi
APSIPA Transactions on Signal and Information Processing, 9, 1-12, 2020
Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks with Auditory Front-Ends
Zhichao Peng, Xingfeng Li, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
IEEE Access, 8, 16560-16572, 2020
Non-parallel Voice Conversion with Controllable Speaker Individuality using Variational Autoencoder
Tuan Vu Ho, Masato Akagi
Proc. APSIPA2019, Lanzhou, China, 106-111, 2019
Evaluation of the Lombard Effect Model on Synthesizing Lombard Speech in Varying Noise Level Environments with Limited Data
Thuan Van Ngo, Rieko Kubo, Masato Akagi
Proc. APSIPA2019, Lanzhou, China, 133-137, 2019
Speech Emotion Recognition Using Speech Feature and Word Embedding
Bagus Tris Atmaja, Kiyoaki Shirai, Masato Akagi
Proc. APSIPA2019, Lanzhou, China, 519-523, 2019
Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Network
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
Proc. APSIPA2019, Lanzhou, China, 524-528, 2019
Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking
Feng Li, Kaizhi Qian, Mark Hasegawa-Johnson, Masato Akagi
Proc. APSIPA2019, Lanzhou, China, 1239-1243, 2019
How the temporal amplitude envelope of speech contributes to urgency perception
Masashi Unoki, Miho Kawamura, Maori Kobayashi, Shunsuke Kidani, Masato Akagi
Proc. The 23rd International Congress of Acoustics, Aachen, Germany, 1739-1744, 2019
The Contribution of Acoustic Features Analysis to Model Emotion Perceptual Process for Language Diversity
Xingfeng Li, Masato Akagi
Proc. InterSpeech2019, Graz, Austria, 3262-3266, 2019
Blind Monaural Singing Voice Separation Using Rank-1 Constraint Robust Principal Component Analysis and Vocal Activity Detection
Feng Li, Masato Akagi
Neurocomputing, 350, 44-52, 2019
Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model
Bagus Tris Atmaja, Masato Akagi
Proc. ICSigSys, The 3rd International Conference on Signals and Systems 2019, Bandung, Indonesia, 41-45, 2019
Deep Learning-based Categorical and Dimensional Emotion Recognition for Written and Spoken Text
Bagus Tris Atmaja, Kiyoaki Shirai, Masato Akagi
Proc. ISST2019, 5th International Seminar on Science and Technology, Surabaya, Indonesia, -, 2019
Psychological evaluation of evacuation announcements
Maori Kobayashi, Masato Akagi
Journal of the Acoustical Society of Japan, 74, 12, 633-640, 2018
Maximal Information Coefficient and Predominant Correlation-Based Feature Selection Toward A Three-Layer Model for Speech Emotion Recognition
Xingfeng Li, Masato Akagi
Proc. APSIPA2018, Honolulu, USA, 1428-1434, 2018
Estimation of glottal source waveforms and vocal tract shape for singing voices with wide frequency range
Kyoko Takahashi, Masato Akagi
Proc. APSIPA2018, Honolulu, USA, 1879-1887, 2018
Unsupervised Singing Voice Separation Using Gammatone Auditory Filterbank and Constraint Robust Principal Component Analysis
Feng Li, Masato Akagi
Proc. APSIPA2018, Honolulu, USA, 1924-1928, 2018
Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model
Yongwei Li, Ken-Ichi Sakakibara, Masato Akagi
Proc. ISCSLP2018, Taipei, Taiwan, -, 2018
A Three-Layer Emotion Perception Model for Valence and Arousal-Based Detection from Multilingual Speech
Xingfeng Li, Masato Akagi
Proc. InterSpeech2018, Hyderabad, India, 3643-3647, 2018
Unsupervised Singing Voice Separation Based on Robust Principal Component Analysis Exploiting Rank-1 Constraint
Feng Li, Masato Akagi
Proc. EUSIPCO2018, Rome, Italy, 1934-1938, 2018
Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space
Yawen Xue, Yasuhiro Hamada, Masato Akagi
Speech Communication, 102, 54-67, 2018
Acoustic features of intelligible speech produced under reverberant environments
Rieko Kubo, Masato Akagi
The Journal of the Acoustical Society of America, 144, 3, 1802-1802, 2018
Acoustic features in speech for emergency perception
Maori Kobayashi, Yasuhiro Hamada, Masato Akagi
The Journal of the Acoustical Society of America, 144, 3, 1835-1835, 2018
Non-parallel Dictionary-based Voice Conversion using Variational Autoencoder with Modulation Spectrum-constrained Training
Tuan Vu Ho, Masato Akagi
Journal of Signal Processing, 22, 4, 189-192, 2018
Contributions of the glottal source and vocal tract cues to emotional vowel perception in the valence-arousal space
Yongwei Li, Junfeng Li, Masato Akagi
J. Acoust. Soc. Am., 144, 2, 908-916, 2018
Auditory-Inspired End-to-End Speech Emotion Recognition using 3D Convolutional Recurrent Neural Networks based on Spectral Temporal Representation
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
Proc. ICME2018, San Diego, USA, -, 2018
Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model.
Yongwei Li, Ken-Ichi Sakakibara, Masato Akagi
11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018, Taipei City, Taiwan, November 26-29, 2018, 230-234, 2018
Speech Emotion Recognition Using MPCRNN based on Gammatone auditory Filterbank
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
APSIPA2017, -, 2017
Study on Method for Protecting Speech Privacy by Actively Controlling Speech Transmission Index in Simulated Room
Masashi Unoki, Yuta Kashihara, Maori Kobayashi, Masato Akagi
APSIPA2017, -, 2017
Method of Blindly Estimating Speech Transmission Index in Noisy Reverberant Environments
Masashi Unoki, Akikazu Miyazaki, Shota Morita, Masato Akagi
Journal of Information Hiding and Multimedia Signal Processing, International, 8, 6, 1430-1445, 2017
Method of Estimating Signal-to-Noise Ratio Based on Optimal Design for Sub-band Voice Activity Detection
Shota Morita, Xugang Lu, Masashi Unoki, Masato Akagi
Journal of Information Hiding and Multimedia Signal Processing, International, 8, 6, 1446-1459, 2017
Feature Selection Method for Real-time Speech Emotion Recognition
Reda Elbarougy, Masato Akagi
O-COCOSDA2017, 86-91, 2017
Commonalities of glottal sources and vocal tract shapes among speakers in emotional speech
Li, Y, Sakakibara, K-I, Morikawa, D, Akagi, M
ISSP2017, -, 2017
Acoustical analyses of tendencies of intelligibility in Lombard speech with different background noise levels
Ngo, T. V, Kubo, R, Morikawa, D, Akagi, M
Journal of Signal Processing, 21, 4, 171-174, 2017
Weighted robust principal component analysis with gammatone auditory filterbank for singing voice separation
Feng Li, Masato Akagi
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 10639, 849-858, 2017
Toward affective speech-to-speech translation
Masato Akagi
Advances in Intelligent Systems and Computing, 538, -, 2017
Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank.
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 1750-1755, 2017
Optimizing Fuzzy Inference Systems for Improving Speech Emotion Recognition
Reda Elbarougy, Masato Akagi
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 533, 85-95, 2017
Quality Improvement of Vietnamese HMM-Based Speech Synthesis System Based on Decomposition of Naturalness and Intelligibility Using Non-negative Matrix Factorization
Anh-Tuan Dinh, Thanh-Son Phan, Masato Akagi
ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 538, 490-499, 2017
Voice conversion system to emotional speech in multiple languages based on three-layered model for dimensional space
Xue, Y, Hamada, Y, Elbarougy, R, Akagi, M
O-COCOSDA2016, Bali, Indonesia, 122-127, 2016
Quality Improvement of HMM-based Synthesized Speech Based on Decomposition of Naturalness and Intelligibility using Non-Negative Matrix Factorization
Dinh, A. T, Akagi, M
O-COCOSDA2016, Bali, Indonesia, 62-67, 2016
Effects of speaker's and listener's acoustic environments on speech intelligibility and annoyance
Kubo, R, Morikawa, D, Akagi, M
Proc. Inter-Noise2016, Hamburg, Germany, 171-176, 2016
Study on quality improvement of HMM-based synthesized voices using asymmetric bilinear model
Dinh, T. A, Morikawa, D, Akagi, M
Journal of Signal Processing, 20, 4, 205-208, 2016
A study on quality improvement of HMM-based synthesized voices using asymmetric bilinear model
Dinh, T. A, Morikawa, D, Akagi, M
Proc. NCSP2016, Honolulu, HI, USA, 13-16, 2016
Automatic Speech Emotion Recognition in Chinese Using a Three-layered Model in Dimensional Approach
Li, X, Akagi, M
Proc. NCSP2016, Honolulu, HI, USA, 17-20, 2016
A study on applying target prediction model to parameterize power envelope of emotional speech
Xue, Y, Akagi, M
Proc. NCSP2016, Honolulu, HI, USA, 157-160, 2016
Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments
Shota Morita, Masashi Unoki, Xugang Lu, Masato Akagi
Journal of Signal Processing Systems, 82, 2, 163-173, 2016
Multilingual Speech Emotion Recognition System based on a Three-layer Model
Xingfeng Li, Masato Akagi
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5, 3608-3612, 2016
Voice Conversion to Emotional Speech based on Three-layered Model in Dimensional Approach and Parameterization of Dynamic Features in Prosody
Yawen Xue, Yasuhiro Hamada, Masato Akagi
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), -, 2016
Preliminary Study on Blind Estimation of Room Acoustic Parameters in Noisy Reverberant Environments
Unoki, M, Morita, S, Miyazaki, A, Akagi, M
Proc. WESPAC2015, Singapore, 428-435, 2015
Dependence on age of interference with phoneme perception by first- and second-language speech maskers
Kubo, R, Akagi, M, Akahane-Yamada, R
Acoustical Science and Technology, 36, 5, 397-407, 2015
A speech dereverberation method based on the MTF concept using adaptive time-frequency divisions
Masashi Unoki, Masato Toi, Masato Akagi
European Signal Processing Conference, 06-10-September-2004, 1689-1692, 2015
A study on perception of emotional states in multiple languages on Valence-Activation approach
Xiao Han, Reda Elbarougy, Masato Akagi, Junfeng Li, Thi Duyen Ngo, The Duy Bui
Proc NCSP2015, Kuala Lumpur, Malaysia, -, 2015
Improving the naturalness of concatenative Vietnamese speech synthesis under limited data conditions
Phung Trung Nghia, Luong Chi Mai, Masato Akagi
Journal of Computer Science and Cybernetics, 31, 1, 1-16, 2015
Toward a rule-based synthesis of Vietnamese emotional speech
Thi Duyen Ngo, Masato Akagi, The Duy Bui
Advances in Intelligent Systems and Computing, 326, 129-142, 2015
Study on method to control fundamental frequency contour related to a position on Valence-Activation space
Hamada, Y, Elbarougy, R, Xue, Y, Akagi, M
Proc. WESPAC2015, Singapore, 519-522, 2015
Emotional speech synthesis system based on a three-layered model using a dimensional approach
Yawen Xue, Yasuhiro Hamada, Masato Akagi
2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 505-514, 2015
Toward Improving Estimation Accuracy of Emotion Dimensions in Bilingual Scenario Based on Three-layered Model
Xingfeng Li, Masato Akagi
2015 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2015 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 21-26, 2015
Investigation of objective measures for intelligibility prediction of noise-reduced speech for Chinese, Japanese, and English
Junfeng Li, Risheng Xia, Dongwen Ying, Yonghong Yan, Masato Akagi
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 136, 6, 3301-3312, 2014
Binaural Sound Source Localization in Noisy Reverberant Environments Based on Equalization-Cancellation Theory
Thanh-Duc Chau, Junfeng Li, Masato Akagi
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, E97A, 10, 2011-2020, 2014
Toward Relaying Emotional State for Speech-To-Speech Translator: Estimation of Emotional State for Synthesizing Speech with Emotion
Akagi, M, Elbarougy, R
Proc. ICSV2014, Beijing, -, 2014
Perception of second language phoneme masked by first- or second-language speech in 20 – 60 years old listeners
Kubo, R, Akagi, M, Akahane-Yamada, R
167th ASA, Providence, RI, -, 2014
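Speech privacy protection based on the concept of auditory scene analysis
Masato Akagi, Yoshihiro Irie
IEICE Transactions A (Japanese Edition), J97-A, 4, 247-255, 2014
Precise frequency measurement method for F0 estimation of string instruments
西江純教, Masato Akagi
IEICE Transactions A (Japanese Edition), J97-A, 4, 332-342, 2014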
Speech recognition in noisy conditions based on speech separation using Non-negative Matrix Factorization
Du, Y, Akagi, M
Proc. NCSP2014, Hawaii, USA, 429-432, 2014
Glottal source analysis of emotional speech
Li, Y, Akagi, M
Proc. NCSP2014, Hawaii, USA, 513-516, 2014
Improving Speech Emotion Dimensions Estimation Using a Three-Layer Model for Human Perception
Elbarougy, R, Akagi, M
Acoustical Science and Technology, 35, 2, 86-98, 2014
Study on Analyzing Individuality of Instrument Sounds Using Non-negative Matrix Factorization
Kobayashi, K, Morikawa, D, Akagi, M
Proc. NCSP2014, Hawaii, USA, 37-40, 2014
Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments
Shota Morita, Masashi Unoki, Xugang Lu, Masato Akagi
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 108-+, 2014
Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System
Masato Akagi, Xiao Han, Reda Elbarougy, Yasuhiro Hamada, Junfeng Li
2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 574-577, 2014
Toward relaying an affective Speech-to-Speech translator: Cross-language perception of emotional state represented by emotion dimensions
Reda Elbarougy, Han Xiao, Masato Akagi, Junfeng Li
2014 17TH ORIENTAL CHAPTER OF THE INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDIZATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (COCOSDA), 48-53, 2014
Toward Affective Speech-to-Speech Translation: Strategy for Emotional Speech Recognition and Synthesis in Multiple Languages
Masato Akagi, Xiao Han, Reda Elbarougy, Yasuhiro Hamada, Junfeng Li
2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), -, 2014
A Method for Emotional Speech Synthesis Based on the Position of Emotional State in Valence-Activation Space
Yasuhiro Hamada, Reda Elbarougy, Masato Akagi
2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), -, 2014
Improving naturalness of HMM-based TTS trained with limited data by temporal decomposition
Phung, T. N, Phan, T. S, Vu, T. T, Luong, M. C, Akagi, M
IEICE Trans. Inf. & Syst., E96-D, 11, 2417-2426, 2013
A Hybrid TTS between Unit Selection and HMM-based TTS under limited data conditions
Phung, T. N, Luong, M. C, Akagi, M
Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 281-284, 2013
Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio
Chau, D. T, Li, J, Akagi, M
Proc. ChinaSIP2013, Beijing, 322-326, 2013
Exploring auditory aging can exclusively explain Japanese adults' age-related decrease in training effects of American English /r/-/l/
Kubo, R, Akagi, M
Proc. ICA2013, 2aSC34, Montreal, -, 2013
A singing voices synthesis system to characterize vocal registers using ARX-LF model
Motoda, H, Akagi, M
Proc. NCSP2013, Hawaii, USA, 93-96, 2013
A Study on individualization of Head-Related Transfer Function in the median plane
Hisatsune, H, Akagi, M
Proc. NCSP2013, Hawaii, USA, 161-164, 2013
Objective Japanese intelligibility prediction for noisy speech signals before and after noise-reduction processing
Junfeng Li, Masato Akagi, Yonghong Yan
2013 IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2013 - Proceedings, 352-355, 2013
Blind method of estimating speech transmission index from reverberant speech signals.
Masashi Unoki, Kyohei Sasaki, Ryota Miyauchi, Masato Akagi, Nam Soo Kim
21st European Signal Processing Conference, EUSIPCO 2013, Marrakech, Morocco, September 9-13, 2013, 1-5, 2013
Blind method of estimating speech transmission index in room acoustics based on concept of modulation transfer function
Masashi Unoki, Tomohiro Ikeda, Kyohei Sasaki, Ryota Miyauchi, Masato Akagi, Nam Soo Kim
2013 IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2013 - Proceedings, 308-312, 2013
Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese
Junfeng Li, Fei Chen, Masato Akagi, Yonghong Yan
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 1183-1186, 2013
Admissible range for individualization of head-related transfer function in median plane
Masato Akagi, Hideki Hisatsune
2013 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2013), 326-329, 2013
Cross-lingual Speech Emotion Recognition System Based on a Three-Layer Model for Human Perception
Reda Elbarougy, Masato Akagi
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), -, 2013
A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model
Phung, T. N, Unoki, M, Akagi, M
Journal of Signal Processing, 16, 5, 409-417, 2012
Privacy protection for speech based on concepts of auditory scene analysis
Akagi, M, Irie, Y
Proc. INTERNOISE2012, -, 2012
Speech enhancement technique in noisy reverberant environment using two microphone arrays
Sasaki, Y, Akagi, M
Proc. NCSP2012, 333-336, 2012
Study on hearing impression of speaker identification focusing on dynamic features
Izumida, T, Akagi, M
Proc. NCSP2012, Honolulu, 401-404, 2012
EVALUATION OF OBJECTIVE INTELLIGIBILITY PREDICTION MEASURES FOR NOISE-REDUCED SIGNALS IN MANDARIN
Risheng Xia, Junfeng Li, Masato Akagi, Yonghong Yan
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 4465-4468, 2012
Speech Emotion Recognition System Based on a Dimensional Approach Using a Three-Layered Model
Reda Elbarougy, Masato Akagi
2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), -, 2012
A concatenative speech synthesis for monosyllabic languages with limited data
Trung-Nghia Phung, Mai Chi Luong, Masato Akagi
2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), -, 2012
TRANSFORMATION OF F0 CONTOURS FOR LEXICAL TONES IN CONCATENATIVE SPEECH SYNTHESIS OF TONAL LANGUAGES
Trung-Nghia Phung, Mai Chi Luong, Masato Akagi
2012 INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 129-134, 2012
MTF-based sub-band power-envelope restoration for robust speech recognition in noisy reverberant environments
Shota Morita, Xugang Lu, Masashi Unoki, Masato Akagi, Rüdiger Hoffmann
APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, 21-25, 2011
Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication
Li, J, Sakamoto, S, Hongo, S, Akagi, M, Suzuki, Y
Speech Communication, 53, 677-689, 2011
Comparative intelligibility investigation of single-channel noise-reduction algorithms for Chinese, Japanese, and English
Junfeng Li, Lin Yang, Jianping Zhang, Yonghong Yan, Yi Hu, Masato Akagi, Philipos C. Loizou
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 129, 5, 3291-3301, 2011
Towards an intelligent binaural speech enhancement system by integrating meaningful signal extraction
Chau, D. T, Li, J, Akagi, M
Proc. NCSP2011, 344-347, 2011
Influences of transformed auditory feedback with first three formant frequencies
Shih, T, Suemitsu, A, Akagi, M
Proc. NCSP2011, 340-343, 2011
Study on blind estimation of Speech Transmission Index in room acoustics
Ikeda, T, Unoki, M, Akagi, M
Proc. NCSP2011, 235-238, 2011
Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals
Yanao, Y, Miyauchi, R, Unoki, M, Akagi, M
Proc. NCSP2011, 231-234, 2011
A binaural model accounting for spatial masking release
Mizukawa, S, Akagi, M
Proc. NCSP2011, 179-182, 2011
Study on suitable-architecture of IIR all-pass filter for digital-audio watermarking technique based on cochlear-delay characteristics
Kosugi, T, Haniu, A, Miyauchi, R, Unoki, M, Akagi, M
Proc. NCSP2011, 135-138, 2011
An investigation on perceptual line spectral frequency (PLP-LSF) target stability against the vowel neutralization phenomenon
Trung-Nghia Phung, Mai Chi Luong, Masato Akagi
Proc. ICSAP2011, 512-514, 2011
An investigation on speech perception over coarticulation
Trung-Nghia Phung, Mai Chi Luong, Masato Akagi
Proc. ICSAP2011, 507-511, 2011
Study on MTF-based power envelope restoration in noisy reverberant environments
Morita, S, Lu, X, Unoki, M, Akagi, M
Proc. NCSP2011, 247-250, 2011
Towards intelligent binaural speech enhancement by meaningful sound extraction
Chau, D. T, Li, J, Akagi, M
Journal of Signal Processing, 15, 4, 291-294, 2011
Voice activity detection in MTF-based power envelope restoration
Masashi Unoki, Xugang Lu, Rico Petrick, Shota Morita, Masato Akagi, Ruediger Hoffmann
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2620-2623, 2011
A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
Yu Zhou, Junfeng Li, Yanqing Sun, Jianping Zhang, Yonghong Yan, Masato Akagi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E93D, 10, 2813-2821, 2010
A study on the IMTF-based filtering on the modulation spectrum of reverberant signal
Shota Morita, Masashi Unoki, Masato Akagi
Journal of Signal Processing, 14, 4, 269-272, 2010
A study on the MTF-based inverse filtering for the modulation spectrum of reverberant speech
Morita, S, Unoki, M, Akagi, M
Proc. NCSP10, -, 2010
Experimental evaluations of TS-BASE/WF in reverberant conditions
Li, J, Sasaki, Y, Akagi, M, Yan, Y
Proc. NCSP10, -, 2010
A study on brain activities elicited by synthesized emotional voices controlled with prosodic features
Hamada, Y, Kitamura, T, Akagi, M
Proc. NCSP10, -, 2010
Pitch perception of complex sounds with varied fundamental frequency and spectral tilt
Ishida, M, Akagi, M
Proc. NCSP10, -, 2010
Intelligibility investigation of single-channel noise reduction algorithms for Chinese and Japanese
Junfeng Li, Lin Yang, Yonghong Yan, Chau Duc Thanh, Masato Akagi
2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings, 7-11, 2010
A DOA Estimation Algorithm based on Equalization-Cancellation Theory
Duc Thanh Chau, Junfeng Li, Masato Akagi
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2774-2777, 2010
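Recognition of emotional information in speech: how should the emotion space be represented?
Masato Akagi
Journal of the Acoustical Society of Japan, 66, 8, 393-398, 2010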
Effects of spatial cues on detectability of alarm signals in noisy environments
Naoki Kuroda, Junfeng Li, Yukio Iwaya, Masashi Unoki, Masato Akagi
Proc. IWPASH2009, -, 2009
Two-stage binaural speech enhancement with Wiener filter based on equalization-cancellation model
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yôiti Suzuki
Proc. WASPAA2009, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, -, 2009
Advancement of two-stage binaural speech enhancement (TS-BASE) for high-quality speech communication
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yôiti Suzuki
Proc. The 10th Western Pacific Acoustics Conference, -, 2009
Physiologically-inspired feature extraction for emotion recognition
Yu Zhou, Yanqing Sun, Junfeng Li, Jianping Zhang, Yonghong Yan
Proc. Interspeech2009, -, 2009
Speech-to-Singing Synthesis System: Vocal Conversion from Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices
Saitou, T, Goto, M, Unoki, M, Akagi, M
NCMMSC2009, -, 2009
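Proposal of a functional model of the auditory peripheral system: modeling phase locking of the auditory nerve and the spike generation mechanism
Maki, Akagi, Hirota
Journal of the Acoustical Society of Japan, 65, 5, 239-250, 2009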
An emotional speech recognition system based on multi-layer emotional speech perception model
Aoki, Y, Huang, C-F, Akagi, M
Proc. NCSP'09, 133-136, 2009
An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech
Kinugasa, K, Unoki, M, Akagi, M
Proc. NCSP'09, 105-108, 2009
Effects from Spatial Cues on Detectability of Alarm Signals in Car Environments
Kuroda, N, Li, J, Iwaya, Y, Unoki, M, Akagi, M
Proc. NCSP'09, 45-48, 2009
A flexible spectral modification method based on temporal decomposition and Gaussian mixture model
Binh Phu Nguyen, Masato Akagi
Acoustical Science and Technology, 30, 3, 170-179, 2009
Efficient Modeling of Temporal Structure of Speech For Applications in Voice Transformation
Binh Phu Nguyen, Masato Akagi
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 1599-1602, 2009
A psychoacoustically-motivated conceptual model for automatic speech recognition
Haniu, A, Unoki, M, Akagi, M
Proc. WESPAC2009, -, 2009
Advancement of two-stage binaural speech enhancement (TS-BASE) for high-quality speech communication
Li, J, Sakamoto, S, Hongo, S, Akagi, M, Suzuki, Y
Proc. WESPAC2009, -, 2009
Comparison of emotion perception among different cultures
Dang, J, Li, A, Erickson, D, Suemitsu, A, Akagi, M, Sakuraba, K, Minematsu, N, Hirose, K
Proc. APSIPA2009, -, 2009
Analysis of production and perception characteristics of non-linguistic information in speech and its application to inter-language communications
Akagi, M
Proc. APSIPA2009, -, 2009
PSYCHOACOUSTICALLY-MOTIVATED ADAPTIVE beta-ORDER GENERALIZED SPECTRAL SUBTRACTION FOR COCHLEAR IMPLANT PATIENTS
Junfeng Li, Qian-Jie Fu, Hui Jiang, Masato Akagi
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 4665-+, 2009
Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems
Lu, X, Unoki, M, Akagi, M
Acoustical Science and Technology, 29, 6, 351-361, 2008
Adaptive beta-order generalized spectral subtraction for speech enhancement
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yoiti Suzuki
SIGNAL PROCESSING, 88, 11, 2764-2776, 2008
A three-layered model for expressive speech perception
Chun-Fang Huang, Masato Akagi
SPEECH COMMUNICATION, 50, 10, 810-828, 2008
Improved two-stage binaural speech enhancement based on accurate interference estimation for hearing aids
Li, J, Sakamoto, S, Hongo, S, Akagi, M, Suzuki, Y
IHCON2008, -, 2008
Improved two-stage binaural speech enhancement based on accurate interference estimation for hearing aids
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yôiti Suzuki
in Proc. International Hearing Aid Research Conference, -, 2008
Experimental evaluation of the two-stage binaural speech enhancement with Wiener filter for speech enhancement and sound localization
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yôiti Suzuki
Proc. ISAAR2009, -, 2008
Comparison of Japanese expressive speech perception by Japanese and Taiwanese listeners
Huang, C. F, Erickson, D, Akagi, M
Acoustics2008, Paris, 2317-2322, 2008
An MTF-based blind restoration for temporal power envelopes as a front-end processor for automatic speech recognition systems in reverberant environments
Lu, X, Unoki, M, Akagi, M
Acoustics2008, Paris, 1419-1424, 2008
A two-stage binaural speech enhancement approach for hearing aids with preserving binaural benefits in noisy environments
Li, J, Sakamoto, S, Hongo, S, Akagi, M, Suzuki, Y
Acoustics2008, Paris, 723-727, 2008
A hybrid microphone array post-filter in a diffuse noise field
Junfeng Li, Masato Akagi
APPLIED ACOUSTICS, 69, 6, 546-557, 2008
A two-microphone noise reduction method in highly non-stationary multiple-noise-source environments
Junfeng Li, Masato Akagi, Yoiti Suzuki
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, E91A, 6, 1337-1346, 2008
A two-stage binaural speech enhancement approach for hearing aids with preserving binaural benefits in noisy environments
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yo-iti Suzuki
in Proc. Forum Acusticum 2008, pp. 723-727, 2008
Two-input two-output speech enhancement using adaptive filter and soft decision mask filter
Ai Sasaki, Shuichi Sakamoto, Satoshi Hongo, Junfeng Li, Yo-iti Suzuki
in Proc. the 3rd International Symposium on Medical, Bio- and Nano-Electronics, -, 2008
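Analysis of acoustic features unique to singing voices based on a perceptual model of singing-ness
Saitou, Tsuji, Unoki, Akagi
Journal of the Acoustical Society of Japan, 64, 5, 267-277, 2008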
A study on nonlinguistic feature in singing and speaking voices by brain activity measurement
Nakamura, T, Kitamura, T, Akagi, M
Proc. NCSP'09, 217-220, 2008
Extension of the two-microphone noise reduction method for binaural hearing aids
Junfeng Li, Masato Akagi, Yoiti Suzuki
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 97-101, 2008
Psychoacoustically-motivated Adaptive beta-order Generalized Spectral Subtraction Based on Data-driven Optimization
Junfeng Li, Hui Jiang, Masato Akagi
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 171-+, 2008
The improved TS-BASE approaches with interference compensation and their evaluations for speech enhancement
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yoiti Suzuki
2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 141-144, 2008
An LP-based blind model for restoring bone-conducted speech
Thang Tat Vu, Masashi Unoki, Masato Akagi
2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 210-215, 2008
Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture model
Binh Phu Nguyen, Masato Akagi
2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 222-227, 2008
Robust Front End Processing for Speech Recognition in Reverberant Environments: Utilization of Speech Characteristics
Rico Petrick, Xugang Lu, Masashi Unoki, Masato Akagi, Ruediger Hoffmann
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 658-+, 2008
High-Quality Analysis/Synthesis Method Based on Temporal Decomposition for Speech Modification
Binh Phu Nguyen, Takeshi Shibata, Masato Akagi
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 662-665, 2008
Limited error based event localizing temporal decomposition and its application to variable-rate speech coding
Phu Chien Nguyen, Masato Akagi, Binh Phu Nguyen
SPEECH COMMUNICATION, 49, 4, 292-304, 2007
Foreword to the special issue on "applied systems"
Masato Akagi
Acoustical Science and Technology, 28, 3, 139, 2007
Noise reduction based on microphone array and post-filtering for robust speech recognition in car environments
Junfeng Li, Masato Akagi
ADVANCES FOR IN-VEHICLE AND MOBILE SYSTEMS: CHALLENGES FOR INTERNATIONAL STANDARDS, 153-166, 2007
Spectral Modification for Voice Gender Conversion using Temporal Decomposition
Nguyen B. P, Akagi M
Journal of Signal Processing, 11, 4, 333-336, 2007
Voice conversion to add non-linguistic information into speaking voices
Akagi, M, Saitou, T, Huang, C-F
Proc. JCA2007, -, 2007
Common factors in emotion perception among different cultures
Sawamura K, Dang J, Akagi M, Erickson D, Li, A, Sakuraba, K, Minematsu, N, Hirose, K
Proc. ICPhS2007, 2113-2116, 2007
A Flexible Spectral Modification Method based on Temporal Decomposition and Gaussian Mixture Model
Binh Phu Nguyen, Masato Akagi
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 597-600, 2007
A Rule-Based Speech Morphing for Verifying an Expressive Speech Perception Model
Chun-Fang Huang, Masato Akagi
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 1221-1224, 2007
Noise Reduction Based on Adaptive beta-Order Generalized Spectral Subtraction for Speech Enhancement
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yoiti Suzuki
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 577-+, 2007
Vocal conversion from speaking voice to singing voice using STRAIGHT
Saitou, T, Goto, M, Unoki, M, Akagi, M
Proc. Interspeech2007, Singing Challenge, -, 2007
Method of LP-based blind restoration for improving intelligibility of bone-conducted speech
Thang Tat Vu, Germine Seide, Masashi Unoki, Masato Akagi
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 1885-1888, 2007
Improvement in detectability of alarm signals in noisy environments by utilizing spatial cues
Hideaki Uchiyama, Masashi Unoki, Masato Akagi
2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 177-180, 2007
Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voices
Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi
2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 113-+, 2007
The Construction of Large-scale Bone-conducted and Air-conducted Speech Databases for Speech Intelligibility Tests
Vu, T. T, Unoki, M, Akagi, M
Proc. Oriental COCOSDA2007, 88-91, 2007
A Study on Restoration of Bone-Conducted Speech with MTF-Based and LP-based Models
Vu, T, Unoki, M, Akagi, M
Journal of Signal Processing, 10, 6, 407-417, 2006
A Model-Concept of the Selective Sound Segregation: A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments
Unoki, M, Kubo, M, Haniu, A, Akagi, M
Journal of Signal Processing, 10, 6, 419-431, 2006
Noise reduction method based on generalized subtractive beamformer
Li, J, Akagi, M
Acoust. Sci. & Tech., The journal of the Acoustical Society of Japan, 27, 4, 206-215, 2006
Effect of ITD and component frequencies on perception of alarm signals in noisy environments
Nakanishi, J, Unoki, M, Akagi, M
Journal of Signal Processing, 10, 4, 231-234, 2006
Effects of complicated vocal tract shapes on vocal tract transfer functions
Nishimoto, H, Akagi, M
Journal of Signal Processing, 10, 4, 267-270, 2006
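A study on the effectiveness of estimating vocal tract transfer characteristics using the finite element method
Nishimoto, Akagi, Kitamura, Suzuki
Journal of the Acoustical Society of Japan, 62, 4, 306-315, 2006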
Communication between speech production and perception within the brain - Observation and simulation
Jianwu Dang, Masato Akagi, Kiyoshi Honda
Journal of Computer Science and Technology, 21, 1, 95-105, 2006
Noise reduction based on microphone array and post-filtering for robust speech recognition
Junfeng Li, Masato Akagi, Yoiti Suzuki
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 680-+, 2006
Multi-channel noise reduction in noisy environments
Junfeng Li, Masato Akagi, Yoiti Suzuki
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 4274, 258-+, 2006
Effect of ITD and component frequencies on perception of alarm signals in noisy environments
Nakanishi, J, Unoki, M, Akagi, M
Proc. NCSP2006, 37-40, 2006
Effects of complicated vocal tract shapes on vocal tract transfer functions
Nishimoto, H, Akagi, M
Proc. NCSP2006, 114-117, 2006
Improved Hybrid Microphone Array Post-filter by Integrating a Robust Speech Absence Probability Estimator for Speech Enhancement
Junfeng Li, Masato Akagi, Yoiti Suzuki
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2130-+, 2006
A robust feature extraction based on the MTF concept for speech recognition in reverberant environment
Xugang Lu, Masashi Unoki, Masato Akagi
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2546-2549, 2006
A study on an LP-based model for restoring bone-conducted speech
Thang Tat Vu, Masashi Unoki, Masato Akagi
2006 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 294-+, 2006
Development of the MTF-based speech dereverberation method using adaptive time-frequency division
Masashi Unoki, Masato Toi, Masato Akagi
Forum Acusticum Budapest 2005: 4th European Congress on Acustics, 51-56, 2005
Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis
T Saitou, M Unoki, M Akagi
SPEECH COMMUNICATION, 46, 3-4, 405-417, 2005
Perception of Hypernasality and its Physical Correlates
Yukie Kozaki-Yamaguchi, Noriko Suzuki, Yukihiro Fujita, Hidemi Yoshimasu, Masato Akagi, Teruo Amagasa
Oral Science International, 2, 1, 21-35, 2005
A computational model of cochlear nucleus neurons
K Maki, M Akagi
AUDITORY SIGNAL PROCESSING: PHYSIOLOGY, PSYCHOACOUSTICS, AND MODELS, 84-90, 2005
Study on improving regularity of neural phase locking in single neurons of AVCN via a computational model
K Ito, M Akagi
AUDITORY SIGNAL PROCESSING: PHYSIOLOGY, PSYCHOACOUSTICS, AND MODELS, 91-99, 2005
Toward a rule-based synthesis of emotional speech on linguistic descriptions of perception
CF Huang, M Akagi
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 3784, 366-373, 2005
A study on a speech recognition method based on the selective sound segregation in noisy environment
Haniu, A, Unoki, M, Akagi, M
Proc. NCSP05, 403-406, 2005
A noise reduction system in arbitrary noise environments and its applications to speech enhancement and speech recognition
JF Li, XG Lu, M Akagi
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5, 277-280, 2005
A Multi-Layer fuzzy logical model for emotional speech perception
Huang, C. F, Akagi, M
Proc. EuroSpeech2005, 417-420, 2005
A model for selective segregation of a target instrument sound from the mixed sound of various instruments
Unoki, M, Kubo, M, Haniu, A, Akagi, M
Proc. EuroSpeech2005, 2097-2100, 2005
A hybrid microphone array post-filter in a diffuse noise field
Li, J, Akagi, M
Proc. EuroSpeech2005, 2313-2316, 2005
Comparative analysis of the two-step reaction catalyzed by prokaryotic and eukaryotic phytochelatin synthase by an ion-pair liquid chromatography assay.
Naoki Tsuji, Shingo Nishikori, Sachiko Matsumoto, Osamu Iwabe, Kentaro Shiraki, Hitoshi Miyasaka, Masahiro Takagi, Kazumasa Hirata, Kazuhisa Miyamoto
Planta, 222, 181-191, 2005
An improved method based on the MTF concept for restoring the power envelope from a reverberant signal
Masashi Unoki, Masakazu Furukawa, Keigo Sakata, Masato Akagi
Acoustical Science and Technology, 25, 4, 232-242, 2004
Fundamental frequency estimation for noisy speech using entropy-weighted periodic and harmonic features
Y Ishimoto, K Ishizuka, K Aikawa, M Akagi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E87D, 1, 205-214, 2004
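Proposal of a neural circuit model for the frequency response characteristics of dorsal cochlear nucleus cells
Maki, Akagi, Hirota
Journal of the Acoustical Society of Japan, 60, 1, 3-11, 2004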
Temporal decomposition of speech and its application to speech coding and modification
Akagi, M, Nguyen, P. C
Proc. Special Workshop in MAUI (SWIM), 1,4-, 2004
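Proposal of a computational model for the temporal response characteristics of inferior colliculus cells
Maki, Akagi, Hirota
Journal of the Acoustical Society of Japan, 60, 6, 304-313, 2004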
Temporal decomposition of speech and its application to speech coding and modification
Akagi, M, Nguyen, P. C, Saitou, T, Tsuji, N, Unoki, M
Proc. KEST2004, 280-288, 2004
Analysis of acoustic features affecting “singing-ness” and its application to singing-voice synthesis from speaking-voice
Saitou, T, Tsuji, N, Unoki, M, Akagi, M
Proc. ICSLP2004, -, 2004
Noise reduction using hybrid noise estimation technique and post-filtering
Li, J, Akagi, M
Proc. ICSLP2004, -, 2004
A model for selective segregation of a target instrument sound from the mixed sound of various instruments
Unoki, M, Kubo, M, Akagi, M
Proc. ICMC2003, 295-298, 2003
Efficient quantization of speech excitation parameters using temporal decomposition
Nguyen, P. C, Akagi, M
Proc. EUROSPEECH2003, 449-452, 2003
A speech dereverberation method based on the MTF concept
Unoki, M, Sakata, K, Akagi, M
Proc. EUROSPEECH2003, 1417-1420, 2003
Modified restricted temporal decomposition and its application to low rate speech coding
PC Nguyen, T Ochi, M Akagi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E86D, 3, 397-405, 2003
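Time-frequency response patterns of neural firing in the early auditory system
Katsuhiro Maki, Kazuhito Ito, Masato Akagi
Journal of the Acoustical Society of Japan, 59, 1, 52-58, 2003
Response characteristics of a ventral cochlear nucleus cell model to amplitude-modulated sounds
Katsuhiro Maki, Masato Akagi, Kaoru Hirota
Journal of the Acoustical Society of Japan, 59, 1, 13-22, 2003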
Development of the F0 control method for singing-voices synthesis
Saitou, T, Unoki, M, Akagi, M
Proc. SP2004, 491-49-, 2003
Temporal decomposition: A promising approach to VQ-based speaker identification
PC Nguyen, M Akagi, TB Ho
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, I, 184-187, 2003
A method based on the MTF concept for dereverberating the power envelope from the reverberant signal
M Unoki, M Furukawa, K Sakata, M Akagi
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, I, 888-891, 2003
Temporal decomposition: A promising approach to VQ-based speaker identification
PC Nguyen, M Akagi, TB Ho
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, III, 184-187, 2003
Perception of fundamental frequency fluctuation
Akagi, M
Forum Acusticum Sevilla 2002 (Invited), HEA-02-003-IP, 2002
A method for recovering the power envelope from reverberant speech
Unoki, M, Furukawa, M, Akagi, M
Forum Acusticum Sevilla 2002, SPA-Gen-002, 2002
Limited error based event localizing temporal decomposition
Nguyen, P. C, Akagi, M
Proc. EUSIPCO2002, 90, -, 2002
Coding speech at very low rates using STRAIGHT and temporal decomposition
Nguyen, P. C, Akagi, M
Proc. ICSLP2002, 1849-1852, 2002
Extraction of F0 dynamic characteristics and development of F0 control model in singing voice
Saitou, T, Unoki, M, Akagi, M
Proc. ICAD2002, 275-278, 2002
Speech enhancement and segregation based on human auditory mechanisms
M Akagi, M Mizumachi, Y Ishimoto, M Unoki
ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 186-196, 2002
Variable rate speech coding using straight and temporal decomposition
PC Nguyen, M Akagi
2002 IEEE SPEECH CODING WORKSHOP PROCEEDINGS, 26-28, 2002
Noise reduction using a small-scale microphone array in multi noise source environment
M Akagi, T Kago
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, I, 909-912, 2002
Improvement of the restricted temporal decomposition method for line spectral frequency parameters
PC Nguyen, M Akagi
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, I, 265-268, 2002
Perception of lateral misarticulation and its physical correlates
M Akagi, N Suzuki, K Hayashi, H Saito, K Michi
FOLIA PHONIATRICA ET LOGOPAEDICA, 53, 6, 291-307, 2001
A computational model of auditory sound localization
K Ito, M Akagi
COMPUTATIONAL MODELS OF AUDITORY FUNCTION, 312, 97-111, 2001
A computational model of co-modulation masking release
M Unoki, M Akagi
COMPUTATIONAL MODELS OF AUDITORY FUNCTION, 312, 221-232, 2001
Noisiness estimation of machine working noise using human auditory model
Akagi, M, Kakehi, M, Kawaguchi, M, Nishinuma, M, Ishigami, A
Proc. Internoise2001, 2451-2454, 2001
A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency
Ishimoto, Y, Unoki, M, Akagi, M
Proc. EUROSPEECH2001, 2439-2442, 2001
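Special issue: Achievements of the 20th century and remaining challenges for the 21st century in acoustics: The field of hearing
Akagi et al.
Journal of the Acoustical Society of Japan, -, 2000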
Speech enhancement and segregation based on human auditory mechanisms
Masato Akagi, Mitsunori Mizumachi, Yuichi Ishimoto, Masashi Unoki
Proceedings of 2001 International Conference on Information Society in the 21st Century, 246-253, 2000
A fundamental frequency estimation method for noisy speech
Yuichi Ishimoto, Masato Akagi
Proceedings of WESTPRAC VII, vol. 1, 161-164, 2000
Perception of synthesized singing voices with fine fluctuations in their fundamental frequency contours
Masato Akagi, Hironori Kitakaze
The 6th International Conference on Spoken Language Processing, 3, 458-461, 2000
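Measurement of three-dimensional vocal tract shape using MR imaging: A study of cases of tongue and floor-of-mouth resection
Saito, Suzuki, Fujita, Michi, Takahashi, Akagi, Wakumoto
The Journal of Showa University Dental Society, 20, 2, 198-214, 2000
Proposal of a functional model of cochlear nucleus cells: Response characteristics of anteroventral cochlear nucleus cells
Maki, Akagi, Hirota
Journal of the Acoustical Society of Japan, 56, 7, 457-466, 2000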
The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises
Mizumachi, M, Akagi, M
J. Acoust. Soc. Jpn. (E), 21, 5, 251-258, 2000
A computational model of auditory sound localization based on ITD
K Ito, M Akagi
PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON RECENT DEVELOPMENTS IN AUDITORY MECHANICS, 483-489, 2000
Effect of the basilar membrane nonlinearities on rate-place representation of vowel in the cochlear nucleus: A modeling approach
K Maki, M Akagi, K Hirota
PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON RECENT DEVELOPMENTS IN AUDITORY MECHANICS, 490-496, 2000
A method of signal extraction from noisy signal based on auditory scene analysis
M Unoki, M Akagi
SPEECH COMMUNICATION, 27, 3-4, 261-279, 1999
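Noise reduction method by spectral subtraction using paired microphones
Mitsunori Mizumachi, Masato Akagi
Transactions of the IEICE (Japanese Edition), J82-A, 4, 503-512, 1999
A method for extracting harmonic complex tones in noise based on auditory scene analysis
Masashi Unoki, Masato Akagi
Transactions of the IEICE (Japanese Edition), J82-A, 10, 1497-1507, 1999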
Segregation of vowel in background noise using the model of segregating two acoustic sources based on auditory scene analysis.
Masashi Unoki, Masato Akagi
Sixth European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999, -, 1999
Noise reduction by paired-microphones using spectral subtraction
Mitsunori Mizumachi, Masato Akagi
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2, 1001-1004, 1998
Signal extraction from noisy signal based on auditory scene analysis.
Masashi Unoki, Masato Akagi
The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998, -, 1998
Spectral Stability Based Event Localizing Temporal Decomposition
ACR Nandasena, M Akagi
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, Vol.15, No.4, 957-960, 1998
A method for signal extraction from noise-added signals
Masashi Unoki, Masato Akagi
Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi), 80, 1-10, 1997
Significant cues in spectral envelope of isolated vowels for speaker identification
Kitamura Tatsuya, Akagi Masato
The Journal of the Acoustical Society of Japan, 53, 3, 185-191, 1997
Speaker individuality in fundamental frequency contours and its control
Masato Akagi, Taro Ienaga
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 18, 2, 73-80, 1997
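Perception of fine fluctuation components in fundamental frequency
Hironori Kitakaze, Masato Akagi
IEICE Technical Report (Joint Meeting of the Speech and Hearing Research Groups), SP99, 168, -, 1997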
Perception of lateral misarticulation and its physical correlates
Masato Akagi, Tatsuya Kitamura, Noriko Suzuki, Ken-ichi Michi
Journal of the Acoustical Society of America, 100, 4, 2694, 1996
Speaker individualities in speech spectral envelopes
Tatsuya Kitamura, Masato Akagi
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 16, 5, 283-289, 1995
Evaluation of a spectrum target prediction model in speech perception
Masato Akagi
Journal of the Acoustical Society of America, 87, 2, 858-865, 1990
Sharpness and amplitude envelopes of broadband noise
Kazuo Ueda, Masato Akagi
Journal of the Acoustical Society of America, 87, 2, 814-819, 1990
Spectrum target prediction model and its application to speech recognition
Masato Akagi, Yoh'ichi Tohkura
Computer Speech and Language, 4, 4, 325-344, 1990
A construction of pole‐deviation tracking filter
Masato Akagi, Taizo Iijima
Electronics and Communications in Japan (Part I: Communications), 67, 5, 28-36, 1984
Speech recognition by polarized linear predictive error coding—POLPEC method
Masato Akagi, Taizo Iijima
Electronics and Communications in Japan (Part I: Communications), 65, 8, 9-18, 1982