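Masashi Unoki (鵜木 祐史), Professor; Director, Research Center for Biological Function and Sensory Information; Head, Human Information Science Research Area
Information Science, Human Information Science Research Area, Research Center for Biological Function and Sensory Information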

Published Papers

254 items
Computational models of auditory sensation important for sound quality on basis of either gammatone or gammachirp auditory filterbank
Takuto Isoyama, Shunsuke Kidani, Masashi Unoki
Applied Acoustics, 218, 109914-109914, 2024
Study on suppression effect of air-conducted sound by bone-conducted sound
Shunsuke Inoue, Teruki Toya, Yasufumi Uezu, Masashi Unoki
INTER-NOISE and NOISE-CON Congress and Conference Proceedings, 268, 3, 5479-5489, 2023
Computational model for predicting sound quality metrics using loudness model based on gammatone/gammachirp auditory filterbank and its applications
Takuto Isoyama, Shunsuke Kidani, Masashi Unoki
INTER-NOISE and NOISE-CON Congress and Conference Proceedings, 268, 3, 5955-5964, 2023
Non-intrusive speech intelligibility prediction using an auditory periphery model with hearing loss
Candy Olivia Mawalim, Benita Angela Titalim, Shogo Okada, Masashi Unoki
Applied Acoustics, 214, -, 2023
Contributions of Temporal Modulation Cues in Temporal Amplitude Envelope of Speech to Urgency Perception
Masashi Unoki, Miho Kawamura, Maori Kobayashi, Shunsuke Kidani, Junfeng Li, Masato Akagi
Applied Sciences, 13, 10, 6239-6239, 2023
Methods for improving word intelligibility of bone-conducted speech by using bone-conduction headphones
Teruki Toya, Maori Kobayashi, Kenichi Nakamura, Masashi Unoki
Applied Acoustics, 207, 109337-109337, 2023
Method of estimating three-dimensional direction-of-arrival based on monaural modulation spectrum
Rui Wang, Nguyen Khanh Bui, Daisuke Morikawa, Masashi Unoki
Applied Acoustics, 203, 109215-109215, 2023
Auditory Model Optimization with Wavegram-CNN and Acoustic Parameter Models for Nonintrusive Speech Intelligibility Prediction in Hearing Aids
Candy Olivia Mawalim, Benita Angela Titalim, Shogo Okada, Masashi Unoki
European Signal Processing Conference, 211-215, 2023
Contribution of modulation spectral features for cross-lingual speech emotion recognition under noisy reverberant conditions
Taiyang Guo, Sixia Li, Shunsuke Kidani, Shogo Okada, Masashi Unoki
2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023, 2221-2227, 2023
Unsupervised Deep Unfolded Representation Learning for Singing Voice Separation.
Weitao Yuan, Shengbei Wang, Jianming Wang, Masashi Unoki, Wenwu Wang
IEEE/ACM Transactions on Audio, Speech and Language Processing, 31, 3206-3220, 2023
Music Theory-Inspired Acoustic Representation for Speech Emotion Recognition.
Xingfeng Li, Xiaohan Shi, Desheng Hu, Yongwei Li, Qingchen Zhang, Zhengxia Wang, Masashi Unoki, Masato Akagi
IEEE/ACM Transactions on Audio, Speech and Language Processing, 31, 2534-2547, 2023
A Discriminative Feature Representation Method Based on Cascaded Attention Network With Adversarial Strategy for Speech Emotion Recognition.
Yang Liu, Haoqin Sun, Wenbo Guan, Yuqi Xia, Yongwei Li, Masashi Unoki, Zhen Zhao
IEEE/ACM Transactions on Audio, Speech and Language Processing, 31, 1063-1074, 2023
Contributions of Jitter and Shimmer in the Voice for Fake Audio Detection.
Kai Li, Xugang Lu, Masato Akagi, Masashi Unoki
IEEE Access, 11, 84689-84698, 2023
Personality trait estimation in group discussions using multimodal analysis and speaker embedding.
Candy Olivia Mawalim, Shogo Okada, Yukiko I. Nakano, Masashi Unoki
Journal on Multimodal User Interfaces, 17, 2, 47-63, 2023
Analysis of Amplitude and Frequency Perturbation in the Voice for Fake Audio Detection
Kai Li, Yao Wang, Minh Le Nguyen, Masato Akagi, Masashi Unoki
2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), -, 2022
F0 Modification via PV-TSM Algorithm for Speaker Anonymization Across Gender
Candy Olivia Mawalim, Shogo Okada, Masashi Unoki
2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), -, 2022
Contribution of Common Modulation Spectral Features to Vocal-Emotion Recognition of Noise-Vocoded Speech in Noisy Reverberant Environments
Taiyang Guo, Zhi Zhu, Shunsuke Kidani, Masashi Unoki
Applied Sciences, 12, 19, 9979-9979, 2022
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection
Kai Li, Sheng Li, Xugang Lu, Masato Akagi, Meng Liu, Lin Zhang, Chang Zeng, Longbiao Wang, Jianwu Dang, Masashi Unoki
Interspeech 2022, 664-668, 2022
Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network
Kai Li, Xugang Lu, Masato Akagi, Jianwu Dang, Sheng Li, Masashi Unoki
2022 30th European Signal Processing Conference (EUSIPCO), 379-383, 2022
Detection of Brain Network Communities During Natural Speech Comprehension From Functionally Aligned EEG Sources
Di Zhou, Gaoyan Zhang, Jianwu Dang, Masashi Unoki, Xin Liu
Frontiers in Computational Neuroscience, 16, -, 2022
Plenary meetings of ISO/TC 43, ISO/TC 43/SC 1, ISO/TC 43/SC 2, and ISO/TC 43/SC 3: Status of deliberations on international standards for acoustics, 2021 Paris meeting (held online)
鈴木 陽一, 倉片 憲治, 今泉 博之, 佐藤 洋, 赤松 友成, 山崎 隆志, 藤坂 洋一, 内田 匠, 鵜木 祐史, 桑野 園子, 山田 一郎, 大島 俊也, 高橋 幸雄, 下田 康平, 白橋 良宏, 杉江 聡, 小林 知尋, 永幡 幸司, 森長 誠, 白木 秀児, 平光 厚雄, 古賀 貴士, 平川 侑, 澤田 浩一
The Journal of the Acoustical Society of Japan, 78, 4, 203-208, 2022
Automatic Mean Opinion Score Estimation with Temporal Modulation Features on Gammatone Filterbank for Speech Assessment.
Huy Nguyen, Kai Li, Masashi Unoki
Interspeech 2022(INTERSPEECH), 4526-4530, 2022
Method for improving the word intelligibility of presented speech using bone-conduction headphones.
Teruki Toya, Wenyu Zhu, Maori Kobayashi, Kenichi Nakamura, Masashi Unoki
Interspeech 2022(INTERSPEECH), 759-763, 2022
Global Signal-to-noise Ratio Estimation Based on Multi-subband Processing Using Convolutional Neural Network.
Nan Li, Meng Ge, Longbiao Wang, Masashi Unoki, Sheng Li, Jianwu Dang
Interspeech 2022(INTERSPEECH), 361-365, 2022
Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement.
Tuan Vu Ho, Quoc Huy Nguyen, Masato Akagi, Masashi Unoki
Interspeech 2022(INTERSPEECH), 176-180, 2022
An Improved Stimulus Reconstruction Method for EEG-Based Short-Time Auditory Attention Detection.
Kai Yang, Zhuo Zhang, Gaoyan Zhang, Masashi Unoki, Jianwu Dang, Longbiao Wang
Neural Information Processing - 29th International Conference, 267-277, 2022
Bone-conducted Speech Enhancement Using Vector-quantized Variational Autoencoder and Gammachirp Filterbank Cepstral Coefficients.
Quoc-Huy Nguyen, Masashi Unoki
30th European Signal Processing Conference(EUSIPCO), 21-25, 2022
Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Using Temporal Modulation Features on Gammatone Auditory Filterbank.
Kai Li, Quoc-Huy Nguyen, Yasuji Ota, Masashi Unoki
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022(DCASE), -, 2022
Dialogue scenario classification based on social factors
Yuning Liu, Di Zhou, Masashi Unoki, Jianwu Dang, Aijun Li
2022 13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, 379-383, 2022
Reconstruction of speech spectrogram based on non-invasive EEG signal
Di Zhou, Masashi Unoki, Gaoyan Zhang, Jianwu Dang
2022 13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, 275-279, 2022
Speech Intelligibility Prediction for Hearing Aids Using an Auditory Model and Acoustic Parameters
Benita Angela Titalim, Candy Olivia Mawalim, Shogo Okada, Masashi Unoki
Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 1076-1084, 2022
Investigation of noise-reverberation-robustness of modulation spectral features for speech-emotion recognition
Taiyang Guo, Sixia Li, Masashi Unoki, Shogo Okada
Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 39-46, 2022
Speaker anonymization by modifying fundamental frequency and x-vector singular value.
Candy Olivia Mawalim, Kasorn Galajit, Jessada Karnjana, Shunsuke Kidani, Masashi Unoki
Comput. Speech Lang., 73, 101326-101326, 2022
Multi-resolution modulation-filtered cochleagram feature for LSTM-based dimensional emotion recognition from speech
Zhichao Peng, Jianwu Dang, Masashi Unoki, Masato Akagi
Neural Networks, 140, 261-273, 2021
Computational Models of Sharpness and Fluctuation Strength Using Loudness Models Composed of Gammatone and Gammachirp Auditory Filterbanks
Takuto Isoyama, Shunsuke Kidani, Masashi Unoki
Journal of Signal Processing, 25, 4, 141-144, 2021
Robust Voice Activity Detection Using a Masked Auditory Encoder Based Convolutional Neural Network
Nan Li, Longbiao Wang, Masashi Unoki, Sheng Li, Rui Wang, Meng Ge, Jianwu Dang
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), -, 2021
Crossfire conditional generative adversarial networks for singing voice extraction
Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 3, 2308-2312, 2021
Synchronous multi-bit audio watermarking based on phase shifting
Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2021-, 2700-2704, 2021
Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method.
Candy Olivia Mawalim, Masashi Unoki
APSIPA ASC, 1627-1633, 2021
Speech Watermarking Method Using McAdams Coefficient Based on Random Forest Learning.
Candy Olivia Mawalim, Masashi Unoki
Entropy, 23, 10, 1246-1246, 2021
Measurements of Transmission Characteristics Related to Bone-Conducted Speech Using Excitation Signals in the Oral Cavity
Teruki Toya, Peter Birkholz, Masashi Unoki
Journal of Speech, Language, and Hearing Research, 63, 12, 4252-4264, 2020
Enhancement of speech intelligibility under noisy reverberant conditions based on modulation spectrum concept
Thuan Van Ngo, Tuan Vu Ho, Masashi Unoki, Rieko Kubo, Masato Akagi
2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings, 753-758, 2020
Multi-Subspace Echo Hiding Based on Time-Frequency Similarities of Audio Signals
Shengbei Wang, Weitao Yuan, Masashi Unoki
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 2349-2363, 2020
Improvement in Bone-Conducted Speech Restoration Using Linear Prediction and Long Short-Term Memory Model
Huy Quoc Nguyen, Masashi Unoki
Journal of Signal Processing, 24, 4, 175-178, 2020
Cortical oscillatory hierarchy for natural sentence processing
Bin Zhao, Jianwu Dang, Gaoyan Zhang, Masashi Unoki
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-, 125-129, 2020
X-Vector Singular Value Modification and Statistical-Based Decomposition with Ensemble Regression Modeling for Speaker Anonymization System.
Candy Olivia Mawalim, Kasorn Galajit, Jessada Karnjana, Masashi Unoki
Interspeech 2020(INTERSPEECH), 1703-1707, 2020
Speech Information Hiding by Modification of LSF Quantization Index in CELP Codec.
Candy Olivia Mawalim, Shengbei Wang, Masashi Unoki
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference(APSIPA), 1321-1330, 2020
Audio Information Hiding Based on Cochlear Delay Characteristics with Optimized Segment Selection
Candy Olivia Mawalim, Masashi Unoki
Advances in Intelligent Systems and Computing, 128-138, 2020
Audio information hiding techniques based on auditory characteristics
Masashi Unoki
IEICE ESS Fundamentals Review, 13, 4, 284-293, 2020
Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks with Auditory Front-Ends
Zhichao Peng, Xingfeng Li, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
IEEE Access, 8, 16560-16572, 2020
Presentation effect of cue tone on tuning of auditory filter for several frequencies
Shunsuke Kidani, Ryota Miyauchi, Masashi Unoki
Acoustical Science and Technology, 41, 1, 378-379, 2020
Non-blind speech watermarking method based on spread-spectrum using linear prediction residue
Reiya Namikawa, Masashi Unoki
IEICE Transactions on Information and Systems, E103D, 1, 63-66, 2020
Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Networks
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), -, 2019
A Robust Method for Blindly Estimating Speech Transmission Index using Convolutional Neural Network with Temporal Amplitude Envelope
Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki
2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), -, 2019
A Skip Attention Mechanism for Monaural Singing Voice Separation
Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang
IEEE Signal Processing Letters, 26, 10, 1481-1485, 2019
How the temporal amplitude envelope of speech contributes to urgency perception
Masashi Unoki, Miho Kawamura, Maori Kobayashi, Shunsuke Kidani, Masato Akagi
Proceedings of 23rd International Congress on Acoustics (ICA 2019), 1739-1744, 2019
Detection of speech tampering using sparse representations and spectral manipulations based information hiding
Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki
Speech Communication, 112, 1-14, 2019
Feasibility of Audio Information Hiding Using Linear Time Variant IIR Filters Based on Cochlear Delay
Candy Olivia Mawalim, Masashi Unoki
Journal of Signal Processing, 23, 4, 155-158, 2019
Data augmentation for monaural singing voice separation based on variational autoencoder-generative adversarial network
Boxin He, Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki
Proceedings - IEEE International Conference on Multimedia and Expo, 2019-July, 1354-1359, 2019
Inaudible Speech Watermarking Based on Self-compensated Echo-hiding and Sparse Subspace Clustering
Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2019-May, 2632-2636, 2019
Proximal Deep Recurrent Neural Network for Monaural Singing Voice Separation
Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2019-May, 286-290, 2019
Semi-fragile speech watermarking based on singular-spectrum analysis with CNN-based parameter estimation for tampering detection
Kasorn Galajit, Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee
APSIPA Transactions on Signal and Information Processing, 8, -, 2019
Digital audio watermarking method based on singular spectrum analysis with automatic parameter estimation using a convolutional neural network
Kasorn Galajit, Jessada Karnjana, Pakinee Aimmanee, Masashi Unoki
Smart Innovation, Systems and Technologies, 110, 63-73, 2019
Enhanced feature network for monaural singing voice separation
Weitao Yuan, Boxin He, Shengbei Wang, Jianming Wang, Masashi Unoki
Speech Communication, 106, 1-6, 2019
Speech Watermarking Based on Source-filter Model of Speech Production.
Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki
J. Inf. Hiding Multim. Signal Process., 10, 4, 517-534, 2019
Estimates of transmission characteristics related to perception of bone-conducted speech using real utterances and transcutaneous vibration on larynx
Teruki Toya, Peter Birkholz, Masashi Unoki
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11658 LNAI, 491-500, 2019
Multimodal BigFive Personality Trait Analysis Using Communication Skill Indices and Multiple Discussion Types Dataset
Candy Olivia Mawalim, Shogo Okada, Yukiko I. Nakano, Masashi Unoki
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11578 LNCS, 370-383, 2019
Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
Proceedings - IEEE International Conference on Multimedia and Expo, 2018-July, 1-6, 2018
Speech Watermarking Technique Based on Singular Spectrum Analysis and Automatic Parameter Estimation using Differential Evolution for Tampering Detection
Kasorn Galajit, Jessada Karnjana, Masashi Unoki, Mongkonchai Intarauksorn, Pakinee Aimmanee
2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing, iSAI-NLP 2018 - Proceedings, -, 2018
Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification
Jessada Karnjana, Kasorn Galajit, Pakinee Aimmanee, Chai Wutiwiwatchai, Masashi Unoki
Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, 2018-February, 193-202, 2018
F0 estimation using empirical mode decomposition and complex cepstrum analysis in reverberant environments
Surasak Boonkla, Masashi Unoki, Chai Wutiwiwatchai, Stanislav S. Makhanov
Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, 2018-February, 980-986, 2018
Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, 2018-February, 1750-1755, 2018
Study on method for protecting speech privacy by actively controlling speech transmission index in simulated room
Masashi Unoki, Yuta Kashihara, Maori Kobayashi, Masato Akagi
Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, 2018-February, 1199-1204, 2018
Contributions of temporal cue on the perception of speaker individuality and vocal emotion for noise-vocoded speech
Zhi Zhu, Ryota Miyauchi, Yukiko Araki, Masashi Unoki
Acoustical Science and Technology, 39, 3, 234-242, 2018
Contribution of modulation spectral features on the perception of vocal-emotion using noise-vocoded speech
Zhi Zhu, Ryota Miyauchi, Yukiko Araki, Masashi Unoki
Acoustical Science and Technology, 39, 6, 379-386, 2018
Study on speech representation based on spikegram for speech fingerprints
Dung Kim Tran, Masashi Unoki
Smart Innovation, Systems and Technologies, 82, 153-160, 2018
Noise Suppression Method Based on Modulation Spectrum Analysis.
Takuto Isoyama, Masashi Unoki
Speech and Computer - 20th International Conference, SPECOM 2018, Leipzig, Germany, September 18-22, 2018, Proceedings, 234-244, 2018
Method of Estimating Direction of Arrival of Sound Source for Monaural Hearing Based on Temporal Modulation Perception.
Nguyen Khanh Bui, Daisuke Morikawa, Masashi Unoki
2018 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2018, Calgary, AB, Canada, April 15-20, 2018, 5014-5018, 2018
Speech Watermarking Based on Robust Principal Component Analysis and Formant Manipulations.
Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki
2018 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2018, Calgary, AB, Canada, April 15-20, 2018, 2082-2086, 2018
Speech Emotion Recognition Using MPCRNN based on Gammatone auditory Filterbank
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
APSIPA2017, -, 2017
Method of blindly estimating speech transmission index in noisy reverberant environments
Masashi Unoki, Akikazu Miyazaki, Shota Morita, Masato Akagi
Journal of Information Hiding and Multimedia Signal Processing, 8, 1430-1445, 2017
Method of estimating signal-to-noise ratio based on optimal design for sub-band voice activity detection
Shota Morita, Xugang Lu, Masashi Unoki, Masato Akagi
Journal of Information Hiding and Multimedia Signal Processing, 8, 1446-1459, 2017
Feasibility of vocal emotion conversion on modulation spectrogram for simulated cochlear implants
Zhi Zhu, Ryota Miyauchi, Yukiko Araki, Masashi Unoki
25th European Signal Processing Conference, EUSIPCO 2017, 2017-, 1834-1838, 2017
Recent Development of Speech and Audio Signal Processing in Network Communication
Ruimin Hu, Changchun Bao, Qingwei Zhao, Masashi Unoki, Jong Won Shin
China Communications, 14, 9, III-IV, 2017
Robust front-end for speech recognition by human and machine in noisy reverberant environments: The effect of phase information
Yang Liu, Naushin Nower, Shota Morita, Masashi Unoki
Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016, -, 2017
Tampering Detection in Speech Signals by Semi-Fragile Watermarking Based on Singular-Spectrum Analysis
Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai
Advances in Intelligent Information Hiding and Multimedia Signal Processing, Vol. 1, 63, 131-140, 2017
Investigation on the head-related modulation transfer function for monaural DOA
Nguyen Khanh Bui, Daisuke Morikawa, Masashi Unoki
Advances in Intelligent Information Hiding and Multimedia Signal Processing, Vol. 1, 63, 191-198, 2017
Robust method for estimating F0 of complex tone based on pitch perception of amplitude modulated signal
Kenichiro Miwa, Masashi Unoki
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2017-, 2311-2315, 2017
Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank.
Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 1750-1755, 2017
Study on method for protecting speech privacy by actively controlling speech transmission index in simulated room.
Masashi Unoki, Yuta Kashihara, Maori Kobayashi, Masato Akagi
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 1199-1204, 2017
F0 estimation using empirical mode decomposition and complex cepstrum analysis in reverberant environments.
Surasak Boonkla, Masashi Unoki, Chai Wutiwiwatchai, Stanislav S. Makhanov
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 980-986, 2017
Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification.
Jessada Karnjana, Kasorn Galajit, Pakinee Aimmanee, Chai Wutiwiwatchai, Masashi Unoki
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017, 193-202, 2017
Speech enhancement of instantaneous amplitude and phase for applications in noisy reverberant environments
Yang Liu, Naushin Nower, Shota Morita, Masashi Unoki
Speech Communication, 84, 1-14, 2016
Speech Analysis Method Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition
Surasak Boonkla, Masashi Unoki, Stanislav S. Makhanov, Chai Wutiwiwatchai
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, E99A, 10, 1762-1773, 2016
Singular-Spectrum Analysis for Digital Audio Watermarking with Automatic Parameterization and Parameter Estimation
Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai
IEICE Transactions on Information and Systems, E99D, 8, 2109-2120, 2016
Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments
Shota Morita, Masashi Unoki, Xugang Lu, Masato Akagi
Journal of Signal Processing Systems for Signal, Image, and Video Technology, 82, 2, 163-173, 2016
MTF-Based Kalman Filtering with Linear Prediction for Power Envelope Restoration in Noisy Reverberant Environments
Yang Liu, Shota Morita, Masashi Unoki
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, E99A, 2, 560-569, 2016
An approach to estimating decision complexity for better understanding playing patterns of masters
Akira Takeuchi, Masashi Unoki, Hiroyuki Iida
Studies in Computational Intelligence, 619, 113-126, 2016
Method of Audio Watermarking Based on Adaptive Phase Modulation
Nhut Minh Ngo, Masashi Unoki
IEICE Transactions on Information and Systems, E99D, 1, 92-101, 2016
Study on Effects of Speech Production during Delayed Auditory Feedback for Air-Conducted and Bone-Conducted Speech
Teruki Toya, Daisuke Ishikawa, Ryota Miyauchi, Kazushi Nishimoto, Masashi Unoki
Journal of Signal Processing, 20, 4, 197-200, 2016
SSA-based Audio-Information-Hiding Scheme with Psychoacoustic Model
Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), -, 2016
iDAF-drum: Supporting Practice of Drumstick Control by Exploiting Insignificantly Delayed Auditory Feedback
Kazushi Nishimoto, Akari Ikenoue, Masashi Unoki
Knowledge, Information and Creativity Support Systems, 416, 483-497, 2016
Audio watermarking scheme based on singular spectrum analysis and psychoacoustic model with self-synchronization
Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai
Journal of Electrical and Computer Engineering, 2016, -, 2016
Study on linguistic information and speaker individuality contained in temporal envelope of speech
Zhi Zhu, Yasutaka Nishino, Ryota Miyauchi, Masashi Unoki
Acoustical Science and Technology, 37, 5, 258-261, 2016
Robust Speech Analysis Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Noisy Environments
Surasak Boonkla, Masashi Unoki, Stanislav S. Makhanov
Speech and Computer, 9811, 580-587, 2016
Robust front-end for speech recognition by human and machine in noisy reverberant environments: the effect of phase information
Yang Liu, Naushin Nower, Shota Morita, Masashi Unoki
2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP), 1-5, 2016
Modulation spectral features for predicting vocal emotion recognition by simulated cochlear implants
Zhi Zhu, Ryota Miyauchi, Yukiko Araki, Masashi Unoki
17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), Vols 1-5, 262-266, 2016
Investigations into Vowel and Consonant Structures in Articulatory and Auditory Spaces Using Laplacian Eigenmaps
Jianwu Dang, Shengbei Wang, Masashi Unoki
2016 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings, 5355-5359, 2016
SSA-based Audio-Information-Hiding Scheme with Psychoacoustic Model
Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 1-10, 2016
Preliminary Study on Blind Estimation of Room Acoustic Parameters in Noisy Reverberant Environments
M. Unoki, S. Morita, A. Miyazaki, M. Akagi
Proc. WESPAC2015, Singapore, 428-435, 2015
Tampering detection scheme for speech signals using formant enhancement based watermarking
Shengbei Wang, Ryota Miyauchi, Masashi Unoki, Nam Soo Kim
Journal of Information Hiding and Multimedia Signal Processing, 6, 1264-1283, 2015
Robust and reliable audio watermarking based on phase coding
Nhut Minh Ngo, Masashi Unoki
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2015-, 345-349, 2015
A speech dereverberation method based on the MTF concept using adaptive time-frequency divisions
Masashi Unoki, Masato Toi, Masato Akagi
European Signal Processing Conference, 06-10-September-2004, 1689-1692, 2015
Robust, Blindly-Detectable, and Semi-Reversible Technique of Audio Watermarking Based on Cochlear Delay Characteristics
Masashi Unoki, Ryota Miyauchi
IEICE Transactions on Information and Systems, E98D, 1, 38-48, 2015
Speech Watermarking Method Based on Formant Tuning
Shengbei Wang, Masashi Unoki
IEICE Transactions on Information and Systems, E98D, 1, 29-37, 2015
Robust and Reliable Audio Watermarking Based on Phase Coding
Nhut Minh Ngo, Masashi Unoki
2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 345-349, 2015
An Audio Watermarking Scheme Based on Automatic Parameterized Singular-Spectrum Analysis Using Differential Evolution
Jessada Karnjana, Pakinee Aimmanee, Masashi Unoki, Chai Wutiwiwatchai
2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 543-551, 2015
An Audio Watermarking Scheme Based on Singular-Spectrum Analysis
Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai
Digital-Forensics and Watermarking, IWDW 2014, 9023, 145-159, 2015
Watermarking for Digital Audio Based on Adaptive Phase Modulation
Nhut Minh Ngo, Masashi Unoki
Digital-Forensics and Watermarking, IWDW 2014, 9023, 105-119, 2015
Complex tensor factorization in modulation frequency domain for single-channel speech enhancement
Shogo Masaya, Masashi Unoki
16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015), Vols 1-5, 1765-1769, 2015
Feasibility of Estimating Direction of Arrival Based on Monaural Modulation Spectrum
Daisuke Morikawa, Masaru Ando, Masashi Unoki
2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 384-387, 2015
An Automatic Watermarking in CELP Speech Codec Based on Formant Tuning
Erick Christian Garcia Alvarez, Shengbei Wang, Masashi Unoki
2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 160-163, 2015
Robust and Reliable Audio Watermarking Based on Dynamic Phase Coding and Error Control Coding
Nhut Minh Ngo, Brian Michael Kurkoski, Masashi Unoki
2015 23rd European Signal Processing Conference (EUSIPCO), 2276-2280, 2015
Restoration of Instantaneous Amplitude and Phase of Speech Signal in Noisy Reverberant Environments
Yang Liu, Naushin Nower, Yonghong Yan, Masashi Unoki
2015 23rd European Signal Processing Conference (EUSIPCO), 879-883, 2015
Watermarking of speech signals based on formant enhancement
Shengbei Wang, Masashi Unoki
European Signal Processing Conference, 1257-1261, 2014
Study on Method of Estimating Direction of Arrival Using Monaural Modulation Spectrum
Masaru Ando, Daisuke Morikawa, Masashi Unoki
Journal of Signal Processing, 18, 4, 197-200, 2014
Study on blind method of estimating speech transmission index from noisy reverberant amplitude-modulated-signals
Akikazu Miyazaki, Shota Morita, Masashi Unoki
Journal of Signal Processing, 18, 4, 201-204, 2014
Data hiding scheme for amplitude modulation radio broadcasting systems
Nhut Minh Ngo, Masashi Unoki, Ryota Miyauchi, Yôiti Suzuki
Journal of Information Hiding and Multimedia Signal Processing, 5, 324-341, 2014
Comparative evaluations of inaudible and robust watermarking for digital audio signals
Masashi Unoki, Jessada Karnjana, Shengbei Wang, Nhut Minh Ngo, Ryota Miyauchi
21st International Congress on Sound and Vibration 2014, ICSV 2014, 3, 2230-2237, 2014
Study on semi-scramble method for speech signals based on phonemic restoration
YAMAMOTO Katsuhiko, ZHU Zhi, UNOKI Masashi, AOKI Naofumi
Journal of Signal Processing, 18, 4, 205-208, 2014
Study on Scramble Method for Speech Signal by Using Random-Bit Shift of Quantization
Zhu Zhi, Yamamoto Katsuhiko, Unoki Masashi, Aoki Naofumi
Journal of Signal Processing, 18, 6, 303-307, 2014
Signal to Noise Ratio Estimation Based on An Optimal Design of Subband Voice Activity Detection
Shota Morita, Xugang Lu, Masashi Unoki
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 560-+, 2014
Speech Analysis Method Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Log-Spectrum Domain
Surasak Boonkla, Masashi Unoki, Stanislav S. Makhanov, Chai Wutiwiwatchai
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 555-+, 2014
Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments
Shota Morita, Masashi Unoki, Xugang Lu, Masato Akagi
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 108-+, 2014
Formant Enhancement based Speech Watermarking for Tampering Detection
Shengbei Wang, Masashi Unoki, Nam Soo Kim
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 1366-1370, 2014
Hybrid Speech Watermarking based on Formant Enhancement and Cochlear Delay
Shengbei Wang, Masashi Unoki
2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 272-275, 2014
RESTORATION OF INSTANTANEOUS AMPLITUDE AND PHASE USING KALMAN FILTER FOR SPEECH ENHANCEMENT
Naushin Nower, Yang Liu, Masashi Unoki
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 4633-4637, 2014
WATERMARKING OF SPEECH SIGNALS BASED ON FORMANT ENHANCEMENT
Shengbei Wang, Masashi Unoki
2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 1257-1261, 2014
Controlling Tradeoff Between Approximation Accuracy and Complexity of a Smooth Function in a Reproducing Kernel Hilbert Space for Noise Reduction
Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 61, 3, 601-610, 2013
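A support system for improving drum-playing form using slightly delayed auditory feedback
Akari Ikenoue, Kanayo Ogura, Masashi Unoki, Kazushi Nishimoto
Transactions of the Human Interface Society, 15, 1, 15-24, 2013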
Study on effects of presence of cue-tone on psychophysical tuning curves
Shunsuke Kidani, Ryota Miyauchi, Masashi Unoki
Proceedings of Meetings on Acoustics, 19, -, 2013
Objective evaluation of sound quality for attacks on robust audio watermarking
Akira Nishimura, Masashi Unoki, Kazuhiko Kondo, Akio Ogihara
Proceedings of Meetings on Acoustics, 19, -, 2013
MTF based Kalman filtering with linear prediction for power envelope restoration
Yang Liu, Masashi Unoki
2013 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS), 198-203, 2013
Concurrent processing of voice activity detection and noise reduction using empirical mode decomposition and modulation spectrum analysis
Yasuaki Kanai, Shota Morita, Masashi Unoki
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 742-746, 2013
Study on method for estimating F0 of steady complex tone in noisy reverberant environments
Kenichiro Miwa, Masashi Unoki
2013 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2013), 456-459, 2013
Watermarking Method for Speech Signals based on Modifications to LSFs
Shengbei Wang, Masashi Unoki
2013 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2013), 283-286, 2013
Robust audio data hiding method based on phase of modulated complex lapped transform
Kiho Cho, Soo Hyun Bae, In Kyu Choi, Nam Soo Kim, Masashi Unoki
Proceedings - 2013 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013, 263-266, 2013
BLIND METHOD OF ESTIMATING SPEECH TRANSMISSION INDEX FROM REVERBERANT SPEECH SIGNALS
Masashi Unoki, Kyohei Sasaki, Ryota Miyauchi, Masato Akagi, Nam Soo Kim
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 1-5, 2013
IMM-based feature compensation robust to slowly time-varying noise and reverberation
Shin Jae Kang, Chang Woo Han, Kang Hyun Lee, Nam Soo Kim, Masashi Unoki
2013 IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2013 - Proceedings, 313-317, 2013
Blind method of estimating speech transmission index in room acoustics based on concept of modulation transfer function
Masashi Unoki, Tomohiro Ikeda, Kyohei Sasaki, Ryota Miyauchi, Masato Akagi, Nam Soo Kim
2013 IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2013 - Proceedings, 308-312, 2013
A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model
Phung, T. N, Unoki, M, Akagi, M
Journal of Signal Processing, 16, 5, 409-417, 2012
Improvements to creativity in singing abilities based on perspective of studies on interaction between speech production and auditory perception
Masashi Unoki, Kazushi Nishimoto
2012 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE, INFORMATION AND CREATIVITY SUPPORT SYSTEMS (KICSS 2012), 157-160, 2012
ROBUST VOICE ACTIVITY DETECTION USING EMPIRICAL MODE DECOMPOSITION AND MODULATION SPECTRUM ANALYSIS
Yasuaki Kanai, Masashi Unoki
2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 400-404, 2012
UNIFIED DENOISING AND DEREVERBERATION METHOD USED IN RESTORATION OF MTF-BASED POWER ENVELOPE
Masashi Unoki, Xugang Lu
2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 215-219, 2012
CONTROLLING THE TRADEOFF PROPERTY IN A REGULARIZATION FRAMEWORK FOR NOISE REDUCTION
Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka
2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 201-205, 2012
Detection of tampering in speech signals with inaudible watermarking technique
Masashi Unoki, Ryota Miyauchi
Proceedings of the 2012 8th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2012, 118-121, 2012
Data-hiding scheme for digital-audio in amplitude modulation domain
Nhut Minh Ngo, Masashi Unoki, Ryota Miyauchi, Yoiti Suzuki
Proceedings of the 2012 8th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2012, 114-117, 2012
Blind estimation method of speech transmission index in room acoustics
Masashi Unoki, Tomohiro Ikeda, Masato Akagi
Proceedings of Forum Acusticum, 1973-1978, 2011
MTF-based sub-band power-envelope restoration for robust speech recognition in noisy reverberant environments
Shota Morita, Xugang Lu, Masashi Unoki, Masato Akagi, Rüdiger Hoffmann
APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, 21-25, 2011
Sub-band temporal modulation envelopes and their normalization for automatic speech recognition in reverberant environments
Xugang Lu, Masashi Unoki, Satoshi Nakamura
COMPUTER SPEECH AND LANGUAGE, 25, 3, 571-584, 2011
Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals
Yanao, Y, Miyauchi, R, Unoki, M, Akagi, M
Proc. NCSP2011, 231-234, 2011
Temporal modulation normalization for robust speech feature extraction and recognition
Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura
MULTIMEDIA TOOLS AND APPLICATIONS, 52, 1, 187-199, 2011
Embedding limitations with digital-audio watermarking method based on cochlear delay characteristics
Masashi Unoki, Kuniaki Imabeppu, Daiki Hamada, Atsushi Haniu, Ryota Miyauchi
Journal of Information Hiding and Multimedia Signal Processing, 2, 1, 1-23, 2011
Effects of spatial cues on detectability of alarm signals in noisy environments
N. Kuroda, J. Li, Y. Iwaya, M. Unoki, M. Akagi
Principles and Applications of Spatial Hearing, 484-493, 2011
Adaptive regularization framework for robust voice activity detection
Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2664-2667, 2011
Voice activity detection in MTF-based power envelope restoration
Masashi Unoki, Xugang Lu, Rico Petrick, Shota Morita, Masato Akagi, Ruediger Hoffmann
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2620-2623, 2011
Reversible watermarking for digital audio based on cochlear delay characteristics
Masashi Unoki, Ryota Miyauchi
Proceedings - 7th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIHMSP 2011, 314-317, 2011
Functional approximation in a reproducing kernel Hilbert Space for speech estimation in noisy environments
Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura
APSIPA ASC 2010 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 76-81, 2010
Normalization on subband temporal envelopes for large vocabulary continuous speech recognition
Xugang Lu, Masashi Unoki, Satoshi Nakamura
Proceedings of the 12th IASTED International Conference on Signal and Image Processing, SIP 2010, 7-12, 2010
Effects of the presence of cue tone in signal detection varies with relationships between cue tone and signal frequencies
Shunsuke Kidani, Ryota Miyauchi, Masashi Unoki
20th International Congress on Acoustics 2010, ICA 2010 - Incorporating Proceedings of the 2010 Annual Conference of the Australian Acoustical Society, 4, 3287-3291, 2010
A study on the IMTF-based filtering on the modulation spectrum of reverberant signal
Shota Morita, Masashi Unoki, Masato Akagi
Journal of Signal Processing, 14, 4, 269-272, 2010
A study on the MTF-based inverse filtering for the modulation spectrum of reverberant speech
Morita, S, Unoki, M, Akagi, M
Proc. NCSP10, -, 2010
METHOD OF DIGITAL-AUDIO WATERMARKING BASED ON COCHLEAR DELAY CHARACTERISTICS
Masashi Unoki, Daiki Hamada
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 6, 3B, 1325-1346, 2010
Regularization in a reproducing kernel Hilbert space for robust voice activity detection
Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura
2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 585-588, 2010
Voice activity detection in a regularized reproducing kernel Hilbert space
Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 3086-3089, 2010
Speech enhancement as a functional approximation and generalization
Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura
2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings, 18-22, 2010
Methods for Robust Speech Recognition in Reverberant Environments: A Comparison
Rico Petrick, Thomas Feher, Masashi Unoki, Ruediger Hoffmann
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 582-+, 2010
Design of IIR all-pass filter based on cochlear delay to reduce embedding limitations
Masashi Unoki, Toshizo Kosugi, Atsushi Haniu, Ryota Miyauchi
Proceedings - 2010 6th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIHMSP 2010, 526-529, 2010
Speech Enhancement Based on Noise Eigenspace Projection
Dongwen Ying, Masashi Unoki, Xugang Lu, Jianwu Dang
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E92D, 5, 1137-1145, 2009
An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech
Kinugasa, K, Unoki, M, Akagi, M
Proc. NCSP'09, 105-108, 2009
Effects from spatial cues on detectability of alarm signals in car environments
N. Kuroda, J. Li, Y. Iwaya, M. Unoki, M. Akagi
Proc. NCSP'09, 45-48, 2009
A psychoacoustically-motivated conceptual model for automatic speech recognition
Haniu, A, Unoki, M, Akagi, M
Proc. WESPAC2009, -, 2009
NONLINEAR RESPONSES OF A NONLINEAR COCHLEAR MODEL WITH THE FUNCTION OF AN OUTER HAIR CELL MODEL
Y. Murakami, M. Unoki
CONCEPTS AND CHALLENGES IN THE BIOPHYSICS OF HEARING, 343-349, 2009
Temporal modulation normalization for robust speech feature extraction and recognition
Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura
PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 4354-4357, 2009
Normalization on the modulation spectrum of the subband temporal envelopes for automatic speech recognition in reverberant environments
X. Lu, M. Unoki, S. Nakamura
ACM International Conference Proceeding Series, 247-254, 2009
Subband Temporal Modulation Spectrum Normalization for Automatic Speech Recognition in Reverberant Environments
Xugang Lu, Masashi Unoki, Satoshi Nakamura
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2475-2478, 2009
Embedding limitations with audio-watermarking method based on cochlear delay characteristics
Kuniaki Imabeppu, Daiki Hamada, Masashi Unoki
IIH-MSP 2009 - 2009 5th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 82-85, 2009
TEMPORAL CONTRAST NORMALIZATION AND EDGE-PRESERVED SMOOTHING ON TEMPORAL MODULATION STRUCTURE FOR ROBUST SPEECH RECOGNITION
X. Lu, S. Matsuda, M. Unoki, T. Shimizu, S. Nakamura
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 4573-4576, 2009
MTF-based power envelope restoration in noisy reverberant environments
Masashi Unoki, Yutaka Yamasaki, Masato Akagi
17th European Signal Processing Conference, EUSIPCO 2009, Glasgow, Scotland, UK, August 24-28, 2009, 228-232, 2009
Blind estimation method of reverberation time based on concept of modulation transfer function
M. Unoki, S. Hiramatsu
Proceedings - European Conference on Noise Control, 4491-4496, 2008
Judgment of perceptual synchrony between two pulses and verification of its relation to cochlear delay by an auditory model
Eriko Aiba, Minoru Tsuzaki, Satomi Tanaka, Masashi Unoki
JAPANESE PSYCHOLOGICAL RESEARCH, 50, 4, 204-213, 2008
An LP-based blind model for restoring bone-conducted speech
Thang Tat Vu, Masashi Unoki, Masato Akagi
2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 210-215, 2008
Robust Front End Processing for Speech Recognition in Reverberant Environments: Utilization of Speech Characteristics
Rico Petrick, Xugang Lu, Masashi Unoki, Masato Akagi, Ruediger Hoffmann
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 658-+, 2008
A Comprehensive Study on the Effects of Room Reverberation on Fundamental Frequency Estimation
Rico Petrick, Masashi Unoki, Anish Mittal, Carlos Segura, Ruediger Hoffmann
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 131-+, 2008
Audio watermarking method based on the cochlear delay characteristics
Masashi Unoki, Daiki Hamada
2008 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS, 616-619, 2008
Comparative evaluations of robust and accurate F0 estimates in reverberant environments
Masashi Unoki, Toshihiro Hosorogiya, Yuichi Ishimoto
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 4569-+, 2008
MTF-based method of blind estimation of reverberation time in room acoustics
Masashi Unoki, Sota Hiramatsu
2008 16th European Signal Processing Conference, EUSIPCO 2008, Lausanne, Switzerland, August 25-29, 2008, 1-5, 2008
The Construction of Large-scale Bone-conducted and Air-conducted Speech Databases for Speech Intelligibility Tests
Vu, T. T, Unoki, M, Akagi, M
Proc. Oriental COCOSDA2007, 88-91, 2007
Estimates of tuning of auditory filter using simultaneous and forward notched-noise masking
Masashi Unoki, Ryota Miyauchi, Chin-Tuan Tan
HEARING - FROM SENSORY PROCESSING TO PERCEPTION, 19-26, 2007
Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voices
Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi
2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 113-+, 2007
Improvement in detectability of alarm signals in noisy environments by utilizing spatial cues
Hideaki Uchiyama, Masashi Unoki, Masato Akagi
2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 177-180, 2007
Vocal conversion from speaking voice to singing voice using STRAIGHT
Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi
INTERSPEECH 2007, 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007, 4005-4006, 2007
Method of LP-based blind restoration for improving intelligibility of bone-conducted speech
Thang Tat Vu, Germine Seide, Masashi Unoki, Masato Akagi
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 1885-1888, 2007
A Study on Restoration of Bone-Conducted Speech with MTF-Based and LP-based Models
Vu, T, Unoki, M, Akagi, M
Journal of Signal Processing, 10, 6, 407-417, 2006
Comparison of the roex and gammachirp filters as representations of the auditory filter
Masashi Unoki, Toshio Irino, Brian Glasberg, Brian C. J. Moore, Roy D. Patterson
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 120, 3, 1474-1492, 2006
Effect of ITD and component frequencies on perception of alarm signals in noisy environments
Nakanishi, J, Unoki, M, Akagi, M
Journal of Signal Processing, 10, 4, 231-234, 2006
Estimate of auditory filter shape using notched-noise masking for various signal frequencies
Masashi Unoki, Kazuhito Ito, Yuichi Ishimoto, Chin-Tuan Tan
Acoustical Science and Technology, 27, 1, 1-11, 2006
A study on an LP-based model for restoring bone-conducted speech
Thang Tat Vu, Masashi Unoki, Masato Akagi
2006 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 294-+, 2006
A robust feature extraction based on the MTF concept for speech recognition in reverberant environment
Xugang Lu, Masashi Unoki, Masato Akagi
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2546-2549, 2006
Estimates of auditory filter shape using simultaneous and forward notched-noise masking
Masashi Unoki, Chin Tuan Tan
Forum Acusticum Budapest 2005: 4th European Congress on Acoustics, 1497-1502, 2005
Development of the MTF-based speech dereverberation method using adaptive time-frequency division
Masashi Unoki, Masato Toi, Masato Akagi
Forum Acusticum Budapest 2005: 4th European Congress on Acoustics, 51-56, 2005
Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis
T Saitou, M Unoki, M Akagi
SPEECH COMMUNICATION, 46, 3-4, 405-417, 2005
A study on a speech recognition method based on the selective sound segregation in noisy environment
Haniu, A, Unoki, M, Akagi, M
Proc. NCSP05, 403-406, 2005
A model for selective segregation of a target instrument sound from the mixed sound of various instruments
Masashi Unoki, Masaaki Kubo, Atsushi Haniu, Masato Akagi
INTERSPEECH 2005 - Eurospeech, 9th European Conference on Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005, 2097-2100, 2005
A speech dereverberation method based on the MTF concept in power envelope restoration
Masashi Unoki, Keigo Sakata, Masakazu Furukawa, Masato Akagi
Acoustical Science and Technology, 25, 4, 243-254, 2004
An improved method based on the MTF concept for restoring the power envelope from a reverberant signal
Masashi Unoki, Masakazu Furukawa, Keigo Sakata, Masato Akagi
Acoustical Science and Technology, 25, 4, 232-242, 2004
Temporal decomposition of speech and its application to speech coding and modification
Akagi, M, Nguyen, P. C, Saitou, T, Tsuji, N, Unoki, M
Proc. KEST2004, 280-288, 2004
Analysis of acoustic features affecting "singing-ness" and its application to singing-voice synthesis from speaking-voice
Takeshi Saitou, Naoya Tsuji, Masashi Unoki, Masato Akagi
INTERSPEECH 2004 - ICSLP, 8th International Conference on Spoken Language Processing, Jeju Island, Korea, October 4-8, 2004, 1929-1932, 2004
A speech dereverberation method based on the MTF concept using adaptive time-frequency divisions
Masashi Unoki, Masato Toi, Masato Akagi
2004 12th European Signal Processing Conference, Vienna, Austria, September 6-10, 2004, 34, 5, 1689-1692, 2004
Extending the domain of center frequencies for the compressive gammachirp auditory filter
RD Patterson, M Unoki, T Irino
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 114, 3, 1529-1542, 2003
A speech dereverberation method based on the MTF concept
Masashi Unoki, Keigo Sakata, Masato Akagi
Proc. EuroSpeech2003, Geneva, Switzerland, 1417-1420, 2003
Development of the F0 control method for singing-voices synthesis
Saitou, T, Unoki, M, Akagi, M
Proc. SP2004, 491-49-, 2003
A method based on the MTF concept for dereverberating the power envelope from the reverberant signal
M Unoki, M Furukawa, K Sakata, M Akagi
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, vol. I, pp. 840-943, 888-891, 2003
A method for recovering the power envelope from reverberant speech
Unoki, M, Furukawa, M, Akagi, M
Forum Acusticum Sevilla 2002, SPA-Gen-002, 2002
Extraction of F0 dynamic characteristics and development of F0 control model in singing voice
Saitou, T, Unoki, M, Akagi, M
Proc. ICAD2002, 275-278, 2002
Improvement of an IIR asymmetric compensation gammachirp filter
Masashi Unoki, Toshio Irino, Roy D. Patterson
Acoustical Science and Technology, 22, 6, 426-430, 2001
An analysis/synthesis auditory filterbank based on an IIR gammachirp filter
T Irino, M Unoki
COMPUTATIONAL MODELS OF AUDITORY FUNCTION, 312, 49-64, 2001
A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency
Yuichi Ishimoto, Masashi Unoki, Masato Akagi
EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001, 2439-2442, 2001
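Conference report on Eurospeech99 and IEEE MMSP99
Satoshi Nakamura, Shigeki Okawa, Akinori Ito, Masafumi Tamoto, Hideyuki Mizuno, Masashi Unoki, Keiichi Tokuda, Tokihiko Kaburagi, Nobuo Hataoka
IPSJ SIG Technical Report, SLP (Spoken Language Processing), 28, 91, 21-28, 1999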
A method of signal extraction from noisy signal based on auditory scene analysis
M Unoki, M Akagi
SPEECH COMMUNICATION, 27, 3-4, 261-279, 1999
Analysis/synthesis auditory filterbank based on an IIR implementation of the gammachirp
Toshio Irino, Masashi Unoki
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 20, 6, 397-406, 1999
Segregation of vowel in background noise using the model of segregating two acoustic sources based on auditory scene analysis
Masashi Unoki, Masato Akagi
Sixth European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999, -, 1999
Signal extraction from noisy signal based on auditory scene analysis
Masashi Unoki, Masato Akagi
The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998, 98, 1-29, 1998
A time-varying, analysis/synthesis auditory filterbank using the gammachirp
T Irino, M Unoki
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 3653-3656, 1998
A method for signal extraction from noise-added signals
M Unoki, M Akagi
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 80, 11, 1-11, 1997
A method of signal extraction from noisy signal
Masashi Unoki, Masato Akagi
Fifth European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997, 5, 2583-2586, 1997