DANG, Jianwu Professor
School of Information Science, Human Life Design Area
B.E. from Tsinghua University, China (1982), M.E. from Tsinghua University, China (1984), Ph.D. from Shizuoka University (1992)
◆Professional Experience
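2006 - 2007 : Japan Advanced Institute of Science and Technology , Professor
2005 - : Japan Advanced Institute of Science and Technology, School of Information Science (graduate school) , Professor
2004 - : Japan Advanced Institute of Science and Technology, School of Information Science , Professor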
: Visiting researcher of ATR Human Information Processing Research Laboratories (1992), Senior researcher of ATR Human Information Processing Research Laboratories (1998-2001)
: Associate of Dept. of Computer Science and Technology of Tianjin University, China (1984), Lecturer of Dept. of Computer Science and Technology of Tianjin University, China (1986-1988)
Oral medicine (pathology), Rehabilitation science, Intelligent robotics, Perceptual information processing
◆Research Keywords
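Nostril radiation, speech perception, formant, articulatory movement, extended finite element method, finite element method, feeding and swallowing, MRI, MRI measurement, buzz sound, tongue movement, speech synthesis and speech processing, speech disorder, voiced plosive, speech production, transformed auditory feedback, velum, articulatory model of the velum, speech training, partial glossectomy, ingestion and swallowing, estimation of speech state, electromyographic (EMG) signal, speech movement, facial image, individuality, physiological model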
◆Research Interests
Research on speech recognition considering auditory, articulatory and physiological features:
We aim to develop novel speech recognition methods that take human mechanisms into account. We use human auditory properties to make speech recognition robust in noisy environments, coarticulation mechanisms to recognize speech with missing portions, and physiological features for speaker identification.
Research on speech production mechanisms and their modeling:
Many questions about the mechanisms of speech production remain unsolved, especially for the production of emotional speech. To answer them, we use a physiological articulatory model, developed from MRI data by this laboratory and ATR, to simulate both the process from articulatory targets to speech sounds and the inverse process from speech sounds to articulatory targets. Iterating between the two directions lets us approach the “true” mechanisms. An additional part of this topic is refining the articulatory model based on physiological discoveries.
Research on speech cognitive science:
Speech cognition (perception) can be regarded as the inverse of speech production. Because many different articulatory configurations can produce the same sound, cognition involves a one-to-many inverse problem, which is a central topic in speech cognition. We address this problem by investigating its causes, which relate to the stability of the articulatory configuration and to physiological and morphological constraints, using the physiological articulatory model.
Research on speech communication within the brain
According to the motor theory of speech perception, a well-known hypothesis, speech is perceived with reference to images or knowledge held in the motor (production) areas (Liberman et al., 1960, 1985). In this research, we test this theory by investigating the interaction between speech perception and production through acoustic analysis, EMG measurement, and articulatory observation.
Research on speech synthesis with specific individuality and emotion
・ Individuality of speech depends on physiological (inborn) factors and social (habit-forming) factors. In this study, we focus on analyzing and modeling the effects of the former on speech.
・ Emotion is paralinguistic information that describes the state of the speaker and cannot be produced logically. This study investigates emotional speech generation by applying our experience to the articulatory model, and aims to clarify the relation between emotion and acoustic parameters other than the fundamental frequency.
Science and Technology of Speech communication: Process of speech production and its inverse process - cognition
Communication through speech production and speech perception is one of the basic ways for humans to exchange information. The research goal of our laboratory is to fully understand these human mechanisms and to realize them in a computer system.


◆Published Papers
Story co-segmentation of Chinese broadcast news using weakly-supervised semantic similarity.
Yujun Zhang, Zhi-Qiang Liu, Jianwu Dang
Neurocomputing, 355, 121-133, 2019
Traffic model of machine-type communication for railway signal equipment based on MMPP
Lin Junting, Hu Xueyang, Dang Jianwu, Wu Zhongqing
Replay attack detection with auditory filter-based relative phase features.
Zeyan Oo, Longbiao Wang, Khomdet Phapatanaburi, Meng Liu, Seiichi Nakagawa, Masahiro Iwahashi, Jianwu Dang
EURASIP J. Audio, Speech and Music Processing, 2019, 8-, 2019
Investigation of the Comprehension Process during Silent Reading based on Eye Movements
Di Zhou, Jinfeng Huang, Jianwu Dang
2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, 165-169, 2019
平井啓之, 竹本浩典, 本多清志, 党建武
Journal of the Acoustical Society of Japan, 64, 4, 216-228, 2008
D-14-17 A Study on the Acoustic Features of Plosive Consonants (D-14. Speech, General Session)
馬鋭, 方強, 錦戸信和, 廬緒剛, 党建武, 星野朱美
Proceedings of the IEICE General Conference, 2008, 1, -, 2008
錦戸信和, 党建武
IEICE Technical Report, SP (Speech), 107, 165, 1-6, 2007
金野武司, 錦戸信和, 党建武
IEICE Technical Report, SP (Speech), 107, 165, 43-48, 2007
Studies on Speech Production
Springer, 2018, 2019
125-140, Nikkei Printing Co., Ltd. (株式会社ニッケイ印刷), 2015
“Observations About Articulatory, Acoustic and Perceptual Characteristics of Laugh and Smile Speech in Comparison with Sad and Neutral Speech” In Trouvain, Jürgen & Campbell, Nick (eds.), Phonetics of Laughing
1–29, Universaar – Saarland University Press: Saarbrücken, Germany, 2014
Physiological Articulatory Model for Investigating Speech Production: Modeling and Control
ISBN-NR. 978-3639173871, VDM Verlag, 2009
Advances in Chinese Spoken Language Processing, C. H. Lee, et al.(Chapter)
World Scientific, 2007
◆Conference Activities & Talks
Robust Detection of Link Communities in Large Social Networks by Exploiting Link Semantics
AAAI-18, Feb. 2–7, 2018, New Orleans, Louisiana, USA, 2018
Autoencoder Based Community Detection with Adaptive Integration of Network Topology and Node Contents.
In: Liu W., Giunchiglia F., Yang B. (eds) Knowledge Science, Engineering and Management. KSEM 2018. Lecture Notes in Computer Science, vol 11062. Springer, Cham, 2018
ICASSP, April 15-20, 2018, Calgary, Canada, 2018
Implicit Discourse Relation Recognition using Neural Tensor Network with Interactive Attention and Sparse Learning
COLING 2018, pages 547-558, 2018
Revealing Spatiotemporal Brain Dynamics of Speech Production Based on EEG and Eye Movement
INTERSPEECH, Sep. 2-6, 2018, Hyderabad, India, 2018

■Teaching Experience

Analysis for Information Science (E), Speech Signal Processing, Advanced Topics in Information Analysis (E), Advanced Topics in Speech Information Processing

■Contributions to Society

◆Academic Society Affiliations
ACTA AUTOMATICA SINICA, Tianjin Computer Society, China Computer Federation, Association for Computing Machinery (ACM), International Speech Communication Association, Acoustical Society of America, The Institute of Image Information and Television Engineers, Acoustical Society of Japan, The Institute of Electronics, Information and Communication Engineers
◆Academic Contribution
International workshop on perception and representation of intention involved in spoken language , Professor Jianwu Dang (School of Information Science) , 2013 , HighTech Center, Ishikawa, Japan
Symposium on Modeling of Speech and Audiovisual Mechanism , Professor Jianwu Dang (School of Information Science); Professor Akagi (School of Information Science) , 2011 - 2011 , Ishikawa Lifelong Learning Center, Kanazawa, Ishikawa, Japan
The international academic exchange forum , Professor Jianwu Dang (School of Information Science); Professor Masato Akagi (School of Information Science) , 2010 - 2010 , HighTech Center, Nomi, Ishikawa, Japan