JAIST Repository: 一覧: 著者

トップページ| 北陸先端科学技術大学院大学| 附属図書館

一覧

コミュニティ
& コレクション
タイトル
著者
日付
学位論文
リサーチレポート・テクニカルメモランダム

登録利用者:

登録者ページ
利用者(E-people)

当システムについて

JAIST Repository >

著者: "Akagi, Masato"

「一覧: 著者」画面に戻る
タイトル順ソート	日付順ソート

208 著者名表示.

発行日	タイトル	著者
1995	Speaker individualities in speech spectral envelopes	Kitamura, Tatsuya; Akagi, Masato
1997	Speaker individuality in fundamental frequency contours and its control	Akagi, Masato; Ienaga, Taro
Mar-1997	雑音が付加された波形からの信号波形の一抽出法	鵜木, 祐史; 赤木, 正人; UNOKI, Masashi; AKAGI, Masato
6-Feb-1998	A computational model of co-modulation masking release	Unoki, Masashi; Akagi, Masato
6-Feb-1998	A method of signal extraction from noisy signal based on auditory scene analysis	Unoki, Masashi; Akagi, Masato
Apr-1999	A method of signal extraction from noisy signal based on auditory scene analysis	Unoki, Masashi; Akagi, Masato
20-Apr-1999	マイクロホン対を用いたスペクトルサブトラクションによる雑音除去法	水町, 光徳; 赤木, 正人; MIZUMACHI, Mitsunori; AKAGI, Masato
20-Oct-1999	聴覚の情景解析に基づいた雑音下の調波複合音の一抽出法	鵜木, 祐史; 赤木, 正人; UNOKI, Masashi; AKAGI, Masato
2000	The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises	Mizumachi, Mitsunori; Akagi, Masato
Jul-2000	A computational model of auditory sound localization based on ITD	Ito, Kazuhito; Akagi, Masato
1-Jul-2000	蝸牛神経核細胞の機能モデルの提案 : 前腹側核細胞の応答特性	牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
25-Dec-2000	2.聴覚モデルの系譜 : 聴覚分野(〈特集〉-音響学における20世紀の成果と21世紀に残された課題-)	赤木, 正人; Akagi, Masato
2001	Computational Models of Auditory Function : A computational model of auditory sound localization	Ito, Kazuhito; Akagi, Masato
2001	Computational Models of Auditory Function : A computational model of co-modulation masking release	Unoki, Masashi; Akagi, Masato
2002	Enabling Society With Information Technology : Speech enhancement and segregation based on human auditory mechanisms	Akagi, Masato; Mizumachi, Mitsunori; Ishimoto, Yuichi; Unoki, Masashi
25-Dec-2002	蝸牛神経核腹側核細胞モデルの振幅変調音に対する応答特性	Amplitude modulation; 牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
25-Dec-2002	初期聴覚系における神経発火の時間-周波数応答パタン(<小特集>末梢聴覚機能解析の動向)	牧, 勝弘; 伊藤, 一仁; 赤木, 正人; Maki, Katuhiro; Ito, Kazuhito; Akagi, Masato
1-Mar-2003	Modified Restricted Temporal Decomposition and Its Application to Low Rate Speech Coding	NGUYEN, Phu Chien; OCHI, Takao; AKAGI, Masato
25-Dec-2003	蝸牛神経核背側核細胞の周波数応答特性に関する神経回路モデルの提案 : トーンバースト刺激に対する応答	牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
2004	A speech dereverberation method based on the MTF concept in power envelope restoration	Unoki, Masashi; Sakata, Keigo; Furukawa, Masakazu; Akagi, Masato
2004	An improved method based on the MTF concept for restoring the power envelope from a reverberant signal	Unoki, Masashi; Furukawa, Masakazu; Sakata, Keigo; Akagi, Masato
1-Jan-2004	Fundamental Frequency Estimation for Noisy Speech Using Entropy-Weighted Periodic and Harmonic Features	ISHIMOTO, Yuichi; ISHIZUKA, Kentaro; AIKAWA, Kiyoaki; AKAGI, Masato
1-Jun-2004	下丘細胞の時間応答特性に関する計算モデルの提案	牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
2005	Toward a rule-based synthesis of emotional speech on linguistic description of perception	Huang, Chun-Fang; Akagi, Masato
2005	Study on improving regularity of neural phase locking in single neurons of AVCN via a computational model	Ito, Kazuhito; Akagi, Masato
2005	A computational model of cochlear nucleus neurons	Maki, Katuhiro; Akagi, Masato
28-Mar-2005	Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency	Ishimoto, Yuichi; Unoki, Masashi; Akagi, Masato
Jul-2005	Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis	Saitou, Takeshi; Unoki, Masashi; Akagi, Masato
2006	A Model-Concept of the Selective Sound Segregation : A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments	Unoki, Masashi; Kubo, Masaaki; Haniu, Atsushi; Akagi, Masato
2006	Multi-channel noise reduction in noisy environments	Li, Junfeng; Akagi, Masato; Suzuki, Yoiti
2006	A Study on Restoration of Bone-Conducted Speech with MTF-Based and LP-Based Models	Thang, Tat Vu; Kimura, Kenji; Unoki, Masashi; Akagi, Masato
Feb-2006	A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments	Li, Junfeng; Akagi, Masato
1-Apr-2006	有限要素法による声道伝達特性推定の有効性に関する検討	西本, 博則; 赤木, 正人; 北村, 達也; 鈴木, 規子; Nishimoto, Hironori; Akagi, Masato; Kitamura, Tatsuya; Suzuki, Noriko
Jul-2006	Effect of ITD and component frequencies on perception of alarm signals in noisy environments	Nakanishi, Josaku; Unoki, Masashi; Akagi, Masato
Jul-2006	Effects of complicated vocal tract shape on vocal tract transfer function	Nishimoto, Hironori; Akagi, Masato
1-Jul-2006	Noise reduction method based on generalized subtractive beamformer	Li, Junfeng; Akagi, Masato
2007	Advances for In-Vehicle and Mobile Systems : Noise reduction based on microphone array and post-filtering for robust speech recognition in car environments	Li, Junfeng; Lu, Xugang; Akagi, Masato
Apr-2007	Limited error based event localizing temporal decomposition and its application to variable-rate speech coding	Nguyen, Phu Chien; Akagi, Masato; Nguyen, Binh Phu
Jul-2007	Spectral Modification for Voice Gender Conversion using Temporal Decomposition	Nguyen, Binh Phu; Akagi, Masato
Oct-2007	Speech-to-Singing Synthesis: Converting Speaking Voices to Singing Voices By Controlling Acoustic Features Unique to Singing Voices,	Saitou, Takeshi; Goto, Masataka; Unoki, Masashi; Akagi, Masato
Oct-2007	Improvement of Detectability of Alarm Signal in Noisy Environments by Utilizing Spatial Cues	Uchiyama, Hideaki; Unoki, Masashi; Akagi, Masato
5-Oct-2007	LP-based method of blind restoration to improve intelligibility of bone-conducted speech	Thang, Tat Vu; Unoki, Masashi; Akagi, Masato
2008	Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems	Lu, Xugang; Unoki, Masashi; Akagi, Masato
1-May-2008	歌声らしさの知覚モデルに基づいた歌声特有の音響特徴量の分析	齋藤, 毅; 辻, 直也; 鵜木, 祐史; 赤木, 正人; Saitou, Takeshi; Tsuji, Naoya; Unoki, Masashi; Akagi, Masato
Jun-2008	An LP-based blind model for restoring bone-conducted speech	Vu, Thang tat; Unoki, Masashi; Akagi, Masato
Jun-2008	Phoneme-based Spectral Voice Conversion Using Temporal Decomposition and Gaussian Mixture Model	Nguyen, Binh Phu; Akagi, Masato
Jun-2008	A hybrid microphone array post-filter in a diffuse noise field	Li, Junfeng; Akagi, Masato
1-Jun-2008	A Two-Microphone Noise Reduction Method in Highly Non-stationary Multiple-Noise-Source Environments	LI, Junfeng; AKAGI, Masato; SUZUKI, Yoiti
Jul-2008	Estimation of local peaks based on particle filter in adverse environments	Tomoike, Seiji; Akagi, Masato
23-Sep-2008	Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimization	Li, Junfeng; Jiang, Hui; Akagi, Masato
24-Sep-2008	High-quality analysis/synthesis method based on Temporal decomposition for speech modification	Nguyen, Binh Phu; Shibata, Takeshi; Akagi, Masato
24-Sep-2008	Robust front end processing for speech recognition in reverberant environments: Utilization of speech characteristics	Petrick, Rico; Lu, Xugang; Unoki, Masashi; Akagi, Masato; Hoffmann, Ruediger
Oct-2008	A three-layered model for expressive speech perception	Huang, Chun-Fang; Akagi, Masato
Nov-2008	Adaptive β-order generalized spectral subtraction for speech enhancement	Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yoiti
25-Dec-2008	アジアの音	赤木, 正人; Akagi, Masato
2009	聴覚末梢系の機能モデルの提案－聴神経の位相固定性及びスパイク生成機構のモデル化－	牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
2009	A flexible spectral modification method based on temporal decomposition and Gaussian mixture model	Nguyen, Binh Phu; Akagi, Masato
1-Mar-2009	A study on nonlinguistic features in singing and speaking voices by brain activity measurement	Nakamura, Tomohiko; Kitamura, Tatsuya; Akagi, Masato
1-Mar-2009	An emotional speech recognition system based on multi-layer emotional speech perception model	Aoki, Yuusuke; Huang, Chun-Fang; Akagi, Masato
1-Mar-2009	An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech	Kinugasa, Kota; Unoki, Masashi; Akagi, Masato
1-Mar-2009	Effects from Spatial Cues on Detectability of Alarm Signals in Car Environments	Kuroda, Naoki; Li, Junfeng; Iwaya, Yukio; Unoki, Masashi; Akagi, Masato
Apr-2009	Psychoacoustically-motivated adaptive β-order generalized spectral subtraction for cochlear implant patients	Li, Junfeng; Fu, Qian-Jie; Jiang, Hui; Akagi, Masato
Jul-2009	An MTF-based method of blind restoration for improving intelligibility of bone-conducted speech	Kinugasa, Kota; Unoki, Masashi; Akagi, Masato
25-Aug-2009	MTF-based power envelope restoration in noisy reverberant environments	Unoki, Masashi; Yamasaki, Yutaka; Akagi, Masato
8-Sep-2009	感情音声知覚モデルの提案とその応用	赤木, 正人; AKAGI, Masato
9-Sep-2009	Efficient modeling of temporal structure of speech for applications in voice transformation	Nguyen, Binh Phu; Akagi, Masato
Oct-2009	Two-stage binaural speech enhancement with Wiener filter based on equalization-cancellation model	Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yoiti
2010	Comparison of Emotion Perception among Different Cultures	Dang, Jianwu; Li, Aijun; Erickson, Donna; Suemitsu, Atsuo; Akagi, Masato; Sakuraba, Kyoko; Minematsu, Nobuaki; Hirose, Keikichi
Mar-2010	赤木研究室（北陸先端科学技術大学院大学）	赤木, 正人; Akagi, Masato
4-Mar-2010	A study on brain activities elicited by synthesized emotional voices controlled with prosodic features	Hamada, Yasuhiro; Kitamura, Tatsuya; Akagi, Masato
4-Mar-2010	A study on the IMTF-based filtering for the modulation spectrum of reverberant speech	Morita, Shota; Unoki, Masashi; Akagi, Masato
4-Mar-2010	Experimental evaluations of TS-BASE/WF in reverberant conditions	Li, Junfeng; Sasaki, Yuuki; Akagi, Masato; Yan, Yonghong
4-Mar-2010	Pitch perception of complex sounds with varied fundamental frequency and spectral tilt	Ishida, Mai; Akagi, Masato
2-Jun-2010	Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication	Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yôiti
Jul-2010	A Study on the IMTF-Based Filtering on the Modulation Spectrum of Reverberant Signal	Morita, Shota; Unoki, Masashi; Akagi, Masato
1-Aug-2010	音声に含まれる感情情報の認識 : 感情空間をどのように表現するか	赤木, 正人; Akagi, Masato
30-Sep-2010	A DOA estimation algorithm based on equalization-cancellation theory	Chau, Duc Thanh; Li, Junfeng; Akagi, Masato
1-Oct-2010	A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features	ZHOU, Yu; LI, Junfeng; SUN, Yanqing; ZHANG, Jianping; YAN, Yonghong; AKAGI, Masato
Nov-2010	Intelligibility Investigation of Single-Channel Noise Reduction Algorithms for Chinese and Japanese	Li, Junfeng; Yang, Lin; Yan, Yonghong; Thanh, Chau Duc; Akagi, Masato
2011	An investigation on perceptual line spectral frequency (PLP-LSF) target stability against the vowel neutralization phenomenon	Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato
Feb-2011	An investigation on speech perception over coarticulation	Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato
1-Mar-2011	Study on suitable-architecture of IIR all-pass filter for digital-audio watermarking technique based on cochlear-delay characteristics	KOSUGI, Toshizo; HANIU, Atsushi; MIYAUCHI, Ryota; UNOKI, Masashi; AKAGI, Masato
2-Mar-2011	音声の知覚と認識 : 人は脳で音声を聞く．機械は？	赤木, 正人; 羽二生, 篤; AKAGI, Masato; HANIU, Atsushi
2-Mar-2011	A binaural model accounting for spatial masking release	Mizukawa, Shinya; Akagi, Masato
2-Mar-2011	Study on blind estimation of Speech Transmission Index in room acoustics	Ikeda, Tomohiro; Unoki, Masashi; Akagi, Masato
2-Mar-2011	Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals	Yano, Yuta; Miyauchi, Ryota; Unoki, Masashi; Akagi, Masato
2-Mar-2011	Study on MTF-based power envelope restoration in noisy reverberant environments	Morita, Shota; Lu, Xugang; Unoki, Masashi; Akagi, Masato
3-Mar-2011	Influences of transformed auditory feedback with first three formant frequencies	Shih, Tsungming; Suemitsu, Atsuo; Akagi, Masato
3-Mar-2011	Towards an intelligent binaural speech enhancement system by integrating meaningful signal extraction	Chau, Duc Thanh; Li, Junfeng; Akagi, Masato
1-Apr-2011	聴覚フィードバック下での音声知覚・生成の同時脳活動計測に関する研究	赤木, 正人; Akagi, Masato
10-May-2011	Comparative intelligibility investigation of single-channel noise-reduction algorithms for Chinese, Japanese, and English	Li, Junfeng; Yang, Lin; Zhang, Jianping; Yan, Yonghong; Hu, Yi; Akagi, Masato; C. Loizou, Philipos
Jul-2011	Towards intelligent binaural speech enhancement by meaningful sound extraction	Chau, Duc Thanh; Li, Junfeng; Akagi, Masato
5-Mar-2012	Study on hearing impression of speaker identification focusing on dynamic features	Izumida, Tsuyoshi; Akagi, Masato
5-Mar-2012	Speech enhancement technique in noisy reverberant environment using two microphone arrays	Sasaki, Yuuki; Akagi, Masato
6-Mar-2012	Study on detectability of signals by utilizing differences in their amplitude modulation	Yano, Yuta; Miyauchi, Ryota; Unoki, Masashi; Akagi, Masato
22-Aug-2012	Privacy protection for speech based on concepts of auditory scene analysis	AKAGI, Masato; IRIE, Yoshihiro
Sep-2012	A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model	Phung, Nghia Trung; Unoki, Masashi; Akagi, Masato
Dec-2012	Speech Emotion Recognition System Based on a Dimensional Approach Using a Three-Layered Model	Elbarougy, Reda; Akagi, Masato
Dec-2012	A concatenative speech synthesis for monosyllabic languages with limited data	Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato
Dec-2012	Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages	Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato
Mar-2013	A singing voices synthesis system to characterize vocal registers using ARX-LF model	Motoda, Hiroki; Akagi, Masato
Mar-2013	A Study on individualization of Head-Related Transfer Function in the median plane	Hisatsune, Hideki; Akagi, Masato
15-May-2013	音声中の感情認識のための新しい認識方略に関する研究	赤木, 正人; Akagi, Masato
2-Jun-2013	Exploring auditory aging can exclusively explain Japanese adults′ age-related decrease in training effects of American English /r/-/l/	Kubo, Rieko; Akagi, Masato
8-Jul-2013	Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio	Chau, Duc Thanh; Li, Junfeng; Akagi, Masato
27-Aug-2013	Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese	Li, Junfeng; Chen, Fei; Akagi, Masato; Yan, Yonghong
Sep-2013	Acoustic sound source tracking for a moving object using precise Doppler-shift measurement	Nishie, Suminori; Akagi, Masato
2-Sep-2013	A Hybrid TTS between Unit Selection and HMM-based TTS under limited data conditions	Phung, Trung-Nghia; Luong, Chi Mai; Akagi, Masato
Oct-2013	Admissible range for individualization of head-related transfer function in median plane	Akagi, Masato; Hisatsune, Hideki
Oct-2013	Cross-lingual Speech Emotion Recognition System Based on a Three-Layer Model for Human Perception	Elbarougy, Reda; Akagi, Masato
1-Nov-2013	Improving Naturalness of HMM-Based TTS Trained with Limited Data by Temporal Decomposition	PHUNG, Trung-Nghia; PHAN, Thanh-Son; VU, Thang Tat; LUONG, Mai Chi; AKAGI, Masato
2014	Speech recognition in noisy conditions based on speech separation using Non-negative Matrix Factorization	Du, Yuxuan; Akagi, Masato
2014	Study on Analyzing Individuality of Instrurment Sounds Using Non-negative Matrix Factorization	Kobayashi, Keisuke; Morikawa, Daisuke; Akagi, Masato
2014	Glottal source analysis of emotional speech	Li, Yongwei; Akagi, Masato
2014	Improving speech emotion dimensions estimation using a three-layer model of human perception	Elbarougy, Reda; Akagi, Masato
2014	Investigation of objective measures for intelligibility prediction of noise-reduced speech for Chinese, Japanese, and English	Li, Junfeng; Xia, Risheng; Ying, Dongwen; Yan, Yonghong; Akagi, Masato
1-Apr-2014	音情景解析の概念にもとづいた音声プライバシー保護	赤木, 正人; 入江, 佳洋; Akagi, Masato; Irie, Yoshihiro
1-Apr-2014	弦楽器F0 推定のための精密周波数測定方法	西江, 純教; 赤木, 正人; Nishie, Suminori; Akagi, Masato
Jul-2014	Toward relaying emotional state for speech-to-speech translator: Estimation of emotional state for synthesizing speech with emotion	Akagi, Masato; Elbarougy, Reda
Aug-2014	Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System	Akagi, Masato; Han, Xiao; Elbarougy, Reda; Hamada, Yasuhiro; Li, Junfeng
Sep-2014	Toward relaying an affective Speech-to-Speech translator: Cross-language perception of emotional state represented by emotion dimensions	Elbarougy, Reda; Xiao, Han; Akagi, Masato; Li, Junfeng
1-Oct-2014	Binaural Sound Source Localization in Noisy Reverberant Environments Based on Equalization-Cancellation Theory	Chau, Thanh-Duc; Li, Junfeng; Akagi, Masato
Feb-2015	A study on perception of emotional states in multiple languages on Valence-Activation approach	Han, Xiao; Elbarougy, Reda; Akagi, Masato; Li, Junfeng; Ngo, Thi Duyen; Bui, The Duy
1-Sep-2015	Dependence on age of interference with phoneme perception by first- and second-language speech maskers	Kubo, Rieko; Akagi, Masato; Akahane-Yamada, Reiko
28-Oct-2015	Toward Improving Estimation Accuracy of Emotion Dimensions in Bilingual Scenario Based on Three-layered Model	LI, Xingfeng; Akagi, Masato
Dec-2015	Study on method to control fundamental frequency contour related to a position on Valence-Activation space	Hamada, Yasuhiro; Elbarougy, Reda; Xue, Yuawn; Akagi, Masato
19-Dec-2015	Emotional speech synthesis system based on a three-layered model using a dimensional approach	Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato
2016	Voice Conversion to Emotional Speech based on Three-layered Model in Dimensional Approach and Parameterization of Dynamic Features in Prosody	Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato
Mar-2016	A study on quality improvement of HMM-based synthesized voices using asymmetric bilinear model	Dinh-Anh, Tuan; Morikawa, Daisuke; Akagi, Masato
Mar-2016	Automatic Speech Emotion Recognition in Chinese Using a Three-layered Model in Dimensional Approach	Li, Xingfeng; Akagi, Masato
Mar-2016	A study on applying target prediction model to parameterize power envelope of emotional speech	Xue, Yawen; Akagi, Masato
21-Aug-2016	Effects of speaker's and listener's acoustic environments on speech intelligibility and annoyance	Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato
Oct-2016	Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using non-negative matrix factorization	Dinh, Anh-Tuan; Akagi, Masato
Oct-2016	Voice conversion system to emotional speech in multiple languages based on three-layered model for dimensional space	Xue, Yawen; Hamada, Yasuhiro; Elbarougy, Reda; Akagi, Masato
18-Oct-2016	Optimizing Fuzzy Inference Systems for Improving Speech Emotion Recognition	Elbarougy, Reda; Akagi, Masato
12-Nov-2016	Quality Improvement of Vietnamese HMM-Based Speech Synthesis System Based on Decomposition of Naturalness and Intelligibility using Non-negative Matrix Factorization	Dinh, Anh-Tuan; Phan, Thanh-Son; Akagi, Masato
2017	Acoustical Analyses of Tendencies of Intelligibility in Lombard Speech with Different Background Noise Levels	Ngo, Thuan Van; Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato
2-Mar-2017	Acoustical analyses of Lombard speech by different background noise levels for tendencies of intelligibility	Ngo, Thuan Van; Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato
2-Mar-2017	Articulatory Characteristics of Expressive Speech in Activation-Evaluation Space	Asai, Takuya; Suemitsu, Atsuo; Akagi, Masato
1-Jun-2017	ヒト発話シミュレータによるStory Teller Systemの構築	赤木, 正人; Akagi, Masato
26-Oct-2017	Weighted Robust Principal Component Analysis with Gammatone Auditory Filterbank for Singing Voice Separation	Li, Feng; Akagi, Masato
1-Nov-2017	Feature Selection Method for Real-time Speech Emotion Recognition	Elbarougy, Reda; Akagi, Masato
15-Dec-2017	Speech Emotion Recognition Using Multichannel Parallel Convolutional Recurrent Neural Networks based on Gammatone Auditory Filterbank	Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato
2018	Unsupervised Singing Voice Separation Based on Robust Principal Component Analysis Exploiting Rank-1 Constraint	Li, Feng; Akagi, Masato
2018	A Three-Layer Emotion Perception Model for Valence and Arousal-Based Detection from Multilingual Speech	Li, Xingfeng; Akagi, Masato
5-Mar-2018	Perceptual grouping with prosodic features in Japanese dialects	Zhang, Ling; Akagi, Masato
6-Mar-2018	Study on differences between perceptions of Japanese and Chinese emotional speech by Japanese and Chinese listeners	Zhang, Chenyi; Akagi, Masato
7-Mar-2018	Non-parallel training dictionary-based voice conversion with Variational Autoencoder	Vu, Ho-Tuan; Akagi, Masato
7-Mar-2018	Synthesis of expressive singing voice by F0, amplitude envelope and spectral feature conversion	Nguyen, Thi-Hao; Akagi, Masato
7-Mar-2018	Estimation of glottal source waveform and vocal tract shape for singing-voice analysis	Takahashi, Kyoko; Akagi, Masato
19-Jul-2018	Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space	Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato
25-Jul-2018	Nonparallel Dictionary-Based Voice Conversion Using Variational Autoencoder with Modulation-Spectrum-Constrained Training	Ho, Tuan Vu; Akagi, Masato
26-Jul-2018	Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation	Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato
22-Aug-2018	Contributions of the glottal source and vocal tract cues to emotional vowel perception in the valence-arousal space	Li, Yongwei; Li, Junfeng; Akagi, Masato
11-Sep-2018	Commonalities of Glottal Sources and Vocal Tract Shapes Among Speakers in Emotional Speech	Li, Yongwei; Sakakibara, Ken-Ichi; Morikawa, Daisuke; Akagi, Masato
15-Nov-2018	Maximal Information Coefficient and Predominant Correlation-Based Feature Selection Toward A Three-Layer Model for Speech Emotion Recognition	Li, Xingfeng; Akagi, Masato
15-Nov-2018	Estimation of glottal source waveforms and vocal tract shape for singing voices with wide frequency range	Takahashi, Kyoko; Akagi, Masato
15-Nov-2018	Unsupervised Singing Voice Separation Using Gammatone Auditory Filterbank and Constraint Robust Principal Component Analysis	Li, Feng; Akagi, Masato
2019	The Contribution of Acoustic Features Analysis to Model Emotion Perceptual Process for Language Diversity	Li, Xingfeng; Akagi, Masato
6-Mar-2019	Study on Perception of Speaker Age by Semantic Differential Method	Li, Yang; Kobayashi, Maori; Akagi, Masato
6-Mar-2019	Variation of Formant Amplitude and Frequencies in Vowel Spectrum uttered under Various Noisy Environments	Matsumoto, Shumpei; Akagi, Masato
6-Mar-2019	Study on Relationship between Degree of Emphasis and Acoustic Feature for Synthesizing Emphasized Speech	Ohtani, Yasuhiro; Akagi, Masato
6-Mar-2019	Relationship between discomfort sound and its physical correlates	Takahashi, Yumiko; Akagi, Masato
7-Mar-2019	Study on Nonlinear Relationships between Semantic Primitives and Emotional Dimensions for Improving Three-layered Model	Liu, Xingyu; Elbarougy, Reda Elsaid; Akagi, Masato
7-Mar-2019	Study on Relations between Emotion Perception and Acoustic Features using Speech Morphing Techniques	Wang, Zi; Kobayashi, Maori; Akagi, Masato
3-Apr-2019	Improving multilingual speech emotion recognition by combining acoustic features in a three-layer model	Li, Xingfeng; Akagi, Masato
17-Apr-2019	Blind Monaural Singing Voice Separation Using Rank-1 Constraint Robust Principal Component Analysis and Vocal Activity Detection	Li, Feng; Akagi, Masato
6-May-2019	Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model	Li, Yongwei; Sakakibara, Ken-Ichi; Akagi, Masato
16-Jul-2019	Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model	Atmaja, Bagus Tris; Akagi, Masato
19-Nov-2019	Non-parallel Voice Conversion with Controllable Speaker Individuality using Variational Autoencoder	Ho, Tuan Vu; Akagi, Masato
19-Nov-2019	Speech Emotion Recognition Using Speech Feature and Word Embedding	Atmaja, Bagus Tris; Shirai, Kiyoaki; Akagi, Masato
19-Nov-2019	Evaluation of the Lombard Effect Model on Synthesizing Lombard Speech in Varying Noise Level Environments with Limited Data	Ngo, Thuan Van; Kubo, Rieko; Akagi, Masato
19-Nov-2019	Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Network	Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato
20-Nov-2019	Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking	Li, Feng; Qian, Kaizhi; Hasegawa-Johnson, Mark; Akagi, Masato
14-Dec-2019	Combining F0 and non-negative constraint robust principal component analysis for singing voice separation	Li, Feng; Akagi, Masato
23-Dec-2019	Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model	Li, Yongwei; Sakakibara, Ken-Ichi; Akagi, Masato
20-Jan-2020	Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends	Peng, Zhichao; Li, Xingfeng; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato
22-Jan-2020	Effect of articulatory and acoustic features on the intelligibility of speech in noise: an articulatory synthesis study	Ngo, Thuanvan; Akagi, Masato; Birkholz, Peter
20-Feb-2020	Study on relationship between warmness of speech and valence, activation or dominance	Miyagawa, Natsumi; Akagi, Masato
20-Feb-2020	Influence of auditory feedback on uttering vowel speech in noisy environment	Nishigaki, Tomoya; Akagi, Masato
May-2020	Multitask Learning and Multistage Fusion for Dimensional Audiovisual Emotion Recognition	Atmaja, Bagus Tris; Akagi, Masato
1-May-2020	Mimicking Lombard Effect: An Analysis and Reconstruction	Ngo, Thuan Van; Kubo, Rieko; Akagi, Masato
25-May-2020	The Effect of Silence Feature in Dimensional Speech Emotion Recognition	Atmaja, Bagus Tris; Akagi, Masato
27-May-2020	Dimensional speech emotion recognition from speech features and word embeddings by using multitask learning	Atmaja, Bagus Tris; Akagi, Masato
1-Jul-2020	A Two-Stage Phase-Aware Approach for Monaural Multi-Talker Speech Separation	Yin, Lu; Li, Junfeng; Yan, Yonghong; Akagi, Masato
Oct-2020	Segment-level Effects of Gender, Nationality and Emotion Information on Text-independent Speaker Verification	Li, Kai; Akagi, Masato; Wu, Yibo; Dang, and Jianwu
Oct-2020	Comparison of glottal source parameter values in emotional vowels	Li, Yongwei; Tao, Jianhua; Liu, Bin; Erickson, Donna; Akagi, Masato
9-Oct-2020	Acoustic and articulatory analysis and synthesis of shouted vowels	Xue, Yawen; Marxen, Michael; Akagi, Masato; Birkholz, Peter
30-Oct-2020	Non-parallel Voice Conversion based on Hierarchical Latent Embedding Vector Quantized Variational Autoencoder	Ho, Tuan Vu; Akagi, Masato
1-Nov-2020	Continuous Audiovisual Emotion Recognition Using Feature Selection and LSTM	Elbarougy, Reda; Atmaja, Bagus Tris; Akagi, Masato
6-Nov-2020	Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information	Atmaja, Bagus Tris; Akagi, Masato
18-Nov-2020	On The Differences Between Song and Speech Emotion Recognition: Effect of Feature Sets, Feature Types, and Classifiers	Atmaja, Bagus Tris; Akagi, Masato
19-Nov-2020	Predicting Valence and Arousal by Aggregating Acoustic Features for Acoustic-Linguistic Information Fusion	Atmaja, Bagus Tris; Hamada, Yasuhiro; Akagi, Masato
19-Nov-2020	Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM	Atmaja, Bagus Tris; Akagi, Masato
2-Mar-2021	Cross-Lingual Voice Conversion With Controllable Speaker Individuality Using Variational Autoencoder and Star Generative Adversarial Network	Ho, Tuan Vu; Akagi, Masato
1-Oct-2021	Increasing speech intelligibility and naturalness in noise based on concepts of modulation spectrum and modulation transfer function	Ngo, Thuanvan; Kubo, Rieko; Akagi, Masato
15-Oct-2021	F_0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model	Li, Yongwei; Tao, Jianhua; Erickson, Donna; Liu, Bin; Akagi, Masato
Dec-2021	Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis	Li, Kai; Unoki, Masashi; Li, Yongwei; Dang, Jianwu; Akagi, Masato
Dec-2021	Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition	Li, Xingfeng; Guo, Taiyang; Hu, Xinhui; Xu, Xinkang; Dang, Jianwu; Akagi, Masato
Dec-2021	Automatic Naturalness Recognition from Acted Speech Using Neural Networks	Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato
6-Mar-2022	Acoustic features correlated to perceived urgency in evacuation announcements	Kobayashi, Maori; Hamada, Yasuhiro; Akagi, Masato
26-Mar-2022	Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion	Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato
7-Jul-2022	Speech Emotion and Naturalness Recognitions With Multitask and Single-Task Learnings	Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato
Sep-2022	Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection	Li, Kai; Li, Sheng; Lu, Xugang; Akagi, Masato; Liu, Meng; Zhang, Lin; Zeng, Chang; Wang, Longbiao; Dang, Jianwu; Unoki, Masashi
Sep-2022	Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion	Ho, Tuan Vu; Kobayashi, Maori; Akagi, Masato
Sep-2022	Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement	Ho, Tuan Vu; Nguyen, Quoc Huy; Akagi, Masato; Unoki, Masashi
26-Jun-2023	Music Theory-inspired Acoustic Representation for Speech Emotion Recognition	Li, Xingfeng; Shi, Xiaohan; Hu, Desheng; Li, Yongwei; Zhang, Qingchen; Wang, Zhengxia; Unoki, Masashi; Akagi, Masato
31-Oct-2023	Increasing Speech Intelligibility by Mimicking Professional Announcers’ Voices and Its Physical Correlates	Tran, Dung Kim; Akagi, Masato; Unoki, Masashi

お問合せ先 : 北陸先端科学技術大学院大学　研究推進課学術情報係 (ir-sys[at]ml.jaist.ac.jp)