208 著者名表示.
発行日 | タイトル |
著者 |
1995 | Speaker individualities in speech spectral envelopes | Kitamura, Tatsuya; Akagi, Masato |
1997 | Speaker individuality in fundamental frequency contours and its control | Akagi, Masato; Ienaga, Taro |
Mar-1997 | 雑音が付加された波形からの信号波形の一抽出法 | 鵜木, 祐史; 赤木, 正人; UNOKI, Masashi; AKAGI, Masato |
6-Feb-1998 | A computational model of co-modulation masking release | Unoki, Masashi; Akagi, Masato |
6-Feb-1998 | A method of signal extraction from noisy signal based on auditory scene analysis | Unoki, Masashi; Akagi, Masato |
Apr-1999 | A method of signal extraction from noisy signal based on auditory scene analysis | Unoki, Masashi; Akagi, Masato |
20-Apr-1999 | マイクロホン対を用いたスペクトルサブトラクションによる雑音除去法 | 水町, 光徳; 赤木, 正人; MIZUMACHI, Mitsunori; AKAGI, Masato |
20-Oct-1999 | 聴覚の情景解析に基づいた雑音下の調波複合音の一抽出法 | 鵜木, 祐史; 赤木, 正人; UNOKI, Masashi; AKAGI, Masato |
2000 | The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises | Mizumachi, Mitsunori; Akagi, Masato |
Jul-2000 | A computational model of auditory sound localization based on ITD | Ito, Kazuhito; Akagi, Masato |
1-Jul-2000 | 蝸牛神経核細胞の機能モデルの提案 : 前腹側核細胞の応答特性 | 牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru |
25-Dec-2000 | 2.聴覚モデルの系譜 : 聴覚分野(〈特集〉-音響学における20世紀の成果と21世紀に残された課題-) | 赤木, 正人; Akagi, Masato |
2001 | Computational Models of Auditory Function : A computational model of auditory sound localization | Ito, Kazuhito; Akagi, Masato |
2001 | Computational Models of Auditory Function : A computational model of co-modulation masking release | Unoki, Masashi; Akagi, Masato |
2002 | Enabling Society With Information Technology : Speech enhancement and segregation based on human auditory mechanisms | Akagi, Masato; Mizumachi, Mitsunori; Ishimoto, Yuichi; Unoki, Masashi |
25-Dec-2002 | 蝸牛神経核腹側核細胞モデルの振幅変調音に対する応答特性 | Amplitude modulation; 牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru |
25-Dec-2002 | 初期聴覚系における神経発火の時間-周波数応答パタン(<小特集>末梢聴覚機能解析の動向) | 牧, 勝弘; 伊藤, 一仁; 赤木, 正人; Maki, Katuhiro; Ito, Kazuhito; Akagi, Masato |
1-Mar-2003 | Modified Restricted Temporal Decomposition and Its Application to Low Rate Speech Coding | NGUYEN, Phu Chien; OCHI, Takao; AKAGI, Masato |
25-Dec-2003 | 蝸牛神経核背側核細胞の周波数応答特性に関する神経回路モデルの提案 : トーンバースト刺激に対する応答 | 牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru |
2004 | A speech dereverberation method based on the MTF concept in power envelope restoration | Unoki, Masashi; Sakata, Keigo; Furukawa, Masakazu; Akagi, Masato |
2004 | An improved method based on the MTF concept for restoring the power envelope from a reverberant signal | Unoki, Masashi; Furukawa, Masakazu; Sakata, Keigo; Akagi, Masato |
1-Jan-2004 | Fundamental Frequency Estimation for Noisy Speech Using Entropy-Weighted Periodic and Harmonic Features | ISHIMOTO, Yuichi; ISHIZUKA, Kentaro; AIKAWA, Kiyoaki; AKAGI, Masato |
1-Jun-2004 | 下丘細胞の時間応答特性に関する計算モデルの提案 | 牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru |
2005 | Toward a rule-based synthesis of emotional speech on linguistic description of perception | Huang, Chun-Fang; Akagi, Masato |
2005 | Study on improving regularity of neural phase locking in single neurons of AVCN via a computational model | Ito, Kazuhito; Akagi, Masato |
2005 | A computational model of cochlear nucleus neurons | Maki, Katuhiro; Akagi, Masato |
28-Mar-2005 | Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency | Ishimoto, Yuichi; Unoki, Masashi; Akagi, Masato |
Jul-2005 | Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis | Saitou, Takeshi; Unoki, Masashi; Akagi, Masato |
2006 | A Model-Concept of the Selective Sound Segregation : A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments | Unoki, Masashi; Kubo, Masaaki; Haniu, Atsushi; Akagi, Masato |
2006 | Multi-channel noise reduction in noisy environments | Li, Junfeng; Akagi, Masato; Suzuki, Yoiti |
2006 | A Study on Restoration of Bone-Conducted Speech with MTF-Based and LP-Based Models | Thang, Tat Vu; Kimura, Kenji; Unoki, Masashi; Akagi, Masato |
Feb-2006 | A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments | Li, Junfeng; Akagi, Masato |
1-Apr-2006 | 有限要素法による声道伝達特性推定の有効性に関する検討 | 西本, 博則; 赤木, 正人; 北村, 達也; 鈴木, 規子; Nishimoto, Hironori; Akagi, Masato; Kitamura, Tatsuya; Suzuki, Noriko |
Jul-2006 | Effect of ITD and component frequencies on perception of alarm signals in noisy environments | Nakanishi, Josaku; Unoki, Masashi; Akagi, Masato |
Jul-2006 | Effects of complicated vocal tract shape on vocal tract transfer function | Nishimoto, Hironori; Akagi, Masato |
1-Jul-2006 | Noise reduction method based on generalized subtractive beamformer | Li, Junfeng; Akagi, Masato |
2007 | Advances for In-Vehicle and Mobile Systems : Noise reduction based on microphone array and post-filtering for robust speech recognition in car environments | Li, Junfeng; Lu, Xugang; Akagi, Masato |
Apr-2007 | Limited error based event localizing temporal decomposition and its application to variable-rate speech coding | Nguyen, Phu Chien; Akagi, Masato; Nguyen, Binh Phu |
Jul-2007 | Spectral Modification for Voice Gender Conversion using Temporal Decomposition | Nguyen, Binh Phu; Akagi, Masato |
Oct-2007 | Speech-to-Singing Synthesis: Converting Speaking Voices to Singing Voices By Controlling Acoustic Features Unique to Singing Voices, | Saitou, Takeshi; Goto, Masataka; Unoki, Masashi; Akagi, Masato |
Oct-2007 | Improvement of Detectability of Alarm Signal in Noisy Environments by Utilizing Spatial Cues | Uchiyama, Hideaki; Unoki, Masashi; Akagi, Masato |
5-Oct-2007 | LP-based method of blind restoration to improve intelligibility of bone-conducted speech | Thang, Tat Vu; Unoki, Masashi; Akagi, Masato |
2008 | Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems | Lu, Xugang; Unoki, Masashi; Akagi, Masato |
1-May-2008 | 歌声らしさの知覚モデルに基づいた歌声特有の音響特徴量の分析 | 齋藤, 毅; 辻, 直也; 鵜木, 祐史; 赤木, 正人; Saitou, Takeshi; Tsuji, Naoya; Unoki, Masashi; Akagi, Masato |
Jun-2008 | An LP-based blind model for restoring bone-conducted speech | Vu, Thang tat; Unoki, Masashi; Akagi, Masato |
Jun-2008 | Phoneme-based Spectral Voice Conversion Using Temporal Decomposition and Gaussian Mixture Model | Nguyen, Binh Phu; Akagi, Masato |
Jun-2008 | A hybrid microphone array post-filter in a diffuse noise field | Li, Junfeng; Akagi, Masato |
1-Jun-2008 | A Two-Microphone Noise Reduction Method in Highly Non-stationary Multiple-Noise-Source Environments | LI, Junfeng; AKAGI, Masato; SUZUKI, Yoiti |
Jul-2008 | Estimation of local peaks based on particle filter in adverse environments | Tomoike, Seiji; Akagi, Masato |
23-Sep-2008 | Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimization | Li, Junfeng; Jiang, Hui; Akagi, Masato |
24-Sep-2008 | High-quality analysis/synthesis method based on Temporal decomposition for speech modification | Nguyen, Binh Phu; Shibata, Takeshi; Akagi, Masato |
24-Sep-2008 | Robust front end processing for speech recognition in reverberant environments: Utilization of speech characteristics | Petrick, Rico; Lu, Xugang; Unoki, Masashi; Akagi, Masato; Hoffmann, Ruediger |
Oct-2008 | A three-layered model for expressive speech perception | Huang, Chun-Fang; Akagi, Masato |
Nov-2008 | Adaptive β-order generalized spectral subtraction for speech enhancement | Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yoiti |
25-Dec-2008 | アジアの音 | 赤木, 正人; Akagi, Masato |
2009 | 聴覚末梢系の機能モデルの提案-聴神経の位相固定性及びスパイク生成機構のモデル化- | 牧, 勝弘; 赤木, 正人; 廣田, 薫; Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru |
2009 | A flexible spectral modification method based on temporal decomposition and Gaussian mixture model | Nguyen, Binh Phu; Akagi, Masato |
1-Mar-2009 | A study on nonlinguistic features in singing and speaking voices by brain activity measurement | Nakamura, Tomohiko; Kitamura, Tatsuya; Akagi, Masato |
1-Mar-2009 | An emotional speech recognition system based on multi-layer emotional speech perception model | Aoki, Yuusuke; Huang, Chun-Fang; Akagi, Masato |
1-Mar-2009 | An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech | Kinugasa, Kota; Unoki, Masashi; Akagi, Masato |
1-Mar-2009 | Effects from Spatial Cues on Detectability of Alarm Signals in Car Environments | Kuroda, Naoki; Li, Junfeng; Iwaya, Yukio; Unoki, Masashi; Akagi, Masato |
Apr-2009 | Psychoacoustically-motivated adaptive β-order generalized spectral subtraction for cochlear implant patients | Li, Junfeng; Fu, Qian-Jie; Jiang, Hui; Akagi, Masato |
Jul-2009 | An MTF-based method of blind restoration for improving intelligibility of bone-conducted speech | Kinugasa, Kota; Unoki, Masashi; Akagi, Masato |
25-Aug-2009 | MTF-based power envelope restoration in noisy reverberant environments | Unoki, Masashi; Yamasaki, Yutaka; Akagi, Masato |
8-Sep-2009 | 感情音声知覚モデルの提案とその応用 | 赤木, 正人; AKAGI, Masato |
9-Sep-2009 | Efficient modeling of temporal structure of speech for applications in voice transformation | Nguyen, Binh Phu; Akagi, Masato |
Oct-2009 | Two-stage binaural speech enhancement with Wiener filter based on equalization-cancellation model | Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yoiti |
2010 | Comparison of Emotion Perception among Different Cultures | Dang, Jianwu; Li, Aijun; Erickson, Donna; Suemitsu, Atsuo; Akagi, Masato; Sakuraba, Kyoko; Minematsu, Nobuaki; Hirose, Keikichi |
Mar-2010 | 赤木研究室(北陸先端科学技術大学院大学) | 赤木, 正人; Akagi, Masato |
4-Mar-2010 | A study on brain activities elicited by synthesized emotional voices controlled with prosodic features | Hamada, Yasuhiro; Kitamura, Tatsuya; Akagi, Masato |
4-Mar-2010 | A study on the IMTF-based filtering for the modulation spectrum of reverberant speech | Morita, Shota; Unoki, Masashi; Akagi, Masato |
4-Mar-2010 | Experimental evaluations of TS-BASE/WF in reverberant conditions | Li, Junfeng; Sasaki, Yuuki; Akagi, Masato; Yan, Yonghong |
4-Mar-2010 | Pitch perception of complex sounds with varied fundamental frequency and spectral tilt | Ishida, Mai; Akagi, Masato |
2-Jun-2010 | Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication | Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yôiti |
Jul-2010 | A Study on the IMTF-Based Filtering on the Modulation Spectrum of Reverberant Signal | Morita, Shota; Unoki, Masashi; Akagi, Masato |
1-Aug-2010 | 音声に含まれる感情情報の認識 : 感情空間をどのように表現するか | 赤木, 正人; Akagi, Masato |
30-Sep-2010 | A DOA estimation algorithm based on equalization-cancellation theory | Chau, Duc Thanh; Li, Junfeng; Akagi, Masato |
1-Oct-2010 | A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features | ZHOU, Yu; LI, Junfeng; SUN, Yanqing; ZHANG, Jianping; YAN, Yonghong; AKAGI, Masato |
Nov-2010 | Intelligibility Investigation of Single-Channel Noise Reduction Algorithms for Chinese and Japanese | Li, Junfeng; Yang, Lin; Yan, Yonghong; Thanh, Chau Duc; Akagi, Masato |
2011 | An investigation on perceptual line spectral frequency (PLP-LSF) target stability against the vowel neutralization phenomenon | Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato |
Feb-2011 | An investigation on speech perception over coarticulation | Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato |
1-Mar-2011 | Study on suitable-architecture of IIR all-pass filter for digital-audio watermarking technique based on cochlear-delay characteristics | KOSUGI, Toshizo; HANIU, Atsushi; MIYAUCHI, Ryota; UNOKI, Masashi; AKAGI, Masato |
2-Mar-2011 | 音声の知覚と認識 : 人は脳で音声を聞く.機械は? | 赤木, 正人; 羽二生, 篤; AKAGI, Masato; HANIU, Atsushi |
2-Mar-2011 | A binaural model accounting for spatial masking release | Mizukawa, Shinya; Akagi, Masato |
2-Mar-2011 | Study on blind estimation of Speech Transmission Index in room acoustics | Ikeda, Tomohiro; Unoki, Masashi; Akagi, Masato |
2-Mar-2011 | Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals | Yano, Yuta; Miyauchi, Ryota; Unoki, Masashi; Akagi, Masato |
2-Mar-2011 | Study on MTF-based power envelope restoration in noisy reverberant environments | Morita, Shota; Lu, Xugang; Unoki, Masashi; Akagi, Masato |
3-Mar-2011 | Influences of transformed auditory feedback with first three formant frequencies | Shih, Tsungming; Suemitsu, Atsuo; Akagi, Masato |
3-Mar-2011 | Towards an intelligent binaural speech enhancement system by integrating meaningful signal extraction | Chau, Duc Thanh; Li, Junfeng; Akagi, Masato |
1-Apr-2011 | 聴覚フィードバック下での音声知覚・生成の同時脳活動計測に関する研究 | 赤木, 正人; Akagi, Masato |
10-May-2011 | Comparative intelligibility investigation of single-channel noise-reduction algorithms for Chinese, Japanese, and English | Li, Junfeng; Yang, Lin; Zhang, Jianping; Yan, Yonghong; Hu, Yi; Akagi, Masato; C. Loizou, Philipos |
Jul-2011 | Towards intelligent binaural speech enhancement by meaningful sound extraction | Chau, Duc Thanh; Li, Junfeng; Akagi, Masato |
5-Mar-2012 | Study on hearing impression of speaker identification focusing on dynamic features | Izumida, Tsuyoshi; Akagi, Masato |
5-Mar-2012 | Speech enhancement technique in noisy reverberant environment using two microphone arrays | Sasaki, Yuuki; Akagi, Masato |
6-Mar-2012 | Study on detectability of signals by utilizing differences in their amplitude modulation | Yano, Yuta; Miyauchi, Ryota; Unoki, Masashi; Akagi, Masato |
22-Aug-2012 | Privacy protection for speech based on concepts of auditory scene analysis | AKAGI, Masato; IRIE, Yoshihiro |
Sep-2012 | A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model | Phung, Nghia Trung; Unoki, Masashi; Akagi, Masato |
Dec-2012 | Speech Emotion Recognition System Based on a Dimensional Approach Using a Three-Layered Model | Elbarougy, Reda; Akagi, Masato |
Dec-2012 | A concatenative speech synthesis for monosyllabic languages with limited data | Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato |
Dec-2012 | Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages | Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato |
Mar-2013 | A singing voices synthesis system to characterize vocal registers using ARX-LF model | Motoda, Hiroki; Akagi, Masato |
Mar-2013 | A Study on individualization of Head-Related Transfer Function in the median plane | Hisatsune, Hideki; Akagi, Masato |
15-May-2013 | 音声中の感情認識のための新しい認識方略に関する研究 | 赤木, 正人; Akagi, Masato |
2-Jun-2013 | Exploring auditory aging can exclusively explain Japanese adults′ age-related decrease in training effects of American English /r/-/l/ | Kubo, Rieko; Akagi, Masato |
8-Jul-2013 | Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio | Chau, Duc Thanh; Li, Junfeng; Akagi, Masato |
27-Aug-2013 | Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese | Li, Junfeng; Chen, Fei; Akagi, Masato; Yan, Yonghong |
Sep-2013 | Acoustic sound source tracking for a moving object using precise Doppler-shift measurement | Nishie, Suminori; Akagi, Masato |
2-Sep-2013 | A Hybrid TTS between Unit Selection and HMM-based TTS under limited data conditions | Phung, Trung-Nghia; Luong, Chi Mai; Akagi, Masato |
Oct-2013 | Admissible range for individualization of head-related transfer function in median plane | Akagi, Masato; Hisatsune, Hideki |
Oct-2013 | Cross-lingual Speech Emotion Recognition System Based on a Three-Layer Model for Human Perception | Elbarougy, Reda; Akagi, Masato |
1-Nov-2013 | Improving Naturalness of HMM-Based TTS Trained with Limited Data by Temporal Decomposition | PHUNG, Trung-Nghia; PHAN, Thanh-Son; VU, Thang Tat; LUONG, Mai Chi; AKAGI, Masato |
2014 | Speech recognition in noisy conditions based on speech separation using Non-negative Matrix Factorization | Du, Yuxuan; Akagi, Masato |
2014 | Study on Analyzing Individuality of Instrurment Sounds Using Non-negative Matrix Factorization | Kobayashi, Keisuke; Morikawa, Daisuke; Akagi, Masato |
2014 | Glottal source analysis of emotional speech | Li, Yongwei; Akagi, Masato |
2014 | Improving speech emotion dimensions estimation using a three-layer model of human perception | Elbarougy, Reda; Akagi, Masato |
2014 | Investigation of objective measures for intelligibility prediction of noise-reduced speech for Chinese, Japanese, and English | Li, Junfeng; Xia, Risheng; Ying, Dongwen; Yan, Yonghong; Akagi, Masato |
1-Apr-2014 | 音情景解析の概念にもとづいた音声プライバシー保護 | 赤木, 正人; 入江, 佳洋; Akagi, Masato; Irie, Yoshihiro |
1-Apr-2014 | 弦楽器F0 推定のための精密周波数測定方法 | 西江, 純教; 赤木, 正人; Nishie, Suminori; Akagi, Masato |
Jul-2014 | Toward relaying emotional state for speech-to-speech translator: Estimation of emotional state for synthesizing speech with emotion | Akagi, Masato; Elbarougy, Reda |
Aug-2014 | Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System | Akagi, Masato; Han, Xiao; Elbarougy, Reda; Hamada, Yasuhiro; Li, Junfeng |
Sep-2014 | Toward relaying an affective Speech-to-Speech translator: Cross-language perception of emotional state represented by emotion dimensions | Elbarougy, Reda; Xiao, Han; Akagi, Masato; Li, Junfeng |
1-Oct-2014 | Binaural Sound Source Localization in Noisy Reverberant Environments Based on Equalization-Cancellation Theory | Chau, Thanh-Duc; Li, Junfeng; Akagi, Masato |
Feb-2015 | A study on perception of emotional states in multiple languages on Valence-Activation approach | Han, Xiao; Elbarougy, Reda; Akagi, Masato; Li, Junfeng; Ngo, Thi Duyen; Bui, The Duy |
1-Sep-2015 | Dependence on age of interference with phoneme perception by first- and second-language speech maskers | Kubo, Rieko; Akagi, Masato; Akahane-Yamada, Reiko |
28-Oct-2015 | Toward Improving Estimation Accuracy of Emotion Dimensions in Bilingual Scenario Based on Three-layered Model | LI, Xingfeng; Akagi, Masato |
Dec-2015 | Study on method to control fundamental frequency contour related to a position on Valence-Activation space | Hamada, Yasuhiro; Elbarougy, Reda; Xue, Yuawn; Akagi, Masato |
19-Dec-2015 | Emotional speech synthesis system based on a three-layered model using a dimensional approach | Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato |
2016 | Voice Conversion to Emotional Speech based on Three-layered Model in Dimensional Approach and Parameterization of Dynamic Features in Prosody | Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato |
Mar-2016 | A study on quality improvement of HMM-based synthesized voices using asymmetric bilinear model | Dinh-Anh, Tuan; Morikawa, Daisuke; Akagi, Masato |
Mar-2016 | Automatic Speech Emotion Recognition in Chinese Using a Three-layered Model in Dimensional Approach | Li, Xingfeng; Akagi, Masato |
Mar-2016 | A study on applying target prediction model to parameterize power envelope of emotional speech | Xue, Yawen; Akagi, Masato |
21-Aug-2016 | Effects of speaker's and listener's acoustic environments on speech intelligibility and annoyance | Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato |
Oct-2016 | Voice conversion system to emotional speech in multiple languages based on three-layered model for dimensional space | Xue, Yawen; Hamada, Yasuhiro; Elbarougy, Reda; Akagi, Masato |
Oct-2016 | Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using non-negative matrix factorization | Dinh, Anh-Tuan; Akagi, Masato |
18-Oct-2016 | Optimizing Fuzzy Inference Systems for Improving Speech Emotion Recognition | Elbarougy, Reda; Akagi, Masato |
12-Nov-2016 | Quality Improvement of Vietnamese HMM-Based Speech Synthesis System Based on Decomposition of Naturalness and Intelligibility using Non-negative Matrix Factorization | Dinh, Anh-Tuan; Phan, Thanh-Son; Akagi, Masato |
2017 | Acoustical Analyses of Tendencies of Intelligibility in Lombard Speech with Different Background Noise Levels | Ngo, Thuan Van; Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato |
2-Mar-2017 | Acoustical analyses of Lombard speech by different background noise levels for tendencies of intelligibility | Ngo, Thuan Van; Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato |
2-Mar-2017 | Articulatory Characteristics of Expressive Speech in Activation-Evaluation Space | Asai, Takuya; Suemitsu, Atsuo; Akagi, Masato |
1-Jun-2017 | ヒト発話シミュレータによるStory Teller Systemの構築 | 赤木, 正人; Akagi, Masato |
26-Oct-2017 | Weighted Robust Principal Component Analysis with Gammatone Auditory Filterbank for Singing Voice Separation | Li, Feng; Akagi, Masato |
1-Nov-2017 | Feature Selection Method for Real-time Speech Emotion Recognition | Elbarougy, Reda; Akagi, Masato |
15-Dec-2017 | Speech Emotion Recognition Using Multichannel Parallel Convolutional Recurrent Neural Networks based on Gammatone Auditory Filterbank | Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato |
2018 | Unsupervised Singing Voice Separation Based on Robust Principal Component Analysis Exploiting Rank-1 Constraint | Li, Feng; Akagi, Masato |
2018 | A Three-Layer Emotion Perception Model for Valence and Arousal-Based Detection from Multilingual Speech | Li, Xingfeng; Akagi, Masato |
5-Mar-2018 | Perceptual grouping with prosodic features in Japanese dialects | Zhang, Ling; Akagi, Masato |
6-Mar-2018 | Study on differences between perceptions of Japanese and Chinese emotional speech by Japanese and Chinese listeners | Zhang, Chenyi; Akagi, Masato |
7-Mar-2018 | Non-parallel training dictionary-based voice conversion with Variational Autoencoder | Vu, Ho-Tuan; Akagi, Masato |
7-Mar-2018 | Synthesis of expressive singing voice by F0, amplitude envelope and spectral feature conversion | Nguyen, Thi-Hao; Akagi, Masato |
7-Mar-2018 | Estimation of glottal source waveform and vocal tract shape for singing-voice analysis | Takahashi, Kyoko; Akagi, Masato |
19-Jul-2018 | Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space | Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato |
25-Jul-2018 | Nonparallel Dictionary-Based Voice Conversion Using Variational Autoencoder with Modulation-Spectrum-Constrained Training | Ho, Tuan Vu; Akagi, Masato |
26-Jul-2018 | Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation | Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato |
22-Aug-2018 | Contributions of the glottal source and vocal tract cues to emotional vowel perception in the valence-arousal space | Li, Yongwei; Li, Junfeng; Akagi, Masato |
11-Sep-2018 | Commonalities of Glottal Sources and Vocal Tract Shapes Among Speakers in Emotional Speech | Li, Yongwei; Sakakibara, Ken-Ichi; Morikawa, Daisuke; Akagi, Masato |
15-Nov-2018 | Maximal Information Coefficient and Predominant Correlation-Based Feature Selection Toward A Three-Layer Model for Speech Emotion Recognition | Li, Xingfeng; Akagi, Masato |
15-Nov-2018 | Estimation of glottal source waveforms and vocal tract shape for singing voices with wide frequency range | Takahashi, Kyoko; Akagi, Masato |
15-Nov-2018 | Unsupervised Singing Voice Separation Using Gammatone Auditory Filterbank and Constraint Robust Principal Component Analysis | Li, Feng; Akagi, Masato |
2019 | The Contribution of Acoustic Features Analysis to Model Emotion Perceptual Process for Language Diversity | Li, Xingfeng; Akagi, Masato |
6-Mar-2019 | Study on Perception of Speaker Age by Semantic Differential Method | Li, Yang; Kobayashi, Maori; Akagi, Masato |
6-Mar-2019 | Variation of Formant Amplitude and Frequencies in Vowel Spectrum uttered under Various Noisy Environments | Matsumoto, Shumpei; Akagi, Masato |
6-Mar-2019 | Study on Relationship between Degree of Emphasis and Acoustic Feature for Synthesizing Emphasized Speech | Ohtani, Yasuhiro; Akagi, Masato |
6-Mar-2019 | Relationship between discomfort sound and its physical correlates | Takahashi, Yumiko; Akagi, Masato |
7-Mar-2019 | Study on Nonlinear Relationships between Semantic Primitives and Emotional Dimensions for Improving Three-layered Model | Liu, Xingyu; Elbarougy, Reda Elsaid; Akagi, Masato |
7-Mar-2019 | Study on Relations between Emotion Perception and Acoustic Features using Speech Morphing Techniques | Wang, Zi; Kobayashi, Maori; Akagi, Masato |
3-Apr-2019 | Improving multilingual speech emotion recognition by combining acoustic features in a three-layer model | Li, Xingfeng; Akagi, Masato |
17-Apr-2019 | Blind Monaural Singing Voice Separation Using Rank-1 Constraint Robust Principal Component Analysis and Vocal Activity Detection | Li, Feng; Akagi, Masato |
6-May-2019 | Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model | Li, Yongwei; Sakakibara, Ken-Ichi; Akagi, Masato |
16-Jul-2019 | Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model | Atmaja, Bagus Tris; Akagi, Masato |
19-Nov-2019 | Non-parallel Voice Conversion with Controllable Speaker Individuality using Variational Autoencoder | Ho, Tuan Vu; Akagi, Masato |
19-Nov-2019 | Speech Emotion Recognition Using Speech Feature and Word Embedding | Atmaja, Bagus Tris; Shirai, Kiyoaki; Akagi, Masato |
19-Nov-2019 | Evaluation of the Lombard Effect Model on Synthesizing Lombard Speech in Varying Noise Level Environments with Limited Data | Ngo, Thuan Van; Kubo, Rieko; Akagi, Masato |
19-Nov-2019 | Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Network | Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato |
20-Nov-2019 | Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking | Li, Feng; Qian, Kaizhi; Hasegawa-Johnson, Mark; Akagi, Masato |
14-Dec-2019 | Combining F0 and non-negative constraint robust principal component analysis for singing voice separation | Li, Feng; Akagi, Masato |
23-Dec-2019 | Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model | Li, Yongwei; Sakakibara, Ken-Ichi; Akagi, Masato |
20-Jan-2020 | Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends | Peng, Zhichao; Li, Xingfeng; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato |
22-Jan-2020 | Effect of articulatory and acoustic features on the intelligibility of speech in noise: an articulatory synthesis study | Ngo, Thuanvan; Akagi, Masato; Birkholz, Peter |
20-Feb-2020 | Study on relationship between warmness of speech and valence, activation or dominance | Miyagawa, Natsumi; Akagi, Masato |
20-Feb-2020 | Influence of auditory feedback on uttering vowel speech in noisy environment | Nishigaki, Tomoya; Akagi, Masato |
May-2020 | Multitask Learning and Multistage Fusion for Dimensional Audiovisual Emotion Recognition | Atmaja, Bagus Tris; Akagi, Masato |
1-May-2020 | Mimicking Lombard Effect: An Analysis and Reconstruction | Ngo, Thuan Van; Kubo, Rieko; Akagi, Masato |
25-May-2020 | The Effect of Silence Feature in Dimensional Speech Emotion Recognition | Atmaja, Bagus Tris; Akagi, Masato |
27-May-2020 | Dimensional speech emotion recognition from speech features and word embeddings by using multitask learning | Atmaja, Bagus Tris; Akagi, Masato |
1-Jul-2020 | A Two-Stage Phase-Aware Approach for Monaural Multi-Talker Speech Separation | Yin, Lu; Li, Junfeng; Yan, Yonghong; Akagi, Masato |
Oct-2020 | Segment-level Effects of Gender, Nationality and Emotion Information on Text-independent Speaker Verification | Li, Kai; Akagi, Masato; Wu, Yibo; Dang, and Jianwu |
Oct-2020 | Comparison of glottal source parameter values in emotional vowels | Li, Yongwei; Tao, Jianhua; Liu, Bin; Erickson, Donna; Akagi, Masato |
9-Oct-2020 | Acoustic and articulatory analysis and synthesis of shouted vowels | Xue, Yawen; Marxen, Michael; Akagi, Masato; Birkholz, Peter |
30-Oct-2020 | Non-parallel Voice Conversion based on Hierarchical Latent Embedding Vector Quantized Variational Autoencoder | Ho, Tuan Vu; Akagi, Masato |
1-Nov-2020 | Continuous Audiovisual Emotion Recognition Using Feature Selection and LSTM | Elbarougy, Reda; Atmaja, Bagus Tris; Akagi, Masato |
6-Nov-2020 | Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information | Atmaja, Bagus Tris; Akagi, Masato |
18-Nov-2020 | On The Differences Between Song and Speech Emotion Recognition: Effect of Feature Sets, Feature Types, and Classifiers | Atmaja, Bagus Tris; Akagi, Masato |
19-Nov-2020 | Predicting Valence and Arousal by Aggregating Acoustic Features for Acoustic-Linguistic Information Fusion | Atmaja, Bagus Tris; Hamada, Yasuhiro; Akagi, Masato |
19-Nov-2020 | Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM | Atmaja, Bagus Tris; Akagi, Masato |
2-Mar-2021 | Cross-Lingual Voice Conversion With Controllable Speaker Individuality Using Variational Autoencoder and Star Generative Adversarial Network | Ho, Tuan Vu; Akagi, Masato |
1-Oct-2021 | Increasing speech intelligibility and naturalness in noise based on concepts of modulation spectrum and modulation transfer function | Ngo, Thuanvan; Kubo, Rieko; Akagi, Masato |
15-Oct-2021 | F_0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model | Li, Yongwei; Tao, Jianhua; Erickson, Donna; Liu, Bin; Akagi, Masato |
Dec-2021 | Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition | Li, Xingfeng; Guo, Taiyang; Hu, Xinhui; Xu, Xinkang; Dang, Jianwu; Akagi, Masato |
Dec-2021 | Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis | Li, Kai; Unoki, Masashi; Li, Yongwei; Dang, Jianwu; Akagi, Masato |
Dec-2021 | Automatic Naturalness Recognition from Acted Speech Using Neural Networks | Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato |
6-Mar-2022 | Acoustic features correlated to perceived urgency in evacuation announcements | Kobayashi, Maori; Hamada, Yasuhiro; Akagi, Masato |
26-Mar-2022 | Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion | Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato |
7-Jul-2022 | Speech Emotion and Naturalness Recognitions With Multitask and Single-Task Learnings | Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato |
Sep-2022 | Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion | Ho, Tuan Vu; Kobayashi, Maori; Akagi, Masato |
Sep-2022 | Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection | Li, Kai; Li, Sheng; Lu, Xugang; Akagi, Masato; Liu, Meng; Zhang, Lin; Zeng, Chang; Wang, Longbiao; Dang, Jianwu; Unoki, Masashi |
Sep-2022 | Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement | Ho, Tuan Vu; Nguyen, Quoc Huy; Akagi, Masato; Unoki, Masashi |
26-Jun-2023 | Music Theory-inspired Acoustic Representation for Speech Emotion Recognition | Li, Xingfeng; Shi, Xiaohan; Hu, Desheng; Li, Yongwei; Zhang, Qingchen; Wang, Zhengxia; Unoki, Masashi; Akagi, Masato |
31-Oct-2023 | Increasing Speech Intelligibility by Mimicking Professional Announcers’ Voices and Its Physical Correlates | Tran, Dung Kim; Akagi, Masato; Unoki, Masashi |