Please use this identifier to cite or link to this item:
http://hdl.handle.net/10119/11510
Title: | A singing voices synthesis system to characterize vocal registers using ARX-LF model |
Authors: | Motoda, Hiroki; Akagi, Masato |
Issue Date: | 2013-03 |
Publisher: | 2013 International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'13) |
Publication Title: | 2013 International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'13) |
Start Page: | 93 |
End Page: | 96 |
Abstract: | This paper proposes a singing voice synthesis system that synthesizes singing voices having the characteristics of vocal registers such as vocal fry, modal, and falsetto. Humans can sing naturally over a wide frequency range by learning to use vocal fold vibrations to produce vocal registers. However, even state-of-the-art singing voice synthesis systems cannot produce vocal registers appropriately, and the naturalness of their synthesized singing voices is reduced in the low and high frequency ranges. One method for improving naturalness is to add the characteristics of the glottal source for each vocal register. In this paper, the ARX-LF model, which can formulate the glottal source for each vocal register by simulating human voice production mechanisms, was applied. A model for controlling the ARX-LF parameters corresponding to the characteristics of the glottal sources was constructed, and acoustic features corresponding to the naturalness of the singing voice were added. Singing voice data for each vocal register were analyzed with the ARX-LF model, and ARX-LF parameter values corresponding to the glottal source of each vocal register were obtained. The control model was constructed using the results of this analysis. Singing voices were synthesized with the control model, and the quality of the synthesized voices was evaluated. As a result, almost the same impressions were obtained from the synthesized singing voices as from actual singing voices in each vocal register. The results demonstrate the effectiveness of the proposed system for synthesizing singing voices that characterize vocal registers. |
Rights: | This material is posted here with permission of the Research Institute of Signal Processing Japan. Hiroki Motoda and Masato Akagi, 2013 International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'13), 2013, pp.93-96. |
URI: | http://hdl.handle.net/10119/11510 |
Type: | publisher |
Appears in Collections: | b11-1. Conference Papers and Presentation Materials (Conference Papers) |
Files in This Item:
File | Description | Size | Format |
NCSP2013_Motoda.pdf | | 1524 kB | Adobe PDF |
All items stored in this system are protected by copyright.