Please use this identifier to cite or link to this item:
http://hdl.handle.net/10119/15770
Title: | Study on Relationship between Degree of Emphasis and Acoustic Feature for Synthesizing Emphasized Speech |
Authors: | Ohtani, Yasuhiro; Akagi, Masato |
Issue Date: | 2019-03-06 |
Publisher: | Research Institute of Signal Processing, Japan |
Publication Title: | 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019) |
Start Page: | 256 |
End Page: | 259 |
Abstract: | Humans can perceive not only the presence or absence of emphasis but also the degree of emphasis in actual emphasized speech. However, they cannot fully do so with synthesized speech. This paper focuses on two properties of fundamental frequency (F0) contours, the amount of decay from the accent nucleus and the variation between accent nuclei, and hypothesizes that these two properties are important for synthesizing emphasized speech. To examine this hypothesis, the paper clarifies the relationship between the degree of emphasis and F0 contours. Doing so required comparing this relationship across stimuli, which in turn required knowing the degree of emphasis of each stimulus and analyzing the variation of its F0 contour. A listening test was carried out to obtain the degree of emphasis of each stimulus. To analyze the variation of the F0 contour, the frequency at the barycentric point of the vowel was extracted from the F0 contour. These results yielded two findings: the degree of emphasis increases as the amount of decay from the accent nucleus to the next mora increases, and the variation of accent nuclei differs between speech with and without emphasis. An experiment was then carried out to evaluate the hypothesis, using stimuli synthesized from a non-emphasized voice by varying the amount of decay and the variation of accent nuclei. The results showed that the participants could perceive emphasis, including its degree, from the synthesized stimuli. This result clarified the relationship between the presence or absence of emphasis and the two findings. In addition, the two hypothesized properties are important for synthesizing emphasized speech that conveys the presence or absence of emphasis. |
Rights: | Copyright (C) 2019 Research Institute of Signal Processing, Japan. Yasuhiro Ohtani and Masato Akagi, 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019), 2019, 256-259. |
URI: | http://hdl.handle.net/10119/15770 |
Type: | publisher |
Appears in Collections: | b11-1. Conference Papers and Presentation Materials |
Files in This Item:
File | Description | Size | Format |
2921.pdf | | 1140 KB | Adobe PDF |
All items stored in this system are protected by copyright.