|
JAIST Repository >
b. 情報科学研究科・情報科学系 >
b10. 学術雑誌論文等 >
b10-1. 雑誌掲載論文 >
このアイテムの引用には次の識別子を使用してください:
http://hdl.handle.net/10119/18464
|
タイトル: | Music Theory-inspired Acoustic Representation for Speech Emotion Recognition |
著者: | Li, Xingfeng Shi, Xiaohan Hu, Desheng Li, Yongwei Zhang, Qingchen Wang, Zhengxia Unoki, Masashi Akagi, Masato |
キーワード: | Affective computing speech emotion recognition acoustic representation music theory and speech analysis |
発行日: | 2023-06-26 |
出版者: | Institute of Electrical and Electronics Engineers (IEEE) |
誌名: | IEEE/ACM Transactions on Audio, Speech, and Language Processing |
巻: | 31 |
開始ページ: | 2534 |
終了ページ: | 2547 |
DOI: | 10.1109/TASLP.2023.3289312 |
抄録: | This research presents a music theory-inspired acoustic representation (hereafter, MTAR) to address improved speech emotion recognition. The recognition of emotion in speech and music is developed in parallel, yet a relatively limited understanding of MTAR for interpreting speech emotions is involved. In the present study, we use music theory to study representative acoustics associated with emotion in speech from vocal emotion expressions and auditory emotion perception domains. In experiments assessing the role and effectiveness of the proposed representation in classifying discrete emotion categories and predicting continuous emotion dimensions, it shows promising performance compared with extensively used features for emotion recognition based on the spectrogram, Melspectrogram, Mel-frequency cepstral coefficients, VGGish, and the large baseline feature sets of the INTERSPEECH challenges. This proposal opens up a novel research avenue in developing a computational acoustic representation of speech emotion via music theory. |
Rights: | This is the author's version of the work. Copyright (C) 2023 IEEE. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 2023, pp. 2534-2547. DOI: 10.1109/TASLP.2023.3289312. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
URI: | http://hdl.handle.net/10119/18464 |
資料タイプ: | author |
出現コレクション: | b10-1. 雑誌掲載論文 (Journal Articles)
|
このアイテムのファイル:
ファイル |
記述 |
サイズ | 形式 |
M-AKAGI-I-0710.pdf | | 6684Kb | Adobe PDF | 見る/開く |
|
当システムに保管されているアイテムはすべて著作権により保護されています。
|