JAIST Repository >
b. 情報科学研究科・情報科学系 >
b10. 学術雑誌論文等 >
b10-1. 雑誌掲載論文 >

このアイテムの引用には次の識別子を使用してください: http://hdl.handle.net/10119/18464

タイトル: Music Theory-inspired Acoustic Representation for Speech Emotion Recognition
著者: Li, Xingfeng
Shi, Xiaohan
Hu, Desheng
Li, Yongwei
Zhang, Qingchen
Wang, Zhengxia
Unoki, Masashi
Akagi, Masato
キーワード: Affective computing
speech emotion recognition
acoustic representation
music theory and speech analysis
発行日: 2023-06-26
出版者: Institute of Electrical and Electronics Engineers (IEEE)
誌名: IEEE/ACM Transactions on Audio, Speech, and Language Processing
巻: 31
開始ページ: 2534
終了ページ: 2547
DOI: 10.1109/TASLP.2023.3289312
抄録: This research presents a music theory-inspired acoustic representation (hereafter, MTAR) to address improved speech emotion recognition. The recognition of emotion in speech and music is developed in parallel, yet a relatively limited understanding of MTAR for interpreting speech emotions is involved. In the present study, we use music theory to study representative acoustics associated with emotion in speech from vocal emotion expressions and auditory emotion perception domains. In experiments assessing the role and effectiveness of the proposed representation in classifying discrete emotion categories and predicting continuous emotion dimensions, it shows promising performance compared with extensively used features for emotion recognition based on the spectrogram, Melspectrogram, Mel-frequency cepstral coefficients, VGGish, and the large baseline feature sets of the INTERSPEECH challenges. This proposal opens up a novel research avenue in developing a computational acoustic representation of speech emotion via music theory.
Rights: This is the author's version of the work. Copyright (C) 2023 IEEE. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 2023, pp. 2534-2547. DOI: 10.1109/TASLP.2023.3289312. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
URI: http://hdl.handle.net/10119/18464
資料タイプ: author
出現コレクション:b10-1. 雑誌掲載論文 (Journal Articles)

このアイテムのファイル:

ファイル 記述 サイズ形式
M-AKAGI-I-0710.pdf6684KbAdobe PDF見る/開く

当システムに保管されているアイテムはすべて著作権により保護されています。

 


お問い合わせ先 : 北陸先端科学技術大学院大学 研究推進課図書館情報係