JAIST Repository >
School of Information Science >
Articles >
Journal Articles >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10119/18117
|
Title: | F_0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model |
Authors: | Li, Yongwei Tao, Jianhua Erickson, Donna Liu, Bin Akagi, Masato |
Keywords: | Glottal source vocal tract source-filter model ARX-LF model |
Issue Date: | 2021-10-15 |
Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
Magazine name: | IEEE/ACM Transactions on Audio, Speech, and Language Processing |
Volume: | 29 |
Start page: | 3375 |
End page: | 3383 |
DOI: | 10.1109/TASLP.2021.3120585 |
Abstract: | This paper proposes a robust automatic speech analysis method based on a source-filter model constructed of an Auto-Regressive eXogenous (ARX) model and the Liljencrants-Fant (LF) model. The proposed method estimates glottal source waveform and vocal tract shape parameters using an analysis-by-synthesis approach. Structurally, the first step is to initialize the glottal source parameters using the inverse filter method, and the second step is to simultaneously estimate the glottal source waveform and the vocal tract shape parameters using an analysis-by-synthesis approach with an iterative algorithm. The proposed method was verified on synthetic voices with different glottal noise (signal to noise ratio) from 0 dB to 50 dB and different fundamental frequency ( F_0 ) from 80 Hz to 320 Hz levels. The results show that the proposed method achieved a much higher estimation accuracy than that of the state-of-the-art inverse filtering methods on both different glottal noise and different F_0 levels. |
Rights: | This is the author's version of the work. Copyright (C) 2021 IEEE. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, 2021, pp.3375 - 3383. DOI: 10.1109/TASLP.2021.3120585. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
URI: | http://hdl.handle.net/10119/18117 |
Material Type: | author |
Appears in Collections: | b10-1. 雑誌掲載論文 (Journal Articles)
|
Files in This Item:
File |
Description |
Size | Format |
bare_jrnl_accept_v1.pdf | | 5614Kb | Adobe PDF | View/Open |
|
All items in DSpace are protected by copyright, with all rights reserved.
|