JAIST Repository >
b. 情報科学研究科・情報科学系 >
b30. リサーチレポート >
Research Report - School of Information Science : ISSN 0918-7553 >
IS-RR-2005 >
このアイテムの引用には次の識別子を使用してください:
http://hdl.handle.net/10119/8404
|
タイトル: | Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency |
著者: | Ishimoto, Yuichi Unoki, Masashi Akagi, Masato |
発行日: | 2005-03-28 |
出版者: | 北陸先端科学技術大学院大学情報科学研究科 |
誌名: | Research report (School of Information Science, Japan Advanced Institute of Science and Technology) |
巻: | IS-RR-2005-006 |
開始ページ: | 1 |
終了ページ: | 31 |
抄録: | This paper proposes a robust and accurate method of estimating the fundamental frequencies (F0s) for noisy speech. In general, it is difficult to directly estimate accurate F0s from noisy speech. This method combines two different methods of F0 estimation. One is based on the periodicity and harmonicity of instantaneous amplitude of speech; it is robust against noise, but it does not allow for accurate F0 estimation. The other is based on the stability of instantaneous frequency, and it enables accurate F0 estimation, but this method is not robust against noise. To combine these two methods, the proposed method makes use of noise reduction by using a comb filter with controllable pass-bands. Experiments were carried out to estimate F0s of real speech in noisy environments and to compare the proposed method with other methods such as an autocorrelation methods and a cepstrum method. The results showed that this method was more robust than the other methods. This method could estimate F0s of noisy speech with accuracy similar to that in clean speech F0 estimation by using only the stability of instaneous frequency. |
URI: | http://hdl.handle.net/10119/8404 |
資料タイプ: | publisher |
出現コレクション: | IS-RR-2005
|
このアイテムのファイル:
ファイル |
記述 |
サイズ | 形式 |
IS-RR-2005-006.pdf | | 710Kb | Adobe PDF | 見る/開く |
|
当システムに保管されているアイテムはすべて著作権により保護されています。
|