JAIST Repository: A Model-Concept of the Selective Sound Segregation : A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments


Please use this identifier to cite or link to this item: https://hdl.handle.net/10119/4016

Title:	A Model-Concept of the Selective Sound Segregation : A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments
Authors:	Unoki, Masashi; Kubo, Masaaki; Haniu, Atsushi; Akagi, Masato
Keywords:	cocktail party effect; computational auditory scene analysis; selective sound segregation; musical instrument
Issue date:	2006
Publisher:	信号処理学会
Journal:	Journal of signal processing : 信号処理
Volume:	10
Issue:	6
Start page:	419
End page:	431
Abstract:	We propose a novel model-concept of selective sound segregation based on Auditory Scene Analysis and then describe the implementation of a prototype model for selectively segregating a target musical instrument sound from the mixed sound of various musical instruments. This model extends our previously proposed model for segregating two acoustic sources (Unoki and Akagi, Speech Communication, 27, 261-279, 1999). The extended model consists of two blocks: our previous model as bottom-up processing, and selective processing based on knowledge sources as top-down processing. The novel idea is to segregate a target sound from the mixed sound using top-down information, through interaction between bottom-up and top-down processing. To demonstrate the ability of the proposed model, we carried out three simulations: (i) segregation of the target sound from noisy sound (signal extraction); (ii) segregation of the target sound from four mixed sounds (concurrent separation); and (iii) segregation of the target performance sound from mixed sound (selective segregation). Simulation results showed that the proposed model could selectively segregate not only the target instrument sound, but also the target performance sound, from the mixed sound of various instruments; this is not possible when using only bottom-up or only top-down processing. The advantage provided by this model-concept led to significantly improved results. This model can be applied to selective speech-sound segregation, enabling its extension to computational modeling of the mechanisms of the human selective hearing system.
Rights:	信号処理学会, Masashi Unoki, Masaaki Kubo, Atsushi Haniu and Masato Akagi, Journal of signal processing : 信号処理, 10(6), 2006, 419-431.
Type:	Article
URI:	https://hdl.handle.net/10119/4016
Text version:	publisher
Appears in collections:	b10-1. Journal Articles

Files in this item:

File	Description	Size	Format
62-4.pdf		8301Kb	Adobe PDF

All items stored in this system are protected by copyright.
