JAIST Repository >
a. 知識科学研究科・知識科学系 >
a10. 学術雑誌論文等 >
a10-1. 雑誌掲載論文 >
このアイテムの引用には次の識別子を使用してください:
http://hdl.handle.net/10119/5001
|
タイトル: | Semi-supervised learning integrated with classifier combination for word sense disambiguation |
著者: | Le, Anh-Cuong Shimazu, Akira Huynh, Van-Nam Nguyen, Minh Le |
キーワード: | Semi-supervised Learning Word Sense Disambiguation Computational Linguistics |
発行日: | 2008-10 |
出版者: | Elsevier |
誌名: | Computer Speech & Language |
巻: | 22 |
号: | 4 |
開始ページ: | 330 |
終了ページ: | 345 |
DOI: | 10.1016/j.csl.2007.11.001 |
抄録: | Word sense disambiguation (WSD) is the problem of determining the right sense of a polysemous word in a certain context. This paper investigates the use of unlabeled data for WSD within a framework of semi-supervised learning, in which labeled data is iteratively extended from unlabeled data. Focusing on this approach, we first explicitly identify and analyze three problems inherently occurred piecemeal in the general bootstrapping algorithm; namely the imbalance of training data, the confidence of new labeled examples, and the final classifier generation; all of which will be considered integratedly within a common framework of bootstrapping. We then propose solutions for these problems with the help of classifier combination strategies. This results in several new variants of the general bootstrapping algorithm. Experiments conducted on the English lexical samples of Senseval-2 and Senseval-3 show that the proposed solutions are effective in comparison with previous studies, and significantly improve supervised WSD. |
Rights: | NOTICE: This is the author’s version of a work accepted for publication by Elsevier.
Changes resulting from the publishing process, including peer review, editing, corrections,
structural formatting and other quality control mechanisms, may not be reflected in this
document. Changes may have been made to this work since it was submitted for publication.
A definitive version was subsequently published in Anh-Cuong Le, Akira Shimazu, Van-Nam Huynh and Le-Minh Nguyen, Computer Speech & Language, 22(4), 2008, 330-345, http://dx.doi.org/10.1016/j.csl.2007.11.001 |
URI: | http://hdl.handle.net/10119/5001 |
資料タイプ: | author |
出現コレクション: | a10-1. 雑誌掲載論文 (Journal Articles)
|
このアイテムのファイル:
ファイル |
記述 |
サイズ | 形式 |
CLS-final.pdf | | 512Kb | Adobe PDF | 見る/開く |
|
当システムに保管されているアイテムはすべて著作権により保護されています。
|