Semi-Supervised Training of a Kernel PCA-Based Model for Word Sense Disambiguation

Abstract

In this paper, we introduce a new semi-supervised learning model for word sense disambiguation based on Kernel Principal Component Analysis (KPCA), with experiments showing that it can further improve accuracy over supervised KPCA models that have achieved WSD accuracy superior to the best published individual models. Although empirical results with supervised KPCA models demonstrate significantly better accuracy compared to the state-of-the-art achieved by either naive Bayes or maximum entropy models on Senseval-2 data, we identify specific sparse data conditions under which supervised KPCA models deteriorate to essentially a most-frequent-sense predictor. We discuss the potential of KPCA for leveraging unannotated data for partially-unsupervised training to address these issues, leading to a composite model that combines both the supervised and semi-supervised models.


Bibliographic Information

Source
20th International Conference on Computational Linguistics, vol. 2 | 2004 | pp. 1298-1304 | 7 pages
Conference location: Geneva (CH)
Authors
Weifeng Su; Marine Carpuat; Dekai Wu
Author affiliation

Human Language Technology Center, HKUST, Department of Computer Science, University of Science and Technology, Clear Water Bay, Hong Kong

Original format: PDF
Language: English
CLC classification: Programming languages, algorithmic languages

