首页> 外文期刊>Computer speech and language >Semi-supervised learning integrated with classifier combination for word sense disambiguation
【24h】

Semi-supervised learning integrated with classifier combination for word sense disambiguation

机译:半监督学习与分类器组合相结合,消除词义歧义

获取原文
获取原文并翻译 | 示例
       

摘要

Word sense disambiguation (WSD) is the problem of determining the right sense of a polysemous word in a certain context. This paper investigates the use of unlabeled data for WSD within a framework of semi-supervised learning, in which labeled data is iteratively extended from unlabeled data. Focusing on this approach, we first explicitly identify and analyze three problems inherently occurred piecemeal in the general bootstrapping algorithm; namely the imbalance of training data, the confidence of new labeled examples, and the final classifier generation; all of which will be considered integratedly within a common framework of bootstrapping. We then propose solutions for these problems with the help of classifier combination strategies. This results in several new variants of the general bootstrapping algorithm. Experiments conducted on the English lexical samples of Senseval-2 and Senseval-3 show that the proposed solutions are effective in comparison with previous studies, and significantly improve supervised WSD.
机译:词义消歧(WSD)是在特定上下文中确定多义词的正确意义的问题。本文在半监督学习的框架内研究了WSD的未标记数据的使用,其中,已标记数据从未标记数据迭代地扩展。着眼于这种方法,我们首先明确地识别和分析一般自举算法中固有发生的三个问题。即训练数据的不平衡,新标记示例的置信度以及最终分类器的生成;所有这些都将在自举的通用框架内综合考虑。然后,我们借助分类器组合策略为这些问题提出解决方案。这导致了常规自举算法的几个新变体。对Senseval-2和Senseval-3的英语词汇样本进行的实验表明,与以前的研究相比,提出的解决方案是有效的,并且可以显着改善监督的WSD。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号