首页> 外文会议>International Conference on Advances in Semantic Processing >Word Sense Disambiguation Using Active Learning with Pseudo Examples
【24h】

Word Sense Disambiguation Using Active Learning with Pseudo Examples

机译:使用主动学习与伪示例的词感歧义

获取原文

摘要

In recent years, there have been attempts to apply active learning for Word sense disambiguation (WSD). This active learning technique selects the most informative unlabeled examples that were most difficult to disambiguate. The most commonly addressed problem has been the extraction of relevant information, where the system constructs a better classification model to identify the appropriate sense of the target word. Previous research reported that it is effective to create negative examples artificially (i.e., pseudo negative examples). However, this method works only for words that appear in a small number of topics (e.g., technical terms) because the evaluation set is strongly biased. For common noun or verb words, it is hard to apply this system so that problems still remain in the active learning with pseudo negative examples for WSD. In this paper, to solve this problem, we propose a novel WSD system based on active learning with pseudo examples for any words. This proposed method is to learn WSD models constructed from training corpus by adding pseudo examples during the active learning process. To evaluate the effectiveness of the proposed method, we perform some experiments to compare it with the result of the previous methods. The results of the experiments show that the proposed method achieves the highest precision of all systems and can extract more effective pseudo examples for WSD.
机译:近年来,有试图为词语感消解(WSD)应用主动学习。这种有源学习技术选择最难以消除的最具信息性的未标记示例。最常见的问题是相关信息的提取,其中系统构造更好的分类模型,以识别目标词的适当感。以前的研究报告说,它是有效的,人为地创造否定例子(即,伪消极的例子)。但是,此方法仅适用于出现在少量主题(例如,技术术语)中出现的单词,因为评估集强烈偏置。对于常见的名词或动词词语,很难应用这个系统,以便在WSD的伪负例中仍然存在问题。在本文中,为了解决这个问题,我们提出了一种基于主动学习的新型WSD系统,以伪示例用于任何单词。该提出的方法是通过在主动学习过程中添加伪示例来学习由训练语料库构成的WSD模型。为了评估所提出的方法的有效性,我们执行一些实验,以将其与先前方法的结果进行比较。实验结果表明,该方法达到了所有系统的最高精度,并可以提取更有效的WSD伪实施例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号