首页> 外文会议>Conference on computational natural language learning >One Million Sense-Tagged Instances for Word Sense Disambiguation and Induction
【24h】

One Million Sense-Tagged Instances for Word Sense Disambiguation and Induction

机译:一百万个带有感官标记的实例,可消除和消除感官歧义

获取原文

摘要

Supervised word sense disambiguation (WSD) systems are usually the best performing systems when evaluated on standard benchmarks. However, these systems need annotated training data to function properly. While there are some publicly available open source WSD systems, very few large annotated datasets are available to the research community. The two main goals of this paper are to extract and annotate a large number of samples and release them for public use, and also to evaluate this dataset against some word sense disambiguation and induction tasks. We show that the open source IMS WSD system trained on our dataset achieves state-of-the-art results in standard disambiguation tasks and a recent word sense induction task, outperforming several task submissions and strong baselines.
机译:在标准基准上进行评估时,监督词义消歧(WSD)系统通常是性能最好的系统。但是,这些系统需要带注释的训练数据才能正常运行。尽管有一些公开可用的开源WSD系统,但是研究团体却很少使用大型的带注释的大型数据集。本文的两个主要目标是提取和注释大量样本并发布给公众使用,并针对某些词义消歧和归纳任务评估该数据集。我们证明,在我们的数据集上训练的开源IMS WSD系统在标准消歧任务和最近的词义归纳任务中取得了最先进的结果,胜过了多个任务提交和强大的基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号