首页> 外文期刊>Cybernetics and information technologies: CIT >All-words Word Sense Disambiguation for Russian Using Automatically Generated Text Collection
【24h】

All-words Word Sense Disambiguation for Russian Using Automatically Generated Text Collection

机译:俄罗斯自动生成的文本集合的全文词语感应歧义

获取原文
       

摘要

The limited amount of the sense annotated data is a big challenge for theword sense disambiguation task. As a solution to this problem, we propose analgorithm of automatic generation and labelling of the training collections based onthe monosemous relatives concept. In this article we explore the limits of thisalgorithm: we employ it to harvest training collections for all ambiguous nouns,verbs and adjectives presented in RuWordNet thesaurus and then evaluate the qualityof the obtained collections. We demonstrate that our approach can create high-quality labelled collections with almost full-coverage of the RuWordNet polysemouswords. Furthermore, we show that our method can be applied to the Word-in-Contexttask.
机译:有限的感觉注释数据是唱歌意义消费者任务的重要挑战。作为解决这个问题的解决方案,我们提出了基于单叙述概念的训练收集的自动生成和标记的安疗法。在本文中,我们探讨了本发明的限制:我们将其雇用它来收取ruwordnet词库中的所有暧昧名词,动词和形容词的训练收集,然后评估所获得的收藏品的质量。我们展示了我们的方法可以创造高质量的标记集合,几乎全面覆盖RuWordnet PolySemouswords。此外,我们表明我们的方法可以应用于InnettNtAsk。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号