4th Workshop on Cognitive Aspects of the Lexicon

Exploring the Use of Word Embeddings and Random Walks on Wikipedia for the CogAlex Shared Task


Abstract

In our participation in the task, we wanted to test three different kinds of relatedness algorithms: one based on embeddings induced from corpora, another based on random walks on WordNet, and a third based on random walks on Wikipedia. All three perform similarly on noun relatedness datasets such as WordSim353, close to the highest reported values. Although the task definition gave examples of nouns, the training and test data were based on the Edinburgh Association Thesaurus, and around 50% of the target words were not nouns. The corpus-based algorithm performed much better than the other methods on the training dataset, and was thus submitted for the test.

Bibliographic record

  • Conference location: Dublin (IE)
  • Author affiliations

    IXA NLP Group, University of the Basque Country, Basque Country;

    IXA NLP Group, University of the Basque Country, Basque Country;

    IXA NLP Group, University of the Basque Country, Basque Country;

  • Original format: PDF
  • Language: English

  • Indexed: 2022-08-26 14:23:23
