首页> 外文会议>Joint conference on lexical and computational semantics >Towards Building a Multilingual Semantic Network: Identifying Interlingual Links in Wikipedia
【24h】

Towards Building a Multilingual Semantic Network: Identifying Interlingual Links in Wikipedia

机译:迈向建立多语言语义网:在Wikipedia中识别语言间链接

获取原文

摘要

Wikipedia is a Web based, freely available multilingual encyclopedia, constructed in a collaborative effort by thousands of contributors. Wikipedia articles on the same topic in different languages are connected via interlingual (or translational) links. These links serve as an excellent resource for obtaining lexical translations, or building multilingual dictionaries and semantic networks. As these links are manually built, many links are missing or simply wrong. This paper describes a supervised learning method for generating new links and detecting existing incorrect links. Since there is no dataset available to evaluate the resulting interlingual links, we create our own gold standard by sampling translational links from four language pairs using distance heuristics. We manually annotate the sampled translation links and used them to evaluate the output of our method for automatic link detection and correction.
机译:Wikipedia是基于Web的,免费的多语言百科全书,由数千名贡献者共同努力构建。维基百科上有关同一主题的不同语言的文章通过语际(或翻译)链接进行连接。这些链接是获取词汇翻译或建立多语言词典和语义网络的绝佳资源。由于这些链接是手动构建的,因此许多链接丢失或完全错误。本文介绍了一种监督式学习方法,用于生成新链接和检测现有的不正确链接。由于没有可用的数据集来评估由此产生的语言间链接,因此我们通过使用距离启发法对来自四种语言对的翻译链接进行采样,从而创建了自己的黄金标准。我们手动注释采样的翻译链接,并使用它们评估我们用于自动链接检测和更正的方法的输出。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号