首页> 外文会议>International conference on computational processing of portuguese >Comparing Semantic Relatedness between Word Pairs in Portuguese Using Wikipedia
【24h】

Comparing Semantic Relatedness between Word Pairs in Portuguese Using Wikipedia

机译:使用维基百科比较葡萄牙语中单词对之间的语义相关性

获取原文

摘要

The growth of available data in digital format has been facilitating the development of new models to automatically infer the semantic similarity between word pairs. However, there are still many natural languages without sufficient resources to evaluate measures of semantic relatedness. In this paper we translated word pairs from a well-known baseline for evaluating semantic relatedness measures into Portuguese and performed a manual evaluation of each pair. We compared the correlation with similar datasets in other languages and generated LSA models from Wikipedia articles in order to verify the pertinence of each dataset and how semantic similarity conveys across languages.
机译:数字格式中可用数据的增长一直在促进新模型的开发,以自动推断单词对之间的语义相似性。但是,仍然有许多自然语言没有足够的资源来评估语义相关性的度量。在本文中,我们将单词对从一个众所周知的基线进行翻译,以评估语义相关性,并将其翻译成葡萄牙语,并对每个单词对进行了手动评估。我们将相关性与其他语言中的相似数据集进行了比较,并从维基百科的文章中生成了LSA模型,以验证每个数据集的相关性以及语义相似性如何跨语言传达。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号