首页> 外文会议>International conference on applications of natural language to information systems >A Morphological Approach for Measuring Pair-Wise Semantic Similarity of Sanskrit Sentences
【24h】

A Morphological Approach for Measuring Pair-Wise Semantic Similarity of Sanskrit Sentences

机译:一种测量梵文句对语义相似度的形态学方法

获取原文

摘要

Capturing explicit and implicit similarity between texts in natural language is a critical task in Computational Linguistics applications. Similarity can be multi-level (word, sentence, paragraph or document level), each of which can affect the similarity computation differently. Most existing techniques are ill-suited for classical languages like Sanskrit as it is significantly richer in morphology than English. In this paper, we present a morphological analysis based approach for computing semantic similarity between short Sanskrit texts. Our technique considers the constituent words' semantic properties and their role in individual sentences within the text, to compute similarity. As all words do not contribute equally to the semantics of a sentence, an adaptive scoring algorithm is used for ranking, which performed very well for Sanskrit sentence pairs of varied complexities.
机译:在计算语言学应用程序中,获取自然语言文本之间的显式和隐式相似性是一项关键任务。相似度可以是多个级别(单词,句子,段落或文档级),每个级别可以不同地影响相似度计算。现有的大多数技术都不适合像梵语这样的古典语言,因为它的形态学比英语要丰富得多。在本文中,我们提出了一种基于形态学分析的方法来计算短梵文之间的语义相似度。我们的技术考虑了构成词的语义属性及其在文本中各个句子中的作用,以计算相似度。由于所有单词对句子语义的贡献均不相同,因此使用了自适应评分算法进行排名,该算法在复杂度各异的梵文句子对中表现非常出色。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号