【24h】

Exploiting Multiple Resources for Word-Phrase Semantic Similarity Evaluation

机译:利用多种资源进行词组语义相似度评估

获取原文

摘要

Previous researches on semantic similarity calculating have been mainly focused on documents, sentences or concepts. In this paper, we study the semantic similarity of words and compositional phrases. The task is to judge the semantic similarity of a word and a short sequence of words. Based on structured resource (WordNet), semi-structured resource (Wikipedia) and unstructured resource (Web), this paper extracts rich effective features to represent the word-phrase pair. The task can be treated as a binary classification problem and we employ Support Vector Machine to estimate whether the word and phrase is similar given a word-phrase pair. Experiments are conducted on SemEval 2013 Task5a. Our method achieves 82.9% in accuracy, and outperforms the best system (80.3%) that participates in the task. Experimental results demonstrate the effectiveness of our proposed approach.
机译:先前关于语义相似度计算的研究主要集中在文档,句子或概念上。在本文中,我们研究了单词和组成短语的语义相似性。任务是判断一个单词和一个简短单词序列的语义相似性。基于结构化资源(WordNet),半结构化资源(Wikipedia)和非结构化资源(Web),本文提取了丰富的有效特征来表示词组。可以将任务视为二进制分类问题,我们使用支持向量机来估计给定单词短语对的单词和短语是否相似。实验针对SemEval 2013 Task5a进行。我们的方法达到了82.9%的准确性,并且胜于参与任务的最佳系统(80.3%)。实验结果证明了我们提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号