首页> 外文会议>China National Conference on Computational Linguistics >Exploiting Multiple Resources for Word-Phrase Semantic Similarity Evaluation
【24h】

Exploiting Multiple Resources for Word-Phrase Semantic Similarity Evaluation

机译:利用单词语义相似性评估的多个资源

获取原文

摘要

Previous researches on semantic similarity calculating have been mainly focused on documents, sentences or concepts. In this paper, we study the semantic similarity of words and compositional phrases. The task is to judge the semantic similarity of a word and a short sequence of words. Based on structured resource (WordNet), semi-structured resource (Wikipedia) and unstructured resource (Web), this paper extracts rich effective features to represent the word-phrase pair. The task can be treated as a binary classification problem and we employ Support Vector Machine to estimate whether the word and phrase is similar given a word-phrase pair. Experiments are conducted on SemEval 2013 Task5a. Our method achieves 82.9% in accuracy, and outperforms the best system (80.3%) that participates in the task. Experimental results demonstrate the effectiveness of our proposed approach.
机译:以前关于语义相似性计算的研究主要集中在文件,句子或概念上。在本文中,我们研究了单词和组成短语的语义相似性。任务是判断单词的语义相似性和短序列的单词。基于结构化资源(WordNet),半结构化资源(维基百科)和非结构化资源(Web),本文提取丰富的有效功能来表示单词组合对。任务可以被视为二进制分类问题,我们使用支持向量机来估计单词和短语是否类似于单词短语对。实验是在Semeval 2013 Task5a上进行的。我们的方法的准确性达到82.9%,并且优于参与任务的最佳系统(80.3%)。实验结果表明了我们提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号