首页> 外文会议>International workshop on semantic evaluation;Annual meeting of the Association for Computational Linguistics >QLUT at SemEval-2017 Task 1: Semantic Textual Similarity Based on Word Embeddings
【24h】

QLUT at SemEval-2017 Task 1: Semantic Textual Similarity Based on Word Embeddings

机译:QLUT在SemEval-2017任务1:基于单词嵌入的语义文本相似性

获取原文

摘要

This paper reports the details of our submissions in the task 1 of SemEval 2017. This task aims at assessing the semantic textual similarity of two sentences or texts. We submit three unsupervised systems based on word embeddings. The differences between these runs are the various preprocessing on evaluation data. The best performance of these systems on the evaluation of Pearson correlation is 0.6887. Unsurprisingly, results of our runs demonstrate that data preprocessing, such as to-kenization, lemmatization, extraction of content words and removing stop words, is helpful and plays a significant role in improving the performance of models.
机译:本文在SemEval 2017的任务1中报告了我们提交的文档的详细信息。该任务旨在评估两个句子或文本的语义文本相似性。我们基于词嵌入提交了三个无监督的系统。这些运行之间的差异是对评估数据进行的各种预处理。这些系统在评估Pearson相关性方面的最佳性能是0.6887。毫不奇怪,我们的运行结果表明,数据预处理(例如to-kenization,lemmatization,内容词的提取和删除停用词)是有用的,并且在改善模型性能方面起着重要作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号