首页> 外文会议>International workshop on semantic evaluation;Annual meeting of the Association for Computational Linguistics >DT_Team at SemEval-2017 Task 1: Semantic Similarity Using Alignments, Sentence-Level Embeddings and Gaussian Mixture Model Output
【24h】

DT_Team at SemEval-2017 Task 1: Semantic Similarity Using Alignments, Sentence-Level Embeddings and Gaussian Mixture Model Output

机译:DT_Team在SemEval-2017上的任务1:使用对齐,句子级嵌入和高斯混合模型输出的语义相似度

获取原文

摘要

We describe our system (DT_Team) submitted at SemEval-2017 Task 1, Semantic Textual Similarity (STS) challenge for English (Track 5). We developed three different models with various features including similarity scores calculated using word and chunk alignments, word/sentence em-beddings, and Gaussian Mixture Model (GMM). The correlation between our system's output and the human judgments were up to 0.8536, which is more than 10% above baseline, and almost as good as the best performing system which was at 0.8547 correlation (the difference is just about 0.1%). Also, our system produced leading results when evaluated with a separate STS benchmark dataset. The word alignment and sentence embeddings based features were found to be very effective.
机译:我们描述了在SemEval-2017任务1(英语)(语义5)语义文本相似性(STS)挑战中提交的系统(DT_Team)。我们开发了三种具有各种功能的模型,包括使用单词和单词对齐方式计算出的相似性评分,单词/句子嵌入和高斯混合模型(GMM)。我们系统的输出与人工判断之间的相关性高达0.8536,比基线高出10%以上,几乎与性能最佳的系统(0.8547的相关性)相差无几(相差约0.1%)。同样,当使用单独的STS基准数据集进行评估时,我们的系统也产生了领先的结果。发现基于单词对齐和句子嵌入的功能非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号