DeepPurple: Estimating Sentence Semantic Similarity using N-gram Regression Models and Web Snippets


Abstract

We estimate the semantic similarity between two sentences using regression models with three features: 1) n-gram hit rates (lexical matches) between sentences, 2) lexical semantic similarity between non-matching words, and 3) sentence length. Lexical semantic similarity is computed from co-occurrence counts on a corpus harvested from the web, using a modified mutual information metric. State-of-the-art results are obtained for semantic similarity computation at the word level; however, the fusion of this information at the sentence level provides only moderate improvement on Task 6 of SemEval'12. Despite the simple features used, regression models provide good performance, especially for shorter sentences, reaching a correlation of 0.62 on the SemEval test set.
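
To make the first and third feature types concrete, here is a minimal Python sketch of how n-gram hit rates and a sentence-length feature might be computed for a sentence pair. The abstract does not give the exact definitions, so everything here is an assumption for illustration: the function names, whitespace tokenization, the choice of clipped match counts, and the average-length feature are hypothetical. The lexical semantic similarity feature is omitted, since it depends on web-harvested co-occurrence counts.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the contiguous n-grams (as tuples) of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def hit_rate(tokens_a, tokens_b, n):
    """Fraction of n-grams in sentence A that also occur in sentence B.

    Assumed definition: a repeated n-gram in A is matched at most as many
    times as it appears in B (clipped counts).
    """
    grams_a = Counter(ngrams(tokens_a, n))
    grams_b = Counter(ngrams(tokens_b, n))
    total = sum(grams_a.values())
    if total == 0:
        # Sentence A is shorter than n tokens, so there is nothing to match.
        return 0.0
    hits = sum(min(count, grams_b[gram]) for gram, count in grams_a.items())
    return hits / total


def features(sentence_a, sentence_b, max_n=3):
    """Build a feature vector: hit rates for n = 1..max_n, then mean length."""
    tokens_a = sentence_a.lower().split()
    tokens_b = sentence_b.lower().split()
    rates = [hit_rate(tokens_a, tokens_b, n) for n in range(1, max_n + 1)]
    mean_length = (len(tokens_a) + len(tokens_b)) / 2
    return rates + [mean_length]
```

A vector such as `features("the cat sat on the mat", "the cat lay on the mat")` could then serve as one input row to a regression model trained against human similarity ratings.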


Bibliographic details

Source
Joint Conference on Lexical and Computational Semantics | 2012 | pp. 565-570 | 6 pages
Authors
Nikos Malandrakis; Elias Iosif; Alexandros Potamianos
Original format: PDF

Similar literature


1. Word n-gram attention models for sentence similarity and inference [J]. Lopez-Gazpio I., Maritxalar M., Lapata M. Expert Systems with Applications, 2019, issue OCTa

2. Word n-gram attention models for sentence similarity and inference [J]. Lopez-Gazpio I., Maritxalar M., Lapata M. Expert Systems with Applications, 2019, issue Octa

3. Measuring semantic similarity between words by removing noise and redundancy in web snippets [J]. Zheng Xu, Xiangfeng Luo, Jie Yu. Concurrency, practice and experience, 2011, issue 18

4. DeepPurple: Estimating Sentence Semantic Similarity using N-gram Regression Models and Web Snippets [C]. Nikos Malandrakis, Elias Iosif, Alexandros Potamianos. SEM 2012, 2012

5. Semantic similarity computation on the web of data [D]. Zheng, Jin Guang. 2014

6. Neural sentence embedding models for semantic similarity estimation in the biomedical domain [O]. Kathrin Blagec, Hong Xu, Asan Agibetov. 2019

7. Predicting Semantic Similarity Between Clinical Sentence Pairs Using Transformer Models: Evaluation and Representational Analysis [O]. Mark Ormerod, Jesús Martínez del Rincón, Barry Devereux. 2021

