Sentence Similarity Based on Semantic Vector Model

机译：基于语义向量模型的句子相似度

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sentence similarity measures play an increasingly important role in text-related research and applications in areas such as text mining, Web page retrieval, and dialogue systems. Existing methods for computing sentence similarity have been adopted from approaches used for long text documents. These methods process sentences in a very high-dimensional space and are consequently inefficient, require human input, and are not adaptable to some application domains. This paper focuses directly on computing the similarity between very short texts of sentence length. It presents an algorithm that takes account of semantic information, structure information and word order information implied in the sentences. The semantic similarity of two sentences is calculated using information from a structured lexical database, How-net. The use of a lexical database enables our method to model human common sense knowledge. The proposed method can be used in a variety of applications that involve text knowledge representation and discovery. Experiments on two sets of selected sentence pairs demonstrate that the proposed method provides a similarity measure that shows higher accuracy than other methods.

机译：句子相似度度量在诸如文本挖掘，网页检索和对话系统等领域中与文本相关的研究和应用中扮演着越来越重要的角色。已经从用于长文本文档的方法中采用了用于计算句子相似度的现有方法。这些方法在非常高的空间中处理句子，因此效率低下，需要人工输入，并且不适用于某些应用程序领域。本文直接关注于计算句子长度非常短的文本之间的相似度。它提出了一种算法，该算法考虑了句子中暗含的语义信息，结构信息和单词顺序信息。两个句子的语义相似性是使用来自结构化词汇数据库How-net的信息来计算的。词汇数据库的使用使我们的方法能够对人类常识知识进行建模。所提出的方法可以用于涉及文本知识表示和发现的各种应用中。在两组选定的句子对上进行的实验表明，所提出的方法提供了一种相似性度量，该相似性度量显示出比其他方法更高的准确性。

著录项

来源
《International Conference on P2P, Parallel, Grid, Cloud and Internet Computing》|2014年|499-503|共5页
会议地点
作者
Zhao Jingling; Zhang Huiyun; Cui Baojiang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
natural language processing; semantic networks; text analysis; vectors; lexical database; natural language processing; semantic vector model; sentence similarity measure; text document processing; text knowledge representation; Accuracy; Dictionaries; Educational institutions; Joints; Semantics; Speech; Vectors; semantic vector; sentence similarity; word order similarity; word similarity;

机译：自然语言处理语义网络文本分析向量词汇数据库自然语言处理语义向量模型句子相似度文本文档处理文本知识表示准确性词典教育机构语义语音向量语义向量;句子相似度;词序相似度;词相似度;

相似文献

外文文献
中文文献
专利

1. A SENTENCE SEMANTIC SIMILARITY CALCULATING METHOD BASED ON SEGMENTED SEMANTIC COMPARISON [J] . YUNTONG LIU, YANJUN LIANG Journal of Theoretical and Applied Information Technology . 2013,第1期

机译：基于分段语义比较的句子语义相似度计算方法
2. Predicting Semantic Similarity Between Clinical Sentence Pairs Using Transformer Models: Evaluation and Representational Analysis [J] . Mark Ormerod, Jesús Martínez del Rincón, Barry Devereux JMIR Medical Informatics . 2021,第5期

机译：使用变压器模型预测临床句子对之间的语义相似性：评估和代表性分析
3. Sentence modeling via multiple word embeddings and multi-level comparison for semantic textual similarity [J] . Nguyen Huy Tien, Nguyen Minh Le, Tomohiro Yamasaki, Information Processing & Management . 2019,第6期

机译：通过多个词嵌入和多级比较进行句子建模，以实现语义文本相似性
4. Sentence Similarity Based on Semantic Vector Model [C] . Zhao Jingling, Zhang Huiyun, Cui Baojiang International Conference on P2P, Parallel, Grid, Cloud and Internet Computing . 2014

机译：基于语义矢量模型的句子相似度
5. Using semantic similarity measures in the biomedical domain for computing functional similarity between genes based on gene ontology [D] . Khabiri, Elham 2007

机译：在生物医学领域中使用语义相似性度量基于基因本体计算基因之间的功能相似性
6. Neural sentence embedding models for semantic similarity estimation in the biomedical domain [O] . Kathrin Blagec, Hong Xu, Asan Agibetov, 2019

机译：神经句子嵌入模型在生物医学领域的语义相似度估计
7. LIM-LIG at SemEval-2017 Task1: Enhancing the Semantic Similarity for Arabic Sentences with Vectors Weighting [O] . Billah Nagoudi, El Moatez, Ferrero, Jérémy, Schwab, Didier 2017

机译：LIM-LIG在SemEval-2017上的任务1：通过向量加权增强阿拉伯句子的语义相似性

Sentence Similarity Based on Semantic Vector Model

摘要

著录项

相似文献

相关主题

期刊订阅