Conference: International Conference on Big Data Analytics
Spectral Learning of Semantic Units in a Sentence Pair to Evaluate Semantic Textual Similarity

Abstract

Semantic Textual Similarity (STS) measures the degree of semantic equivalence between two snippets of text and applies to a variety of Natural Language Processing (NLP) tasks. Because STS is used across many fields, there is constant demand for new methods and for improvements to existing ones. Many unsupervised and supervised systems have been proposed in this field, but they are limited in how well they scale. The limits come either from complex, non-linear supervised learning models or from unsupervised learning models that rely on a lexical database for word alignment. The model proposed here takes a spectral learning-based approach that is linear, scale-invariant, scalable, and fairly simple. The work finds semantic similarity by identifying semantic components in both sentences that maximize the correlation between the two sentences of a pair. We introduce an approach based on Canonical Correlation Analysis (CCA), using cosine similarity and Word Mover's Distance (WMD) as the metrics for the final calculation. The model performs on par with sophisticated supervised techniques such as LSTM and BiLSTM and adds a layer of semantic components that can contribute to other NLP tasks.
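As a rough illustration of the CCA-plus-cosine idea, the sketch below is one possible reading of the abstract, not the authors' implementation. It assumes each sentence is given as a matrix of word embeddings (one row per word), treats every embedding dimension as an observation and every word as a variable, and scores the pair by the mean cosine similarity of the paired canonical components. The function names `cca` and `sentence_similarity`, the ridge term `reg`, and the random stand-in embeddings are all hypothetical choices made for this example; the WMD variant is not shown.

```python
import numpy as np


def cca(X, Y, reg=1e-3):
    """Regularized CCA for X (n, p) and Y (n, q).

    Returns projection matrices A (p, k), B (q, k) and the k canonical
    correlations, where k = min(p, q).
    """
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    n = X.shape[0]
    # Covariance blocks; the ridge term keeps the inverses well defined.
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)

    def inv_sqrt(S):
        # Inverse square root of a symmetric positive definite matrix.
        w, V = np.linalg.eigh(S)
        return V @ np.diag(w ** -0.5) @ V.T

    Kx, Ky = inv_sqrt(Sxx), inv_sqrt(Syy)
    # Singular values of the whitened cross-covariance are the correlations.
    U, s, Vt = np.linalg.svd(Kx @ Sxy @ Ky, full_matrices=False)
    return Kx @ U, Ky @ Vt.T, s


def sentence_similarity(E1, E2, reg=1e-3):
    """Score two sentences given word-embedding matrices E1 (n1, d), E2 (n2, d).

    Assumption: embedding dimensions act as observations, words as variables.
    """
    X, Y = E1.T, E2.T  # shapes (d, n1) and (d, n2)
    A, B, corrs = cca(X, Y, reg=reg)
    # Canonical components: correlated linear combinations of the words.
    U = (X - X.mean(axis=0)) @ A
    V = (Y - Y.mean(axis=0)) @ B
    cos = np.sum(U * V, axis=0) / (
        np.linalg.norm(U, axis=0) * np.linalg.norm(V, axis=0)
    )
    return float(cos.mean()), corrs


# Usage with random stand-in embeddings (real word vectors would replace these).
rng = np.random.default_rng(0)
sent_a = rng.normal(size=(4, 50))  # 4 words, 50-dimensional vectors
sent_b = rng.normal(size=(6, 50))  # 6 words
score, corrs = sentence_similarity(sent_a, sent_b)
print(score, corrs)
```

Every step here is a linear-algebra operation (two eigendecompositions and one SVD on matrices whose size depends only on sentence length), which is in keeping with the linearity and scalability the abstract emphasizes. How the components are aggregated into a final score is a design choice of this sketch.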