IEEE International Conference on Software Engineering and Service Science

Learning Sentence Embeddings Based on Weighted Contexts from Unlabelled Data

Abstract

Supervised and unsupervised learning are the mainstream approaches to semantic textual similarity tasks. However, supervised learning requires substantial labeled data, which is hard to obtain in practice. Motivated by the scarcity of annotated data and the success of unsupervised word embeddings across many tasks, we instead construct sentence embeddings from unlabelled data. We present a simple but efficient unsupervised method, inspired by the attention mechanism, in which weighted contexts are added to word2vec-style models to train distributed sentence representations. Our method outperforms state-of-the-art unsupervised models on semantic textual similarity tasks.
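The abstract only outlines the approach, so the sketch below illustrates the general idea of a weighted-context sentence embedding rather than the paper's actual training procedure: each word vector contributes to the sentence vector with an attention-style weight. The toy word vectors, the frequency-based scoring, the smoothing constant a, and the helper names (softmax, sentence_embedding) are assumptions made for illustration; the paper trains such weights jointly with a word2vec-style objective.

# Minimal illustrative sketch (not the paper's implementation): a sentence embedding
# built as a weighted average of word vectors, with attention-style weights.
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - np.max(x))
    return e / e.sum()

def sentence_embedding(tokens, word_vectors, word_freq, a=1e-3):
    """Weighted average of word vectors for one sentence.

    tokens       : list of words in the sentence
    word_vectors : dict word -> np.ndarray of shape (d,)
    word_freq    : dict word -> unigram probability (stand-in for a learned context score)
    a            : smoothing constant controlling how strongly rare words are up-weighted
    """
    vecs, scores = [], []
    for w in tokens:
        if w in word_vectors:
            vecs.append(word_vectors[w])
            # Rare words get higher scores, mimicking attention to informative context words.
            scores.append(a / (a + word_freq.get(w, a)))
    if not vecs:
        return None
    weights = softmax(np.array(scores))  # attention-style normalisation
    return np.average(np.stack(vecs), axis=0, weights=weights)

# Toy usage with random 50-dimensional word vectors.
rng = np.random.default_rng(0)
vocab = ["the", "cat", "sat", "on", "mat"]
word_vectors = {w: rng.normal(size=50) for w in vocab}
word_freq = {"the": 0.05, "on": 0.02, "cat": 0.001, "sat": 0.001, "mat": 0.0005}

emb = sentence_embedding(["the", "cat", "sat", "on", "the", "mat"], word_vectors, word_freq)
print(emb.shape)  # (50,)

For semantic textual similarity, two sentence embeddings produced this way would typically be compared with cosine similarity.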
