首页> 外文会议>Pacific Rim international conference on artificial intelligence >DRWS: A Model for Learning Distributed Representations for Words and Sentences
【24h】

DRWS: A Model for Learning Distributed Representations for Words and Sentences

机译:DRWS:学习单词和句子的分布式表示的模型

获取原文

摘要

Vector-space distributed representations of words can capture syntactic and semantic regularities in language and help learning algorithms to achieve better performance in natural language processing tasks by grouping similar words. With progress of machine learning techniques in recent years, much attention has been paid on this field. However, many NLP tasks such as text summary and sentence matching treat sentences as atomic units. In this paper, we introduce a new model called DRWS which can learn distributed representations for words and variable-length sentences. Feature vectors for words and sentences are learned based on their probability of co-occurrence between words and sentences using a neural network. To evaluate feature vectors learned by our model, we applied our model on the tasks of detecting word similarity and text summarization. Extensive experiments demonstrate the effectiveness of our proposed model in learning vector representations for words and sentences.
机译:单词的向量空间分布式表示可以捕获语言的句法和语义规律,并通过对相似的单词进行分组来帮助学习算法在自然语言处理任务中实现更好的性能。近年来,随着机器学习技术的进步,在该领域已引起了很多关注。但是,许多NLP任务(例如文本摘要和句子匹配)将句子视为原子单位。在本文中,我们介绍了一种称为DRWS的新模型,该模型可以学习单词和变长句子的分布式表示形式。使用神经网络基于单词和句子之间共现的概率来学习单词和句子的特征向量。为了评估模型学习的特征向量,我们将模型应用于检测单词相似性和文本摘要的任务。大量的实验证明了我们提出的模型在学习单词和句子的向量表示中的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号