Research on a Multi-Feature Sentence Similarity Computing Method Fusing Word Vectors

李峰; 侯加英; 曾荣仁; 凌晨

Abstract

After reviewing common sentence similarity computing methods, this paper trains a word vector model for semantic similarity computing on more than 34,000 texts from People's Daily. Building on the trained model, it designs a multi-feature sentence similarity computing method that combines word-level and sentence-structure features. At the word level, the method considers the number of overlapping words and their continuity, and uses the word vector model to measure the semantic similarity of non-overlapping words. At the structure level, it considers the order of the overlapping words and the length conformity of the two sentences. For the experiments, four sentence similarity computing methods are designed and implemented, together with an experimental system. The results show that the proposed algorithm achieves relatively good results, and that combining and optimizing the semantic features of words with the structural features of sentences improves the accuracy of sentence similarity computing.
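The abstract names five features but gives no formulas, so the following Python is only a minimal sketch of how such features could be combined. Every formula, the weights in `WEIGHTS`, all function names, and the toy vectors are assumptions made for illustration; they are not the paper's definitions, and in practice the vectors would come from a trained Word2vec model rather than a hand-written dictionary.

```python
import math

# Hypothetical feature weights (sum to 1); the abstract does not state any.
WEIGHTS = {"overlap": 0.3, "continuity": 0.15, "semantic": 0.3,
           "order": 0.15, "length": 0.1}

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def overlap(s1, s2):
    """Dice coefficient over the distinct words of both sentences."""
    a, b = set(s1), set(s2)
    return 2 * len(a & b) / (len(a) + len(b)) if a or b else 0.0

def continuity(s1, s2):
    """Longest run of consecutive shared words, relative to the shorter sentence."""
    best, prev = 0, [0] * (len(s2) + 1)
    for w1 in s1:
        cur = [0] * (len(s2) + 1)
        for j, w2 in enumerate(s2, start=1):
            if w1 == w2:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    shorter = min(len(s1), len(s2))
    return best / shorter if shorter else 0.0

def semantic(s1, s2, vectors):
    """Mean best cosine match between the non-overlapping words of each side.
    Words without a vector are ignored; negative cosines are clamped to 0."""
    common = set(s1) & set(s2)
    r1 = [w for w in s1 if w not in common and w in vectors]
    r2 = [w for w in s2 if w not in common and w in vectors]
    if not r1 and not r2:
        return 1.0  # nothing left to compare
    if not r1 or not r2:
        return 0.0
    def directed(xs, ys):
        return sum(max(max(0.0, cosine(vectors[x], vectors[y])) for y in ys)
                   for x in xs) / len(xs)
    return (directed(r1, r2) + directed(r2, r1)) / 2

def word_order(s1, s2):
    """Share of adjacent shared-word pairs that keep their relative order."""
    common = [w for w in dict.fromkeys(s1) if w in s2]
    if len(common) < 2:
        return float(len(common))  # 1.0 for one shared word, 0.0 for none
    pos = [s2.index(w) for w in common]
    kept = sum(1 for x, y in zip(pos, pos[1:]) if x < y)
    return kept / (len(pos) - 1)

def length_conformity(s1, s2):
    total = len(s1) + len(s2)
    return 1 - abs(len(s1) - len(s2)) / total if total else 0.0

def similarity(s1, s2, vectors):
    """Weighted sum of the five features; inputs are pre-tokenized word lists."""
    feats = {"overlap": overlap(s1, s2),
             "continuity": continuity(s1, s2),
             "semantic": semantic(s1, s2, vectors),
             "order": word_order(s1, s2),
             "length": length_conformity(s1, s2)}
    return sum(WEIGHTS[name] * value for name, value in feats.items())

# Toy 2-dimensional vectors, invented for this example only.
toy_vectors = {"like": [0.9, 0.1], "love": [0.8, 0.2], "hate": [-0.9, 0.1],
               "apples": [0.1, 0.9]}
print(similarity(["i", "like", "apples"], ["i", "love", "apples"], toy_vectors))
print(similarity(["i", "like", "apples"], ["they", "hate", "rain"], toy_vectors))
```

A weighted sum is used here only because it is the simplest way to combine scores that each lie in [0, 1]. Chinese input would first need word segmentation, which this sketch leaves out.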

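Bibliographic information

Source
《计算机科学与探索》 (Journal of Frontiers of Computer Science and Technology) | 2017, Issue 4 | pp. 608-618 | 11 pages
Authors
李峰; 侯加英; 曾荣仁; 凌晨
Affiliations

Logistics Science Research Institute of the Chinese People's Liberation Army, Beijing 100166;

School of Computer Science, Beihang University, Beijing 100191;

Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504;

Logistics Science Research Institute of the Chinese People's Liberation Army, Beijing 100166;

Logistics Science Research Institute of the Chinese People's Liberation Army, Beijing 100166;

Original format: PDF
Language of text: Chinese
CLC classification: Information processing (information handling)
Keywords
Word vector; sentence similarity; Word2vec; algorithm design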

