A Deep Learning Based Method to Measure the Similarity of Long Text

Abstract

Traditional methods for measuring text similarity are not accurate enough on complex text data, and on long text in particular. We found that the main cause is that their feature representations are not strong enough. To improve the accuracy of long-text similarity, we propose an algorithm that uses a pre-trained deep learning model to extract features from long text. On the THUCNews benchmark corpus, the accuracy of our method is 5.4% higher than that of the traditional algorithm. We also run ablation experiments to measure the improvement contributed by fine-tuning.
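The abstract does not name the pre-trained model, how long inputs are split, how features are pooled, or which similarity measure is used. The sketch below is therefore only an illustration of the general shape of such a pipeline under assumptions of our own: fixed-size character chunks, mean pooling, and cosine similarity. `toy_encode` is a hypothetical stand-in (hashed character bigrams) so that the example runs without any model download; in practice it would be replaced by a call to a pre-trained encoder.

```python
import hashlib
import math

DIM = 64  # size of the stand-in feature vector (assumption, not from the paper)


def toy_encode(chunk: str) -> list[float]:
    """Hypothetical stand-in for a pre-trained encoder: hashed character bigrams."""
    vec = [0.0] * DIM
    for i in range(len(chunk) - 1):
        bigram = chunk[i:i + 2]
        idx = int(hashlib.md5(bigram.encode("utf-8")).hexdigest(), 16) % DIM
        vec[idx] += 1.0
    return vec


def embed_long_text(text: str, chunk_size: int = 128, encode=toy_encode) -> list[float]:
    """Split the text into fixed-size chunks, encode each chunk, then mean-pool."""
    chunks = [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)] or [""]
    vectors = [encode(c) for c in chunks]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def long_text_similarity(text_a: str, text_b: str, encode=toy_encode) -> float:
    """Similarity of two long texts as the cosine of their pooled feature vectors."""
    return cosine_similarity(
        embed_long_text(text_a, encode=encode),
        embed_long_text(text_b, encode=encode),
    )
```

Passing a different `encode` function is the only change needed to swap the toy features for real model embeddings; the chunking and pooling choices here are placeholders rather than the authors' design.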

Bibliographic details

Source
IEEE International Conference on Information Systems and Computer Aided Education | 2020 | pp. 173-178 | 6 pages
Authors
Guohua Wang; Tianjian Zhang; Genpeng Xu; Yongsen Zheng; Zhiguo Du; Qi Long
Original format: PDF
Keywords
text similarity; deep learning; long news

Similar literature

1. Comparison of Deep Learning Methods Used to Detect the Similarity Between Two Texts [J]. El Mostafa HAMBI, Faouzia Benabbou. International Journal of Computer Science and Network Security. 2019, No. 10.
2. A New Method for Measuring Text Similarity in Learning Management Systems Using WordNet [J]. Bassel Alkhatib, Ammar Alnahhas, Firas Albadawi. International Journal of Web-Based Learning and Teaching Technologies. 2014, No. 2.
3. Text similarity semantic calculation based on deep reinforcement learning [J]. International Journal of Security and Networks. 2020, No. 1.
4. New Methods for Text Categorization Based on a New Feature Selection Method and a New Similarity Measure Between Documents [C]. Li-Wei Lee, Shyi-Ming Chen. International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (IEA/AIE 2006); June 27-30, 2006; Annecy (FR). 2006.
5. Query-Focused Extractive Summarization Based on Deep Learning: Comparison of Similarity Measures for Pseudo Ground Truth Generation [D]. Yuliska. 2019.
6. Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks [O]. Maged Nasser, Naomie Salim, Hentabli Hamza. 2021.
7. Text similarity semantic calculation based on deep reinforcement learning [O]. Guanlin Chen, Xiaolong Shi, Moke Chen. 2020.
