首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking
【24h】

Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking

机译:Word Embeddings和基于Graph级别的生物医学文献智能多文件摘要

获取原文
获取原文并翻译 | 示例
           

摘要

With the rapid development of clinical and laboratory medicine, the field of bioinformatics boasts of extensive clinical records and research literature. Retrieving effective information from this huge data has become a challenging task. Hence, Intelligent text summarization, which enables users to find and understand relevant source texts more quickly and effortlessly, becomes a very significant and valuable field of research. In this study, we propose an improved TextRank algorithm with weight calculation based on sentence graph to solve this problem. For the experimental dataset obtained from Pubmed, we represent terms as vectors by using Skip-gram model. We design three methods which utilize word embeddings to calculate weights between sentences. Then we build an undirected graph with sentences as nodes. At last, we use the improved TextRank algorithm to calculate the importance of sentences and further generated summarizations base on its ranking. The experimental results and analysis on the datasets demonstrate the effectiveness of the proposed model.
机译:随着临床和实验室医学的快速发展,生物信息学领域具有广泛的临床记录和研究文献。从这个巨大数据中检索有效信息已成为一个具有挑战性的任务。因此,智能文本摘要使用户能够更快而毫不费力地找到和理解相关的来源文本,成为一个非常重要和有价值的研究领域。在这项研究中,我们提出了一种改进的Textrank算法,基于句子图来解决这个问题。对于从PubMed获得的实验数据集,我们通过使用Skip-Gram模型表示术语。我们设计三种方法,该方法利用Word Embeddings计算句子之间的权重。然后我们用句子构建一个无向图形的图表。最后,我们使用改进的Textrank算法来计算句子的重要性,并进一步生成的摘要基于其排名。数据集的实验结果和分析证明了所提出的模型的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号