首页> 外文会议>International Conference on Advanced Informatics: Concepts, Theory and Applications >Sentence structure-based summarization for Indonesian news articles
【24h】

Sentence structure-based summarization for Indonesian news articles

机译:基于句子结构的印尼新闻文章摘要

获取原文

摘要

Automatic multi-document summarization may help news readers retrieve information from digital news media efficiently. The summarizer create a concise summary containing important information from a collection of articles, enabling readers to read only one text to gain information from multiple text sources. Reflecting on previous researches, we propose an automatic summarization system using sentence structure information (subject, object, predicate, complement). The system consists of four main components, preprocessing and feature extraction, sentence structure information extraction, sentence clustering and fusion, and sentence selection. The system will extract the necessary information using dependency tree, cluster sentences using Density Based Spatial Clustering for Application with Noise (DBSCAN), fuse sentences with sentence structure information graph, and select sentences using Maximal Marginal Relevance (MMR). The evaluation shows that the proposed system performs with 0.276 average ROUGE-2, with many chances of improvements. Sentence structure extractor has 0.75 f1-measure score.
机译:自动多文档摘要可以帮助新闻阅读器有效地从数字新闻媒体检索信息。摘要器创建了一个简洁的摘要,其中包含来自一系列文章的重要信息,使读者可以仅阅读一个文本即可从多个文本源中获取信息。回顾以前的研究,我们提出一种使用句子结构信息(主题,宾语,谓语,补语)的自动摘要系统。该系统由四个主要组件组成:预处理和特征提取,句子结构信息提取,句子聚类和融合以及句子选择。系统将使用依存关系树提取必要的信息,使用基于噪声的基于空间的空间聚类(DBSCAN)提取句子,使用句子结构信息图融合句子,并使用最大边际相关度(MMR)选择句子。评估表明,所提出的系统的平均ROUGE-2性能为0.276,有很多改进的机会。句子结构提取器的分数为0.75 f1。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号