首页> 外文会议>Natural language understanding and intelligent applications >Extractive Single Document Summarization via Multi-feature Combination and Sentence Compression
【24h】

Extractive Single Document Summarization via Multi-feature Combination and Sentence Compression

机译:通过多特征组合和句子压缩提取单文档摘要

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we attempt to extract and generate the short summary for the news article with the length limit of 60 Chinese characters. Firstly, we preprocess the news article by segmenting sentences and words, and then extract four kinds of central words to form the keyword dictionary based on parsing tree. After that, the four kinds of features, i.e. the sentence weight, the sentence similarity, the sentence position and the length of sentence, will be employed to measure the significance of each sentence. Finally, we extract two sentences in the descending order of significance score and compress them to get the summary for each news article. This approach can analyze the grammatical elements from original sentences in order to generate compression rules and trim syntactic elements according to their parsing trees. The evaluation results show that our system is efficient in Chinese news summarization.
机译:在本文中,我们尝试提取并生成长度限制为60个汉字的新闻报道的简短摘要。首先,通过对句子和单词进行切分,对新闻进行预处理,然后提取四种中心词,构成基于解析树的关键词词典。之后,将采用四种特征,即句子权重,句子相似度,句子位置和句子长度,来衡量每个句子的重要性。最后,我们按照重要性得分从高到低的顺序提取两个句子,并对其进行压缩以获得每篇新闻的摘要。这种方法可以分析原始句子中的语法元素,以便生成压缩规则并根据其语法分析树修剪语法元素。评估结果表明,我们的系统在中文新闻摘要中是有效的。

著录项

  • 来源
  • 会议地点 Dalian(CN)
  • 作者单位

    College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China,Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan University of Science and Technology, Wuhan 430065, China;

    College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China,Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan University of Science and Technology, Wuhan 430065, China;

    College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China,Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan University of Science and Technology, Wuhan 430065, China;

    College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China,Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan University of Science and Technology, Wuhan 430065, China;

    Laboratory of Language Engineering and Computing, Guangdong University of Foreign Studies, Guangzhou 510006, China;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Single document summarization; Multi-feature combination Sentence extraction; Parsing tree; Sentence compression;

    机译:单一文件摘要;多特征组合句提取;解析树;句子压缩;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号