首页> 外文期刊>ITB Journal of Information and Communication Technology >Automatic Title Generation in Scientific Articles for Authorship Assistance: A Summarization Approach
【24h】

Automatic Title Generation in Scientific Articles for Authorship Assistance: A Summarization Approach

机译:自动为作者提供帮助的科学文章中的标题生成:一种汇总方法

获取原文
           

摘要

This paper presents a studyon automatic title generation for scientific articles considering sentence information types known as rhetorical categories. A title can be seenas a high-compression summary of a document. A rhetorical category is an information type conveyed by the author of a text for each textual unit, for example: background, method, or result of the research. The experiment in this studyfocused on extracting the research purpose and research method information for inclusion in a computer-generated title. Sentences are classifiedinto rhetorical categories, after which these sentences are filtered using three methods. Three title candidates whose contents reflect the filtered sentencesare then generated using a template-based or an adaptive K-nearest neighbor approach. The experiment was conducted using two different dataset domains: computational linguistics and chemistry. Our study obtained a 0.109-0.255 F1-measure score on average for computer-generated titles compared to original titles. In a human evaluation the automatically generated titles were deemed ‘relatively acceptable’ in the computational linguistics domain and ‘not acceptable’ in the chemistry domain. It can be concluded that rhetorical categories have unexplored potential to improve the performance of summarization tasks in general.
机译:本文提出了一种针对科学文章的自动标题生成的研究,其中考虑了称为修辞类的句子信息类型。标题可以看作是文档的高压缩摘要。修辞类别是文本作者针对每个文本单位传达的信息类型,例如:研究背景,方法或研究结果。本研究中的实验着重于提取研究目的和研究方法信息以包含在计算机生成的标题中。将句子分类为修辞类,然后使用三种方法过滤这些句子。然后使用基于模板的或自适应K最近邻居方法生成其内容反映已过滤句子的三个标题候选者。实验是使用两个不同的数据集领域进行的:计算语言学和化学。我们的研究与原始标题相比,计算机生成标题的F1-measure平均得分为0.109-0.255。在人工评估中,自动生成的标题在计算语言学领域被视为“相对可以接受”,而在化学领域则被视为“不可接受”。可以得出这样的结论:修辞类别总体上具有提高摘要任务性能的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号