首页> 外文期刊>ITB Journal of Information and Communication Technology >Rhetorical Sentences Classification Based on Section Class and Title of Paper for Experimental Technical Papers
【24h】

Rhetorical Sentences Classification Based on Section Class and Title of Paper for Experimental Technical Papers

机译:基于科目类别和实验技术论文的修辞句分类

获取原文
           

摘要

Rhetorical sentence classification is an interesting approach for making extractive summaries but this technique still needs to be developed because the performance of automatic rhetorical sentence classification is still poor. Rhetorical sentences are sentences that contain rhetorical words or phrases. Rhetorical sentences not only appear in the contents of a paper but also in the title. In this study, features related to section class and title class that have been proposed in a previous research were further developed. Our method uses different techniques to reach automatic section class extraction for which we introduce new, format-based features. Furthermore, we propose automatic rhetoric phrase extraction from the title. The corpus we used was a collection of technical-experimental scientific papers. Our method uses the Support Vector Machine (SVM) algorithm and the Na?ve Bayesian algorithm for classification. The four categories used were: Problem, Method, Data, and Result . It was hypothesized that these features would be able to improve classification accuracy compared to previous methods. The F-measure for these categories reached up to 14%.
机译:修辞句分类是一种用于提取摘要的有趣方法,但是由于自动修辞句分类的性能仍然很差,因此仍需要开发此技术。修辞句是包含修辞词或短语的句子。修辞句不仅出现在论文的内容中,而且出现在标题中。在这项研究中,先前研究中提出的与节类和标题类有关的功能得到了进一步发展。我们的方法使用了不同的技术来实现自动的节类提取,为此我们引入了基于格式的新功能。此外,我们建议从标题中自动提取修辞短语。我们使用的语料库是一组技术实验科学论文。我们的方法使用支持向量机(SVM)算法和朴素贝叶斯算法进行分类。使用的四个类别是:问题,方法,数据和结果。假设与以前的方法相比,这些功能将能够提高分类的准确性。这些类别的F值高达14%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号