首页> 外文会议>IIAI International Congress on Advanced Applied Informatics >Feature Words of Moves in Scientific Abstracts
【24h】

Feature Words of Moves in Scientific Abstracts

机译:科学文摘中的特色动词

获取原文

摘要

Extraction of structure from texts is a key issue of text mining. The rhetorical structure of move in scientific articles is useful for assisting in the reading and writing. In this paper, we classify move structure in the abstract of research articles with a small number of characteristic words that determine five moves of including background (B), purpose(P), method(M), result(R) and discussion(D). Eleven measures were introduced and used to select features of moves. Exhaustive parameter search were conducted to get the optimal combination of measure and the number of features. We applied support vector machine and evaluated 10 fold cross validations. The accuracies with optimal feature selections are 0.9022, 0.8322, 0.8442, 0.8820 and 0.8354 for B, P, M, R and D, respectively. They are 10% better than the baseline performance that use all keywords. This study surprisedly found that the negative feature words play central role for prediction performance improvement.
机译:从文本中提取结构是文本挖掘的关键问题。科学文章中的动词的修辞结构对于帮助阅读和写作很有用。在本文中,我们以少量的特征词对研究结构的摘要进行了分类,这些特征词确定了包括背景(B),目的(P),方法(M),结果(R)和讨论(D)在内的五个动作。 )。引入了11种措施,并用于选择移动的特征。进行详尽的参数搜索,以获得度量和特征数量的最佳组合。我们应用了支持向量机并评估了10倍交叉验证。对于B,P,M,R和D,具有最佳特征选择的精度分别为0.9022、0.8322、0.8442、0.8820和0.8354。它们比使用所有关键字的基准效果好10%。这项研究惊讶地发现,负面特征词在预测性能改善中起着核心作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号