首页> 外文期刊>Digital scholarship in the humanities >A small set of stylometric features differentiates Latin prose and verse
【24h】

A small set of stylometric features differentiates Latin prose and verse

机译:一小一套舞台训练功能区分拉丁文散文和诗歌

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Identifying the stylistic signatures characteristic of different genres is of central importance to literary theory and criticism. In this article we report a large-scale computational analysis of Latin prose and verse using a combination of quantitative stylistics and supervised machine learning. We train a set of classifiers to differentiate prose and poetry with high accuracy (97%) based on a set of twenty-six text-based, primarily syntactic features and rank the relative importance of these features to identify a low-dimensional set still sufficient to achieve excellent classifier performance. This analysis demonstrates that Latin prose and verse can be classified effectively using just three top features. From examination of the highly ranked features, we observe that measures of the hypotactic style favored in Latin prose (i.e. subordinating constructions in complex sentences, such as relative clauses) are especially useful for classification.
机译:确定不同类型的程度签名特征对文学理论和批评具有核心重要性。在本文中,我们通过定量风格测量和监督机器学习的组合报告了拉丁散文和诗歌的大规模计算分析。我们培养一组分类器,以高精度(> 97%)在基于二十六个基于文本的,主要的句法特征,并对这些功能的相对重要性进行识别,以识别低维集合的散文和诗歌足以实现优秀的分级性能。该分析表明拉丁散文和诗歌可以使用仅使用三个顶部特征进行有效归类。从检查高度排名的特征,我们观察到拉丁散文中青睐的措施(即复杂句子中的次名建筑,例如相对条款)对分类特别有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号