首页> 外文期刊>Automatic Control and Computer Sciences >Analysis of the Influence of Mixed-Level Stylometric Characteristics on the Verification of Authors of Literary Works
【24h】

Analysis of the Influence of Mixed-Level Stylometric Characteristics on the Verification of Authors of Literary Works

机译:Analysis of the Influence of Mixed-Level Stylometric Characteristics on the Verification of Authors of Literary Works

获取原文
获取原文并翻译 | 示例
           

摘要

This article analyses the influence of various combinations of mixed-level stylometric characteristics on the quality of verification of the authorship of Russian, English and French prose texts. The study is carried out both for low-level stylometric characteristics based on words and characters, and for higher-level structure ones. All stylometric characteristics are calculated automatically using the ProseRhythmDetector program. This approach provides the analyses of works of a large volume and many writers at the same time. In the course of the work, character-level, word-level, and structure-level stylometric vectors are associated with each text. During the experiments, the sets of parameters of these three levels were combined with each other in all possible ways. The resulting vectors of stylometric characteristics were submitted to the input of various classifiers to perform verification and identify the most suitable classifier for solving the problem. The best results were obtained using the AdaBoost classifier. The average F-measure for all languages was over 92. Detailed verification quality assessments are given for each author and analyzed. The use of high-level stylometric characteristics, in particular, the frequency of using N-grams of POS tags, opens the prospect of a more detailed analysis of author's styles. The results of the experiments show that when combining the characteristics of the structure level with the characteristics of the word level and/or character level, the most accurate results of authorship verification for literary texts in Russian, English, and French are obtained. Additionally, the authors concluded that stylometric characteristics have different degrees of influence on the quality of authorship verification for different languages.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号