Literary writing style recognition via a minimal spanning tree-based approach

Shalymov Dmitry; Granichin Oleg; Klebanov Lev; Volkovich Zeev

Abstract

In this paper, we address the problem of literary writing style determination by comparing the randomness of two given texts. We seek to determine whether these texts are generated from distinct probability sources, which would reveal a difference between the literary writing styles of the corresponding authors. We propose a new approach that incorporates the known Friedman-Rafsky two-sample test into a multistage procedure with the aim of stabilizing the process. A sampling procedure built on the N-grams methodology is used to simulate samples drawn from the pooled text, in order to evaluate the null hypothesis distribution that arises when the writing styles coincide. Next, samples from different files are selected, and the p-values of the test statistics are calculated. The empirical distribution of these values is compared numerous times with the uniform distribution on the interval [0, 1], and the writing styles are recognized as different if the rejection fraction in this sequence of comparisons is significantly greater than 0.5. The proposed approach is language independent within the family of alphabetic languages and does not involve the use of linguistics. Unlike most existing methods, our approach does not involve determining any authorship attributes. The text itself, or more precisely the distribution of sequential text templates and their mutual occurrences, essentially identifies the style. Experiments demonstrate the strong capability of the proposed method. (C) 2016 Elsevier Ltd. All rights reserved.
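
The abstract names the Friedman-Rafsky two-sample test as the core building block. The following is a minimal sketch of that basic test only, in its permutation form, and not the authors' multistage procedure: it assumes each text sample has already been turned into a numeric feature vector (for example N-gram frequencies), builds a Euclidean minimum spanning tree over the pooled points, and counts the edges that join points from different samples. The function names `mst_edges`, `cross_edge_count`, and `friedman_rafsky_pvalue`, as well as the parameters `n_perm` and `seed`, are hypothetical and chosen for illustration.

```python
import math
import random


def mst_edges(points):
    """Return the edges (i, j) of a Euclidean minimum spanning tree (Prim's algorithm)."""
    n = len(points)
    in_tree = [False] * n
    best_dist = [math.inf] * n   # cheapest known connection of each vertex to the tree
    best_from = [-1] * n         # tree vertex that provides that connection
    best_dist[0] = 0.0
    edges = []
    for _ in range(n):
        # Pick the vertex outside the tree that is closest to it.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best_dist[i])
        in_tree[u] = True
        if best_from[u] >= 0:
            edges.append((best_from[u], u))
        # Update the cheapest connections of the remaining vertices.
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best_dist[v]:
                    best_dist[v] = d
                    best_from[v] = u
    return edges


def cross_edge_count(edges, labels):
    """Count tree edges whose endpoints belong to different samples."""
    return sum(1 for i, j in edges if labels[i] != labels[j])


def friedman_rafsky_pvalue(sample_x, sample_y, n_perm=500, seed=0):
    """Permutation p-value: few cross-sample edges suggest different distributions."""
    rng = random.Random(seed)
    pooled = list(sample_x) + list(sample_y)
    labels = [0] * len(sample_x) + [1] * len(sample_y)
    edges = mst_edges(pooled)            # the tree is built once on the pooled data
    observed = cross_edge_count(edges, labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)              # relabel points under the null hypothesis
        if cross_edge_count(edges, labels) <= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    gen = random.Random(1)
    # Toy data (an assumption for illustration): two well-separated 2-D clusters.
    xs = [(gen.gauss(0, 1), gen.gauss(0, 1)) for _ in range(10)]
    ys = [(gen.gauss(100, 1), gen.gauss(100, 1)) for _ in range(10)]
    print(friedman_rafsky_pvalue(xs, ys))
```

In the procedure the abstract outlines, p-values of this kind would be computed for many sampled pairs and their empirical distribution compared with the uniform distribution on [0, 1]; that outer loop and the N-gram sampling step are not shown here.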

Bibliographic details

Source
Expert Systems with Applications | 2016, Issue 11 | pp. 145-153 | 9 pages
Authors
Shalymov Dmitry; Granichin Oleg; Klebanov Lev; Volkovich Zeev
Author affiliations

St Petersburg State Univ, Fac Math & Mech, Universitetsky Prospekt 28, St Petersburg 198504, Russia;

St Petersburg State Univ, Fac Math & Mech, Universitetsky Prospekt 28, St Petersburg 198504, Russia | St Petersburg State Univ, Res Lab Anal & Modeling Social Proc, Universitetsky Prospekt 28, St Petersburg 198504, Russia;

Charles Univ Prague, Dept Probabil & Stat, Prague, Czech Republic;

ORT Braude Coll Engn, Software Engn Dept, IL-21982 Karmiel, Israel;

Original format: PDF
Language: English
Keywords
Writing style determination; Two-sample spanning tree-based test
Date indexed: 2022-08-17 13:29:42
