A comparative study on authorship attribution classification tasks using both neural network and statistical methods

Nikos Tsimboukakis; George Tambouratzis


Abstract

The present paper investigates the application of the multi-layer perceptron (MLP) to the task of categorizing texts based on their authors’ style. This task is of particular importance for information retrieval applications involving very large document databases. The article focuses on determining the extent to which the MLP model can be fine-tuned to successfully analyse such data, uncovering the stylistic differences among authors. The MLP-based method is compared and contrasted with statistical techniques, such as discriminant analysis, that are widely used in stylistic studies. The comparison of the methods is based on their classification performance, to provide an objective evaluation of the advantages of each method. A second aim of the study is to compare, for each method, the effectiveness of distinct features in the task of uncovering the author identity. To evaluate the effectiveness of the entire approach in greater depth, the results of the proposed MLP-based method are compared to those of established approaches, such as support vector machines (SVM), using both the original parameters employed by the MLP and term frequency–inverse document frequency (TF–IDF) parameters, and the cascade correlation approach. It is found that the proposed MLP-based approach possesses a number of advantages, such as high classification accuracy, broadly comparable to that of the SVM, coupled with the ability to algorithmically reduce the set of parameters used without adversely affecting the classification accuracy.
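The abstract mentions TF–IDF parameters without saying which weighting variant was used. As a rough illustration only, the sketch below computes one common form, relative term frequency multiplied by log(N / document frequency), in plain Python. The function name `tf_idf`, the toy corpus, and the choice of variant are assumptions made for this example and are not taken from the paper.

```python
import math
from collections import Counter

def tf_idf(documents):
    """Return one {term: weight} dict per tokenized document.

    Assumed weighting (not taken from the paper): relative term frequency
    multiplied by log(N / document frequency).
    """
    n_docs = len(documents)
    # Document frequency: number of documents that contain each term.
    doc_freq = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))

    vectors = []
    for tokens in documents:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors

# Toy corpus (hypothetical): three short "texts" split on whitespace.
corpus = [
    "the style of the first author".split(),
    "the style of the second author".split(),
    "a different text entirely".split(),
]
weights = tf_idf(corpus)
```

Under this variant, a term that occurs in every document receives weight 0, while terms confined to a few documents are weighted more heavily. The resulting vectors are the kind of input a classifier such as an SVM or MLP could consume.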


Bibliographic details

Source: Neural Computing & Applications, 2010, Issue 4, pp. 573-582 (10 pages)
Authors: Nikos Tsimboukakis; George Tambouratzis
Original format: PDF
Language: English
Keywords: Document classification; Neural networks; Feature selection

