首页> 外文会议>International conference on computational linguistics >Topic or Style? Exploring the Most Useful Features for Authorship Attribution
【24h】

Topic or Style? Exploring the Most Useful Features for Authorship Attribution

机译:主题还是风格?探索著作权归属最有用的功能

获取原文

摘要

Approaches to authorship attribution, the task of identifying the author of a document, are based on analysis of individuals' writing style and/or preferred topics. Although the problem has been widely explored, no previous studies have analysed the relationship between dataset characteristics and effectiveness of different types of features. This study carries out an analysis of four widely used datasets to explore how different types of features affect authorship attribution accuracy under varying conditions. The results of the analysis are applied to authorship attribution models based on both discrete and continuous representations. We apply the conclusions from our analysis to an extension of an existing approach to authorship attribution and outperform the prior state-of-the-art on two out of the four datasets used.
机译:作者身份归属的方法,即确定文档作者的任务,是基于对个人写作风格和/或偏好主题的分析。尽管已经对该问题进行了广泛的探讨,但是以前的研究都没有分析数据集特征与不同类型特征的有效性之间的关系。这项研究对四个广泛使用的数据集进行了分析,以探讨不同类型的特征如何在不同条件下影响作者的署名准确性。分析的结果将应用于基于离散表示和连续表示的作者归因模型。我们将分析得出的结论应用于现有作者著作权归属方法的扩展,并在使用的四个数据集中的两个数据集上超越了现有技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号