【24h】

A Survey Of Modern Authorship Attribution Methods

机译:现代著作权归属方法研究

获取原文
获取原文并翻译 | 示例
           

摘要

Authorship attribution supported by statistical or computational methods has a long history starting from the 19th century and is marked by the seminal study of Mosteller and Wallace (1964) on the authorship of the disputed "Federalist Papers." During the last decade, this scientific field has been developed substantially, taking advantage of research advances in areas such as machine learning, information retrieval, and natural language processing. The plethora of available electronic texts (e.g., e-mail messages, online forum messages, blogs, source code, etc.) indicates a wide variety of applications of this technology, provided it is able to handle short and noisy text from multiple candidate authors. In this article, a survey of recent advances of the automated approaches to attributing authorship is presented, examining their characteristics for both text representation and text classification. The focus of this survey is on computational requirements and settings rather than on linguistic or literary issues. We also discuss evaluation methodologies and criteria for authorship attribution studies and list open questions that will attract future work in this area.
机译:统计或计算方法所支持的作者身份归属始于19世纪,由来已久,并且以Mosteller和Wallace(1964)对有争议的“联邦主义者论文”作者身份的开创性研究为标志。在过去的十年中,利用机器学习,信息检索和自然语言处理等领域的研究进展,该科学领域得到了长足的发展。大量可用的电子文本(例如,电子邮件消息,在线论坛消息,博客,源代码等)表明该技术的广泛应用,只要它能够处理来自多个候选作者的简短而嘈杂的文本。在本文中,将对自动归因于作者身份的方法的最新进展进行综述,并检查它们在文本表示和文本分类方面的特征。该调查的重点是计算要求和设置,而不是语言或文学问题。我们还将讨论作者归属研究的评估方法和标准,并列出将吸引该领域未来工作的未解决问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号