Journal: The American Statistician

A Unified Approach to Authorship Attribution and Verification

Abstract

In authorship attribution, texts by an unknown author are assigned to one of two or more candidate authors by comparing the disputed texts with texts known to have been written by the candidates. In authorship verification, one decides whether a text or a set of texts could have been written by a given author. These two problems are usually treated separately. Under an open-set classification framework for attribution, which allows for the possibility that none of the candidate authors is the unknown author, verification becomes a special case of attribution. Here both problems are posed as a formal Bayesian multinomial model selection problem and given a closed-form solution that is tailored to categorical data, naturally incorporates text length and dependence in the analysis, and copes well with settings that have only a small number of training texts. The approach to authorship verification is illustrated by exploring whether a court ruling could have been written by the judge who signed it; the approach to authorship attribution is illustrated by revisiting the authorship of the Federalist Papers and through a small simulation study.
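The abstract does not spell out the model, so the following is only a minimal sketch of what a closed-form Bayesian multinomial comparison with an open-set option could look like. It assumes a symmetric Dirichlet prior on category probabilities, equal prior weight on every hypothesis, and independent category counts; it therefore ignores the dependence that the paper says it accounts for. The function names, the `alpha` parameter, the `"none"` label, and the toy counts are all hypothetical.

```python
from math import lgamma, exp


def log_beta(v):
    """Log of the multivariate beta function B(v)."""
    return sum(lgamma(a) for a in v) - lgamma(sum(v))


def log_predictive(x, known, alpha=1.0):
    """Log Dirichlet-multinomial predictive of counts x given known counts.

    The multinomial coefficient is omitted because it is the same
    under every hypothesis and cancels in the posterior.
    """
    post = [alpha + n for n in known]
    return log_beta([p + xi for p, xi in zip(post, x)]) - log_beta(post)


def posterior_probs(x, candidates, alpha=1.0):
    """Posterior probability of each candidate and of 'none of them'.

    Assumes equal prior probability for all hypotheses (an assumption
    of this sketch, not something stated in the abstract).
    """
    scores = {name: log_predictive(x, counts, alpha)
              for name, counts in candidates.items()}
    # Open-set hypothesis: an unseen author with no training counts.
    scores["none"] = log_predictive(x, [0] * len(x), alpha)
    top = max(scores.values())
    weights = {k: exp(s - top) for k, s in scores.items()}
    total = sum(weights.values())
    return {k: w / total for k, w in weights.items()}


if __name__ == "__main__":
    # Toy counts over three word categories (made-up numbers).
    candidates = {"A": [50, 10, 5], "B": [10, 50, 5]}
    disputed = [9, 2, 1]
    for name, p in posterior_probs(disputed, candidates).items():
        print(f"{name}: {p:.4f}")
```

With a single candidate in `candidates`, the same function acts as a verification test: the disputed text is compared against that author and against the `"none"` hypothesis only, which mirrors the abstract's point that verification is a special case of open-set attribution.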
