首页> 外文期刊>Information Forensics and Security, IEEE Transactions on >What You Submit Is Who You Are: A Multimodal Approach for Deanonymizing Scientific Publications
【24h】

What You Submit Is Who You Are: A Multimodal Approach for Deanonymizing Scientific Publications

机译:您提交的是您是谁:对科学出版物进行匿名处理的多模式方法

获取原文
获取原文并翻译 | 示例
           

摘要

The peer-review system of most academic conferences relies on the anonymity of both the authors and reviewers of submissions. In particular, with respect to the authors, the anonymity requirement is heavily disputed and pros and cons are discussed exclusively on a qualitative level. In this paper, we contribute a quantitative argument to this discussion by showing that it is possible for a machine to reveal the identity of authors of scientific publications with high accuracy. We attack the anonymity of authors using statistical analysis of multiple heterogeneous aspects of a paper, such as its citations, its writing style, and its content. We apply several multilabel, multiclass machine learning methods to model the patterns exhibited in each feature category for individual authors and combine them to a single ensemble classifier to deanonymize authors with high accuracy. To the best of our knowledge, this is the first approach that exploits multiple categories of discriminative features and uses multiple, partially complementing classifiers in a single, focused attack on the anonymity of the authors of an academic publication. We evaluate our author identification framework, deAnon, based on a real-world data set of 3894 papers. From these papers, we target 1405 productive authors that each have at least three publications in our data set. Our approach returns a ranking of probable authors for anonymous papers, an ordering for guessing the authors of a paper. In our experiments, following this ranking, the first guess corresponds to one of the authors of a paper in 39.7% of the cases, and at least one of the authors is among the top 10 guesses in 65.6% of all cases. Thus, deAnon significantly outperforms current state-of-the-art techniques for automatic deanonymization.
机译:大多数学术会议的同行评审系统都依赖于提交者和评审者的匿名性。特别是,对于作者而言,匿名性要求存在很大争议,其优缺点仅在质量上进行讨论。在本文中,我们通过展示一种机器有可能高精度地揭示科学出版物作者的身份,为这一讨论提供了定量的论据。我们使用对论文的多个不同方面的统计分析来攻击作者的匿名性,例如论文的引文,写作风格和内容。我们应用了几种多标签,多类机器学习方法来对每个作者的每个特征类别中显示的模式进行建模,并将它们组合到单个整体分类器中,从而以较高的准确性对作者进行匿名处理。据我们所知,这是第一种利用多种区分特征的方法,并在针对学术出版物作者的匿名性的一次集中攻击中使用多个部分补充的分类器。我们基于3894篇论文的真实数据集评估我们的作者识别框架deAnon。从这些论文中,我们针对1405位富有成效的作者,他们在我们的数据集中至少拥有三篇出版物。我们的方法返回匿名论文的可能作者排名,以猜测论文作者的顺序。在我们的实验中,按照该排名,在39.7%的案例中,第一个猜测与一位论文的作者相对应,在所有案例的65.6%中,至少有一位作者名列前十名。因此,deAnon明显优于自动去匿名化的最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号