Journal: Digital Investigation

Mining writeprints from anonymous e-mails for forensic investigation


Abstract

Many criminals exploit the anonymity of the cyber world to conduct illegal activities, and e-mail is the medium most commonly used for them. Extracting knowledge and information from e-mail text has therefore become an important step in cybercrime investigation and evidence collection. Yet it is one of the most challenging and time-consuming tasks because of the special characteristics of e-mail datasets. In this paper, we focus on the problem of mining writing styles from a collection of e-mails written by multiple anonymous authors. The general idea is to first cluster the anonymous e-mails by their stylometric features and then extract the writeprint, i.e., the unique writing style, from each cluster. We emphasize that this problem, together with our proposed solution, differs from the traditional problem of authorship identification, which assumes that training data is available for building a classifier. Our proposed method is particularly useful in the initial stage of an investigation, when the investigator usually has very little information about the case or the true authors of the suspicious e-mail collection. Experiments on a real-life dataset suggest that clustering by writing style is a promising approach for grouping e-mails written by the same author.
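The two-step idea in the abstract (cluster e-mails by stylometric features, then summarize each cluster's style) can be illustrated with a minimal sketch. The abstract does not say which features or which clustering algorithm the authors use, so everything below is an assumption made for illustration: the handful of features in `stylometric_features`, the plain k-means routine, and the use of a cluster centroid as a stand-in for a "writeprint". The toy e-mails are invented and are not the paper's dataset.

```python
import math
import re

# Hypothetical feature set: a few function words plus simple character ratios.
FUNCTION_WORDS = ("the", "of", "and", "to", "i", "you")


def stylometric_features(text):
    """Turn one e-mail body into a small numeric style vector."""
    words = re.findall(r"[A-Za-z']+", text.lower())
    n_words = max(len(words), 1)
    n_chars = max(len(text), 1)
    features = [
        sum(len(w) for w in words) / n_words,          # mean word length
        sum(c in ".,;:!?" for c in text) / n_chars,    # punctuation rate
        sum(c.isupper() for c in text) / n_chars,      # uppercase rate
    ]
    features += [words.count(fw) / n_words for fw in FUNCTION_WORDS]
    return features


def min_max_scale(rows):
    """Rescale every feature column to [0, 1] so no feature dominates."""
    columns = list(zip(*rows))
    lows = [min(c) for c in columns]
    spans = [max(c) - min(c) for c in columns]
    return [
        [(v - lo) / span if span else 0.0 for v, lo, span in zip(row, lows, spans)]
        for row in rows
    ]


def kmeans(points, k, iterations=20):
    """Plain k-means with deterministic farthest-point initialization."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assign each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j])) for p in points
        ]
        # Move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def cluster_emails(emails, k):
    """Cluster e-mails by style; return labels and per-cluster centroids."""
    vectors = min_max_scale([stylometric_features(e) for e in emails])
    return kmeans(vectors, k)


# Invented toy messages in two visibly different styles.
emails = [
    "SEND THE CASH NOW!!! DO IT NOW!!!",
    "PAY UP NOW!!! NO COPS!!! GOT IT???",
    "regarding the aforementioned transaction, kindly acknowledge receipt of the documentation",
    "considering the circumstances, immediate confirmation of the arrangement would be appreciated",
]
labels, writeprints = cluster_emails(emails, k=2)
print(labels)
```

Here each centroid in `writeprints` is only a crude summary of a cluster's average style. The number of clusters `k` must be chosen by the investigator in this sketch, and a realistic feature set would be far larger than the nine values used here.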
