首页> 外文OA文献 >Mining Chat Logs to Extract Information about Authors and Topics for Crime Investigation
【2h】

Mining Chat Logs to Extract Information about Authors and Topics for Crime Investigation

机译:挖掘聊天记录以提取有关犯罪调查的作者和主题的信息

摘要

Cybercriminals have been using the Internet to accomplish illegitimate activities and to execute catastrophic attacks. Computer Mediated Communication, such as online chat, provides an anonymous channel for predators to exploit victims. In order to prosecute criminals in a court of law, an investigator often needs to extract evidence from a large volume of chat messages. Most of the existing search tools are keyword-based, and the search terms are provided by an investigator. The quality of the retrieved results depends on the search terms provided. Due to the large volume of chat messages and the large number of participants in public chat rooms, the process is usually time-consuming and error-prone. This thesis presents a topic search model to analyze archives of chat logs for segregating crime-relevant logs from others. Specifically, we propose an extension of the Latent Dirichlet Allocation (LDA)-based model to extract topics, compute the contribution of authors in these topics, and study the transitions of these topics over time. In addition, we present another unique model for characterizing authors-topics over time. This is crucial for investigation because it provides a view of the activity in which authors are involved in certain topics. Experiments on two real-life datasets suggest that the proposed approach can discover hidden criminal topics and the distribution of authors to these topics.
机译:网络罪犯一直在使用互联网来完成非法活动并执行灾难性攻击。诸如在线聊天之类的计算机中介通信为掠夺者提供了一个匿名渠道来利用受害者。为了在法院起诉罪犯,调查人员通常需要从大量聊天消息中提取证据。现有的大多数搜索工具都是基于关键字的,并且搜索词是由调查人员提供的。检索结果的质量取决于提供的搜索词。由于大量的聊天消息和公共聊天室中的大量参与者,此过程通常很耗时且容易出错。本文提出了一种主题搜索模型,用于分析聊天日志的存档,以将与犯罪相关的日志与其他日志分开。具体来说,我们建议对基于潜在狄利克雷分配(LDA)的模型进行扩展,以提取主题,计算作者在这些主题中的贡献以及研究这些主题随时间的推移。此外,我们提出了另一个独特的模型,用于随时间推移表征作者主题。这对于调查至关重要,因为它提供了作者参与某些主题的活动的视图。在两个现实生活中的数据集上进行的实验表明,所提出的方法可以发现隐藏的犯罪主题以及这些主题的作者分布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号