首页> 外文会议>International conference on future data and security engineering >Performance Evaluation of a Natural Language Processing Approach Applied in White Collar Crime Investigation
【24h】

Performance Evaluation of a Natural Language Processing Approach Applied in White Collar Crime Investigation

机译:一种自然语言处理方法在白领犯罪调查中的性能评估

获取原文

摘要

In today's world we are confronted with increasing amounts of information every day coming from a large variety of sources. People and corporations are producing data on a large scale, and since the rise of the internet, e-mail and social media the amount of produced data has grown exponentially. From a law enforcement perspective we have to deal with these huge amounts of data when a criminal investigation is launched against an individual or company. Relevant questions need to be answered like who committed the crime, who were involved, what happened and on what time, who were communicating and about what? Not only the amount of available data to investigate has increased enormously, but also the complexity of this data has increased. When these communication patterns need to be combined with for instance a seized financial administration or corporate document shares a complex investigation problem arises. Recently, criminal investigators face a huge challenge when evidence of a crime needs to be found in the Big Data environment where they have to deal with large and complex datasets especially in financial and fraud investigations. To tackle this problem, a financial and fraud investigation unit of a European country has developed a new tool named LES that uses Natural Language Processing (NLP) techniques to help criminal investigators handle large amounts of textual information in a more efficient and faster way. In this paper, we present briefly this tool and we focus on the evaluation its performance in terms of the requirements of forensic investigation: speed, smarter and easier for investigators. In order to evaluate this LES tool, we use different performance metrics. We also show experimental results of our evaluation with large and complex datasets from real-world application.
机译:在当今世界,我们每天都面临着来自各种来源的越来越多的信息。人们和企业正在大规模生产数据,并且自从Internet,电子邮件和社交媒体兴起以来,生产的数据量呈指数增长。从执法角度来看,在对个人或公司进行刑事调查时,我们必须处理这些大量数据。需要回答相关问题,例如谁犯罪,谁参与,发生了什么事情,什么时候发生,谁在交流以及关于什么?不仅要调查的可用数据量大大增加,而且此数据的复杂性也增加了。当需要将这些通信模式与例如扣押的财务管理或公司文件共享时,会产生一个复杂的调查问题。最近,当需要在大数据环境中找到犯罪证据时,犯罪侦查人员面临着巨大的挑战,在大数据环境中,他们必须处理大型和复杂的数据集,尤其是在财务和欺诈调查中。为了解决这个问题,欧洲国家的金融和欺诈调查部门开发了一种名为LES的新工具,该工具使用自然语言处理(NLP)技术来帮助犯罪侦查员以更高效,更快捷的方式处理大量文本信息。在本文中,我们将简要介绍此工具,并根据法医调查的要求重点评估其性能:调查人员的速度,智慧和便捷性。为了评估此LES工具,我们使用了不同的性能指标。我们还展示了使用来自实际应用程序的大型和复杂数据集进行评估的实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号