【24h】

Using Text Mining in Finding Criminal Data on the Web

机译:使用文本挖掘在网络上查找犯罪数据

获取原文
获取原文并翻译 | 示例

摘要

With the rapid growth of cyber crime, law enforcement agencies are trying rigorously to find new methods to cope with the problem. Most cyber crimes seldom expose themselves explicitly, making the search of criminal information a difficult task. Fortunately, based on our experience, each type of crime has certain properties that can be recognized by human experts or by computers. In this paper we illustrate the use of text mining to find criminal information by an example — cautioning drugs on the Web. The underlying technique of text mining is the application of a special-purpose search engine, the eDetective system, constructed according to Facet Analysis Method (FAM). To formulate our search target, we organize the keywords of sample pages provided by law enforcement agencies in four facets, namely article, form & effect, exchange and marketing, and to represent the concept of cautioning drugs on the Web. A Web page will be compared with the search target and the similarity between them will be determined. Experimental results reveal that the average search precision is 81.70% and the highest F measure is 0.74510, if the exponential model of FAM is used in the Web page comparison. The result shows the applicability of our method in carrying out difficult and complex search tasks.
机译:随着网络犯罪的迅速增长,执法机构正在努力寻找新方法来解决该问题。大多数网络犯罪很少明确地暴露自己,使搜索犯罪信息成为一项艰巨的任务。幸运的是,根据我们的经验,每种类型的犯罪都具有人类专家或计算机可以识别的某些属性。在本文中,我们通过一个示例来说明如何使用文本挖掘来查找犯罪信息-警告Web上的毒品。文本挖掘的基础技术是根据Facet分析方法(FAM)构建的专用搜索引擎eDetective系统的应用。为了制定搜索目标,我们将执法机构提供的示例页面的关键字组织在四个方面,即文章,形式和效果,交换和市场营销,并在网络上表示警告毒品的概念。将网页与搜索目标进行比较,并确定它们之间的相似性。实验结果表明,如果在网页比较中使用FAM的指数模型,则平均搜索精度为81.70%,最高F测度为0.74510。结果表明我们的方法在执行困难和复杂的搜索任务中的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号