【24h】

Data Base Investigation as a Ranking Problem

机译:数据库调查是排名问题

获取原文
获取原文并翻译 | 示例

摘要

When data mining for forensic investigations, we are typically confronted with strongly imbalanced classes. Moreover, the labels of the non-target or negative class are usually not confirmed. In other words, the non-positive objects are unlabeled. For these situations classification methods are not well suited. We propose to approach these problems as ranking problems. We apply several supervised learning methods, including recently developed methods that are specifically aimed at optimizing ranking performance. With a true investigation dataset, we show the improvement over the prior probabilities using the ranking approach. It turns out that some two-class classification methods perform competitively on ranking performance, while the true ranking methods do not stand out.
机译:在进行法证调查的数据挖掘时,我们通常会遇到严重失衡的类。此外,通常不会确认非目标或否定类别的标签。换句话说,非阳性对象是未标记的。对于这些情况,分类方法不太适合。我们建议将这些问题作为排名问题来处理。我们采用了几种有监督的学习方法,包括最近开发的专门用于优化排名表现的方法。通过一个真实的调查数据集,我们使用排名方法显示了对先验概率的改进。事实证明,某些两类分类方法在排名表现上具有竞争性,而真正的排名方法并不突出。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号