【24h】

Rare Category Detection Forest

机译:稀有类别检测森林

获取原文

摘要

Rare category detecion (RCD) aims to discover rare cate-gories in a massive unlabeled data set with the help of a labeling oracle. A challenging task in RCD is to discover rare categories which are concealed by numerous data examples from major categories. Only a few algorithms have been proposed for this issue, most of which are on quadratic or cubic time complexity. In this paper, we propose a novel tree-based algorithm known as RCD-Forest with O(Φ n log (n/s)) time complexity and high query efficiency where n is the size of the unlabeled data set. Experimental results on both synthetic and real data sets verify the effectiveness and efficiency of our method.
机译:罕见的类别百分之屈(RCD)旨在在标签Oracle的帮助下发现大规模未标记的数据集中的稀有契约。 RCD中的一个具有挑战性的任务是发现稀有类别,这些类别由大类的许多数据示例隐藏。只有少数算法已经提出了这个问题,其中大多数是二次或立方时间复杂性。在本文中,我们提出了一种新建的基于树的算法,称为RCD-Forest,具有o(φnlog(n / s))时间复杂度和高查询效率,其中n是未标记数据集的大小。合成和实际数据集的实验结果验证了我们方法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号