...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >An efficient construction and application usefulness of rectangle greedy covers
【24h】

An efficient construction and application usefulness of rectangle greedy covers

机译:矩形贪婪封面的有效构造和应用效用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We develop efficient construction methods of a rectangle greedy cover (RGC), and evaluate its usefulness in applications. An RGC is a greedy cover of the set of given positive instances by exclusive axis-parallel hyperrectangles, namely, axis-parallel hyperrectangles that exclude all the given negative instances. An RGC is expected to be a compact classification rule with high readability because the number of its component rectangles is expected to be small and it can be seen as a disjunctive normal form, which is one of the most readable representations for us. We propose two approaches of RGC construction: enumeration approach and direct approach. In enumeration approach, the maximal exclusive positive subsets (MEPSs) are enumerated first and then an ordinary greedy set covering is done using the enumerated MEPSs. We make clear the relation between enumeration of the maximal frequent itemsets and enumeration of the MEPSs, and convert an efficient enumeration algorithm LCMmax [1] of maximal frequent itemsets to an enumeration algorithm LCMmax.R_naive of MEPSs. We also develop a more efficient version of LCMmax.R_naive, or LCMmax.R, by incorporating effective dynamic reordering of instances using excluded frequency and bit-parallel exclusiveness check. In direct approach, each component MEPS of an RGC is searched not from enumerated MEPSs but directly from the dataset that consists of the remaining uncovered positive instances and the whole negative instances. We developed an algorithm called MRF that efficiently finds an maximum-sized MEPS for given positive and negative instances. MRF is made from LCMmax.R by modifying it so as to find a maximum-sized MEPS only. An RGC is constructed by MRF repetition, that is, by repeatedly executing MRF using the remaining uncovered positive instances. According to our experimental evaluation using UCI-repository datasets, LCMmax.R was about 5-11 times faster than LCMmax.R_naive, which indicates effectiveness of the introduced two improvements. MRF repetition, however, was significantly faster than LCMmax.R, and it was fast enough for practical usage. The experimental results using UCI-repository datasets also showed that accuracy of a nearest rectangle classifier using an RGC is close to that using the hyperrectangles output by the randomized subclass method (RSM) [2] though the number of component rectangles of an RGC is significantly smaller than the number of the hyperrectangles output by RSM. The performance of RGC was also shown to be comparable to that of the six popular classifiers including logistic regression and support vector machine. The disjunctive normal form representation of the classification rules obtained by RGC was demonstrated to be simpler and more readable for us than that obtained by RSM and C4.5.
机译:我们开发了矩形贪婪封面(RGC)的有效构造方法,并评估了其在应用中的实用性。 RGC是排他的轴平行超矩形(即排除所有给定负实例的轴平行超矩形)对给定正实例的贪婪覆盖。 RGC有望成为具有高可读性的紧凑分类规则,因为其组成矩形的数量预计会很小,并且可以将其视为析取范式,这对我们来说是最易读的表示形式。我们提出了两种RGC构建方法:枚举方法和直接方法。在枚举方法中,首先枚举最大排他的正子集(MEPS),然后使用枚举的MEPS完成普通贪婪集的覆盖。我们弄清了最大频繁项集的枚举与MEPS枚举之间的关系,并将最大频繁项集的有效枚举算法LCMmax [1]转换为MEPS的枚举算法LCMmax.R_naive。通过使用排除的频率和位并行排他性检查合并实例的有效动态重排序,我们还开发了LCMmax.R_naive或LCMmax.R的更有效版本。在直接方法中,不是从枚举的MEPS中搜索RGC的每个组成部分MEPS,而是直接从包含剩余未发现的阳性实例和整个阴性实例的数据集中进行搜索。我们开发了一种称为MRF的算法,可以针对给定的正负实例有效地找到最大MEPS。 MRF由LCMmax.R修改而成,以便仅找到最大大小的MEPS。通过MRF重复(即通过使用剩余的未发现肯定实例重复执行MRF)来构造RGC。根据我们使用UCI存储库数据集进行的实验评估,LCMmax.R大约比LCMmax.R_naive快5-11倍,这表明所引入的两项改进的有效性。但是,MRF重复的速度明显快于LCMmax.R,并且对于实际使用而言足够快。使用UCI存储库数据集进行的实验结果还表明,尽管RGC的分量矩形数量显着增加,但使用RGC的最近矩形分类器的精度与使用随机子类方法(RSM)输出的超矩形的精度接近[2]。小于RSM输出的超矩形的数量。研究还表明,RGC的性能可与六种流行的分类器(包括逻辑回归和支持向量机)相媲美。与RSM和C4.5获得的分类规则相比,RGC获得的分类规则的析取范式表示对我们来说更简单易读。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号