...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >On using prototype reduction schemes to optimize dissimilarity-based classification
【24h】

On using prototype reduction schemes to optimize dissimilarity-based classification

机译:关于使用原型归约方案优化基于差异的分类

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The aim of this paper is to present a strategy by which a new philosophy for pattern classification, namely that pertaining to dissimilarity-based classifiers (DBCs), can be efficiently implemented. This methodology, proposed by Duin and his co-authors (see Refs. [Experiments with a featureless approach to pattern recognition, Pattern Recognition Lett. 18 (1997) 1159-1166; Relational discriminant analysis, Pattern Recognition Lett. 20 (1999) 1175-1181; Dissimilarity representations allow for buillding good classifiers, Pattern Recognition Lett. 23 (2002) 943-956; Dissimilarity representations in pattern recognition, Concepts, theory and applications, Ph.D. Thesis, Delft University of Technology, Delft, The Netherlands, 2005; Prototype selection for dissimilarity-based classifiers, Pattern Recognition 39 (2006) 189-208]), is a way of defining classifiers between the classes, and is not based on the feature measurements of the individual patterns, but rather on a suitable dissimilarity measure between them. The advantage of this methodology is that since it does not operate on the class-conditional distributions, the accuracy can exceed the Bayes' error bound. The problem with this strategy is, however, the need to compute, store and process the inter-pattern dissimilarities for all the training samples, and thus, the accuracy of the classifier designed in the dissimilarity space is dependent on the methods used to achieve this. In this paper, we suggest a novel strategy to enhance the computation for all families of DBCs. Rather than compute, store and process the DBC based on the entire data set, we advocate that the training set be first reduced into a smaller representative subset. Also, rather than determine this subset on the basis of random selection, or clustering, etc., we advocate the use of a prototype reduction scheme (PRS), whose output yields the points to be utilized by the DBC. The rationale for this is explained in the paper. Apart from utilizing PRSs, in the paper we also propose simultaneously employing the Mahalanobis distance as the dissimilarity-measurement criterion to increase the DBCs classification accuracy. Our experimental results demonstrate that the proposed mechanism increases the classification accuracy when compared with the "conventional" approaches for samples involving real-life as well as artificial data sets-even though the resulting dissimilarity criterion is not symmetric. (c) 2007 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
机译:本文的目的是提出一种策略,通过该策略可以有效地实现一种新的模式分类哲学,即与基于差异的分类器(DBC)有关的哲学。这种方法是由Duin及其合作者提出的(请参见参考文献[无特征方法的模式识别实验,Pattern Recognition Lett。18(1997)1159-1166;关系判别分析,Pattern Recognition Lett。20(1999)1175)。 -1181;不相似表示允许建立良好的分类器,模式识别lett。23(2002)943-956;模式识别中的不相似表示,概念,理论和应用,荷兰代尔夫特理工大学博士学位论文,2005;基于不相似性的分类器的原型选择,模式识别39(2006)189-208]),是一种在类之间定义分类器的方法,而不是基于各个模式的特征度量,而是基于它们之间合适的相异性度量。这种方法的优点是,由于它不对类条件分布进行运算,因此准确性可以超过贝叶斯的误差范围。然而,该策略的问题是需要为所有训练样本计算,存储和处理模式间的差异,因此,在差异空间中设计的分类器的准确性取决于实现此目标的方法。 。在本文中,我们提出了一种新颖的策略来增强所有DBC系列的计算能力。我们提倡首先将训练集简化为较小的代表性子集,而不是根据整个数据集来计算,存储和处理DBC。此外,我们提倡使用原型简化方案(PRS),而不是根据随机选择或聚类等方法确定此子集,该方案的输出会得出DBC要使用的点。本文对此进行了解释。除了利用PRS,我们还建议同时采用马氏距离作为相异性度量标准,以提高DBC的分类精度。我们的实验结果表明,与“常规”方法相比,对于涉及现实生活和人工数据集的样本,即使所产生的相异性标准不是对称的,所提出的机制也提高了分类的准确性。 (c)2007模式识别学会。由Elsevier Ltd.出版。保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号