...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Exploring the boundary region of tolerance rough sets for feature selection
【24h】

Exploring the boundary region of tolerance rough sets for feature selection

机译:探索公差粗糙集的边界区域以进行特征选择

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Of all of the challenges which face the effective application of computational intelligence technologies for pattern recognition, dataset dimensionality is undoubtedly one of the primary impediments. In order for pattern classifiers to be efficient, a dimensionality reduction stage is Usually performed prior to classification. Much use has been made of rough set theory for this purpose as it is completely data-driven and no other information is required; most other methods require some additional knowledge. However, traditional rough set-based methods in the literature are restricted to the requirement that all data must be discrete. It is therefore not possible to consider real-valued or noisy data. This is usually addressed by employing a discretisation method, which can result in information loss. This paper proposes a new approach based on the tolerance rough set model, which has the ability to deal with real-valued data whilst simultaneously retaining dataset semantics. More significantly, this paper describes the underlying mechanism for this new approach to utilise the information contained within the boundary region or region Of uncertainty. The use of this information can result in the discovery of more compact feature subsets and improved classification accuracy. These results are supported by an experimental evaluation which compares the proposed approach with a number of existing feature selection techniques.
机译:在有效利用计算智能技术进行模式识别所面临的所有挑战中,数据集的维数无疑是主要障碍之一。为了使模式分类器有效,通常在分类之前执行降维阶段。为此,已经广泛使用了粗糙集理论,因为它完全是数据驱动的,不需要其他信息。大多数其他方法需要一些其他知识。但是,文献中传统的基于粗糙集的方法仅限于所有数据必须是离散的要求。因此,不可能考虑实值或嘈杂的数据。这通常通过采用离散化方法解决,这可能导致信息丢失。本文提出了一种基于容差粗糙集模型的新方法,该方法具有处理实值数据并同时保留数据集语义的能力。更重要的是,本文描述了这种新方法利用边界区域或不确定区域中包含的信息的潜在机制。使用此信息可以导致发现更紧凑的特征子集并提高分类精度。这些结果得到了实验评估的支持,该评估将提出的方法与许多现有特征选择技术进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号