首页> 外文会议>International Conference on Data Mining >An approach to finding reduced sets of information features describing discrete objects based on rough sets theory
【24h】

An approach to finding reduced sets of information features describing discrete objects based on rough sets theory

机译:一种基于粗糙集理论的发现描述离散对象的信息特征约简集的方法

获取原文

摘要

Modern Data Mining methods allow discovering non-trivial dependencies in large information arrays. Since these methods are used for processing and analysis of huge information volumes, reducing the number of features necessary for describing a discrete object is one of the most important problems. One of the classical problems in intelligent data analysis is the problem of classifying new objects based on some a-priori information. This information might not allow us to exactly classify an object as one belonging to a certain set. In such cases using rough sets theory may be an effective solution as this theory operates with the concept of 'indiscernible' elements and ambiguous information. In this paper we introduce a concept of a local reduct as a reduced set of features allowing us to describe a particular subset of the original set with the same precision as with the help of the full set of features. A method has been suggested which allows finding reduced sets of features adequately describing a rough set without losing necessary information (so-called reducts), and also assessing the importance of each feature. The suggested method is based on the algebraic approach to finding rough set approximations developed by the authors earlier. The main idea of the developed approach is as follows: if the algebraic approximations of a rough set do not change substantially in the process of excluding features the resulting reduced set of features can be used instead of the original full set. Also the greater changes eliminating a particular feature causes in the approximations, the more important this feature is.
机译:现代数据挖掘方法允许在大型信息数组中发现非平凡的依赖关系。由于这些方法用于处理和分析大量信息,因此减少描述离散对象所需的特征数量是最重要的问题之一。智能数据分析中的经典问题之一是基于一些先验信息对新对象进行分类的问题。此信息可能不允许我们将一个对象准确地分类为属于某个集合的对象。在这种情况下,使用粗糙集理论可能是一种有效的解决方案,因为该理论以“模糊”元素和模糊信息的概念运行。在本文中,我们引入了局部约简的概念,将其作为一组简化的特征,使我们能够以与全套特征相同的精度描述原始集合的特定子集。已经提出了一种方法,该方法允许找到减少描述的特征集,以充分描述一个粗糙集而不会丢失必要的信息(所谓的归约法),并且还可以评估每个特征的重要性。建议的方法是基于代数方法来找到作者较早前开发的粗糙集近似值。所开发方法的主要思想如下:如果在排除特征的过程中粗集的代数近似值没有实质变化,则可以使用所得的减少的特征集代替原始的完整集。同样,消除特定特征的变化越大,则近似值也越重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号