首页> 外文会议>Data mining X : Data mining, protection, detection and other security technologies >A method for finding minimal sets of features adequately describing discrete information objects
【24h】

A method for finding minimal sets of features adequately describing discrete information objects

机译:一种用于找到足以描述离散信息对象的最小特征集的方法

获取原文
获取原文并翻译 | 示例

摘要

One of the classical Data Mining problems is the problem of classifying new objects on the basis of available information when the information associated with these objects does not allow identifying them unambiguously as elements of some set. In such cases using rough sets theory is often an effective solution. This theory operates with such concepts as "indiscernible" elements and relations. A rough set is characterized by lower and upper approximations for finding which the authors earlier suggested an original algebraic method. The given method uses only logic operations, which makes the process of searching logic rules very quick and efficient.rnThe upper and lower approximations of a rough set allow describing elements of this set as completely as it is possible from the viewpoint of available information. In this connection it seems interesting and important to find irreducible sets of features describing a rough set with the same "precision" as with the help of a full set of features (so called reducts). This problem is quite difficult and complicated and at present it does not have good solutions. Our paper continues research carried out by the authors earlier and we suggest a method for finding reducts based on eliminating non-salient features in the reverse order of their importance. The suggested procedure allows us to avoid exhaustive searching by extracting a predefined number of most significant reducts. In this paper we consider arbitrary features taking on their values from finite sets.
机译:经典的数据挖掘问题之一是当与这些对象关联的信息不允许将它们明确地标识为某个集合的元素时,根据可用信息对新对象进行分类的问题。在这种情况下,使用粗糙集理论通常是有效的解决方案。该理论以诸如“不清楚”的要素和关系的概念起作用。粗糙集的特征在于上下近似,以发现作者早些时候提出的原始代数方法。给定的方法仅使用逻辑运算,这使得搜索逻辑规则的过程非常快速和高效。粗糙集的上下近似允许从可用信息的角度尽可能完整地描述该集合的元素。在这一点上,找到不可约的特征集与完整的特征集(所谓的归约法)一样具有“精确度”,描述了一个粗糙集,这似乎很有趣且重要。这个问题是非常困难和复杂的,目前还没有好的解决方案。我们的论文继续了作者先前进行的研究,我们提出了一种以消除非显着特征的重要性相反的顺序寻找还原的方法。建议的过程使我们能够通过提取预定义数量的最重要的约简来避免详尽搜索。在本文中,我们考虑从有限集取值的任意特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号