首页> 外文会议>SEMCCO 2011;International conference on swarm, evolutionary, and memetic computing >Attribute Clustering and Dimensionality Reduction Based on In/Out Degree of Attributes in Dependency Graph
【24h】

Attribute Clustering and Dimensionality Reduction Based on In/Out Degree of Attributes in Dependency Graph

机译:基于依存关系图属性进/出程度的属性聚类与降维

获取原文

摘要

In order to mine useful information from huge datasets development of appropriate tools and techniques are needed to organize and evaluate such data. However, ultra high dimensionality of data poses serious challenges in data mining research. The method proposed in the paper encompasses a new strategy in dimensionality reduction by attribute clustering based on the dependency graph of the attributes. Information gain, an established theory of measuring uncertainty and quantified the information contained in the system, of each attribute is calculated that expresses dependency relationship between the attributes in the graph. The underlying principles able to select the optimum set of attributes, called reduct able to classify the dataset as could be done in presence of all attributes. The rate of dimension reduction of the datasets of UCI repository is measured and compared with existing methods and also the classification accuracy with reduced dataset is calculated by various classifiers to measure the effectiveness of the method.
机译:为了从庞大的数据集中挖掘有用的信息,需要开发适当的工具和技术来组织和评估此类数据。但是,数据的超高维度在数据挖掘研究中提出了严峻的挑战。本文提出的方法包含了一种新的基于属性依赖图的属性聚类降维方法。计算每个属性的信息增益,这是一种测量不确定性并量化系统中包含的信息的既定理论,它表示图形中属性之间的依赖关系。能够选择最佳属性集的基本原理(称为归约法)能够对所有数据集进行分类,从而对数据集进行分类。测量了UCI存储库数据集的降维率,并与现有方法进行了比较,并通过各种分类器计算了减少了数据集的分类精度,以衡量该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号