首页> 外文会议>5th International FLINS Conference on Computational Intelligent Systems for Applied Research, Sep 16-18, 2002, Gent, Belgium >USING SOFT COMPUTING TECHNIQUES TO INTEGRATE MULTIPLE KINDS OF ATTRIBUTES IN DATA MINING
【24h】

USING SOFT COMPUTING TECHNIQUES TO INTEGRATE MULTIPLE KINDS OF ATTRIBUTES IN DATA MINING

机译:使用软计算技术整合数据挖掘中的多种属性

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Data mining discouvers interesting information from a data set. Mining incorporates different methods and considers different kinds of information. Granulation is an important task of mining. Mining methods include: association rule discouvery, classification, partitioning, clustering, and sequence discouvery. The data sets can be extremely large with multiple kinds of data in high dimensionality. Most current clustering algorithms deal with either quantitative or qualitative data, but not both. However, many data sets contain a mixture of quantitative and qualitative data. We are considering how to best group records containing multiple kinds of data. It is difficult to do this. Even grouping based on different quantitative metrics has its difficulties. There are many partially successful strategies as well as several different possible differential geometries. Adding in various qualitative elements is exceedingly difficult. We expect to use a mixture of scalar methods, soft computing (rough sets, fuzzy sets), as well as methods using other metrics. To cluster records in a data set, it would be useful to have a similarity measure. Unfortunately, few exist that account meaningfully for any combination of kinds of data. The more meaningful metrics known are restrictive to a particular area or science. The method of combining magnitude difference and simple matching so that it is general enough for mining is a topic that is yet to be reasonably solved. We will present several strategies for integrating multiple metrics.
机译:数据挖掘可从数据集中发现有趣的信息。挖掘采用不同的方法并考虑不同种类的信息。制粒是采矿的重要任务。挖掘方法包括:关联规则无关,分类,分区,聚类和顺序无关。具有多种高维数据的数据集可能非常大。当前大多数聚类算法都处理定量或定性数据,但不能同时处理这两种数据。但是,许多数据集包含定量和定性数据。我们正在考虑如何最好地对包含多种数据的记录进行分组。很难做到这一点。即使基于不同的量化指标进行分组也有其困难。有许多部分成功的策略以及几种不同的可能的差异几何。添加各种定性元素极其困难。我们期望混合使用标量方法,软计算(粗糙集,模糊集)以及使用其他指标的方法。要将记录集中在数据集中,采用相似性度量将很有用。不幸的是,很少有有意义地说明各种数据组合的情况。已知的更有意义的指标仅限于特定领域或科学。结合大小差异和简单匹配以使其足够通用以进行挖掘的方法是一个尚待合理解决的话题。我们将提出几种整合多个指标的策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号