...
首页> 外文期刊>Expert Systems >Decision-making method using a visual approach for cluster analysis problems; indicative classification algorithms and grouping scope
【24h】

Decision-making method using a visual approach for cluster analysis problems; indicative classification algorithms and grouping scope

机译:使用视觉方法进行聚类分析的决策方法;指示性分类算法和分组范围

获取原文
获取原文并翻译 | 示例
           

摘要

Currently, classifying samples into a fixed number of clusters (i.e. supervised cluster analysis) as well as unsupervised cluster analysis are limited in their ability to support 'cross-algorithms' analysis. It is well known that each cluster analysis algorithm yields different results (i.e. a different classification); even running the same algorithm with two different similarity measures commonly yields different results. Researchers usually choose the preferred algorithm and similarity measure according to analysis objectives and data set features, but they have neither a formal method nor tool that supports comparisons and evaluations of the different classifications that result from the diverse algorithms. Current research development and prototype decisions support a methodology based upon formal quantitative measures and a visual approach, enabling presentation, comparison and evaluation of multiple classification suggestions resulting from diverse algorithms. This methodology and tool were used in two basic scenarios: (Ⅰ) a classification problem in which a 'true result' is known, using the Fisher iris data set; (Ⅱ) a classification problem in which there is no 'true result' to compare with. In this case, we used a small data set from a user profile study (a study that tries to relate users to a set of stereotypes based on sociological aspects and interests). In each scenario, ten diverse algorithms were executed. The suggested methodology and decision support system produced a cross-algorithms presentation; all ten resultant classifications are presented together in a 'Tetris-like' format. Each column represents a specific classification algorithm, each line represents a specific sample, and formal quantitative measures analyse the 'Tetris blocks', arranging them according to their best structures, i.e. best classification.
机译:当前,将样本分类为固定数量的聚类(即监督聚类分析)和非监督聚类分析在支持“跨算法”分析的能力方面受到限制。众所周知,每种聚类分析算法都会产生不同的结果(即不同的分类)。即使使用两个不同的相似性度量运行相同的算法,通常也会产生不同的结果。研究人员通常根据分析目标和数据集特征选择首选算法和相似性度量,但是他们既没有正式的方法也没有工具来支持对由不同算法产生的不同分类进行比较和评估。当前的研究开发和原型决策支持基于正式定量方法和视觉方法的方法论,从而能够对来自不同算法的多种分类建议进行展示,比较和评估。这种方法和工具被用在两个基本场景中:(一)使用费舍尔虹膜数据集的分类问题,其中“真实结果”是已知的; (Ⅱ)没有“真实结果”可比的分类问题。在这种情况下,我们使用了来自用户概况研究(该研究试图根据社会学方面和兴趣将用户与一组刻板印象相关的研究)中的少量数据集。在每种情况下,都会执行十种不同的算法。建议的方法和决策支持系统产生了跨算法的表示;所有十个分类结果都以“俄罗斯方块”格式一起显示。每列代表一个特定的分类算法,每行代表一个特定的样本,形式上的定量度量分析``俄罗斯方块'',并根据它们的最佳结构(即最佳分类)进行排列。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号