首页> 外文OA文献 >A comparison of several cluster algorithms on artificial binary data Part 1. Scenarios from travel market segmentation Part 2: Working Paper 19.

【2h】

A comparison of several cluster algorithms on artificial binary data Part 1. Scenarios from travel market segmentation Part 2: Working Paper 19.

机译：几种基于人工二进制数据的聚类算法的比较第1部分。来自旅游市场细分的场景第2部分：工作文件19。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Social scientists confronted with the problem of segmenting individuals into plausible subgroups usually encounter two main problems: First: there is very little indication about the correct choice of the number of clusters to search for. Second: different cluster algorithms and even multiple replications of the same algorithm result in different solutions due to random initializations and stochastic learning methods. In the worst case numerous solutions are found which all seem plausible as far as interpretation is concerned. The consequence is, that in the end clusters are postulated that are in fact "chosen" by the researcher, as he or she makes decisions on the number of clusters and the solution chosen as the "final" one. In this paper we concentrate on the power and stability of several popular clustering algorithms under the condition that the correct number of clusters is known. Artificial data sets modeled to mimic typical situations from tourism marketing are constructed. The structure of these data sets is described in several scenarios, and artificial binary data are generated accordingly. These data, ranging from very simple to more complex, real-data-like structures, enable us to systematically analyze the "behavior" of the cluster methods. Section 3 gives an overview of all cluster methods under investigation. Section 4 describes our experimental results, comparing first all scenarios and then all cluster methods. To accomplish this task, several evaluation criteria for cluster methods are proposed. Finally: Sections 5 and 6 draw some conclusions and give an outlook on future research. (author's abstract)

机译：社会科学家面临着将个人划分为合理的亚组的问题，通常会遇到两个主要问题：第一：很少有关于正确选择要搜索的集群数的迹象。第二：由于随机初始化和随机学习方法，不同的群集算法，甚至同一算法的多次复制都将产生不同的解决方案。在最坏的情况下，发现了许多解决方案，就解释而言，所有解决方案似乎都是合理的。结果是，最终，假定研究人员在决定簇的数量和选择为“最终”簇的解决方案时，实际上是由研究者“选择”的。在本文中，我们集中在已知正确簇数的情况下，几种流行的聚类算法的功能和稳定性。人工建模的数据集可以模仿旅游营销中的典型情况。这些数据集的结构在几种情况下进行了描述，并相应地生成了人工二进制数据。这些数据的范围从非常简单到更加复杂，类似于真实数据的结构，使我们能够系统地分析聚类方法的“行为”。第三部分概述了所有正在研究的聚类方法。第4节介绍了我们的实验结果，首先比较了所有方案，然后比较了所有聚类方法。为了完成这一任务，提出了几种聚类方法的评价标准。最后：第5和第6节得出一些结论，并对未来的研究进行展望。（作者的摘要）

著录项

作者
Dolnicar Sara; Leisch Friedrich; Weingessel Andreas; Buchta Christian; Dimitriadou Evgenia;
展开▼
作者单位

展开▼
年度 1998
总页数
原文格式 PDF
正文语种 {"code":"it","name":"Italian","id":21}
中图分类

相似文献

外文文献
中文文献
专利

1. Algorithms for finding attribute value group for binary segmentation of categorical databases [J] . Morimoto Y., Fukuda T., Tokuyama T. IEEE Transactions on Knowledge and Data Engineering . 2002,第6期

机译：寻找用于分类数据库二进制分割的属性值组的算法
2. A Research paper: An ASCII value based data encryption algorithm and its comparison with other symmetric data encryption algorithms [J] . Akanksha Mathur International Journal on Computer Science and Engineering . 2012,第9期

机译：研究论文：基于ASCII值的数据加密算法及其与其他对称数据加密算法的比较
3. Customized M-clustering Algorithm Comparison with Clustering Algorithms in Data Mining with the Case Study of Lead Generation Techniques [J] . E. Manigandan, V. Shanthi, Magesh Kasthuri Indian Journal of Science and Technology . 2016,第38期

机译：数据挖掘中定制M聚类算法与聚类算法的比较-以潜在客户生成技术为例
4. Behavioral Market Segmentation of Binary Guest Survey Data with Bagged Clustering [C] . Sara Dolnicar, Friedrich Leisch International Conference on Artificial Neural Networks . 2001

机译：BAGGET聚类二进制访客调查数据的行为市场分割
5. A Data Clustering Algorithm for Stratified Data Partitioning in Artificial Neural Network. [D] . Sahoo, Ajit Kumar. 2011

机译：人工神经网络中数据分层的数据聚类算法。
6. Data from multimodal functions based on an array of photovoltaic modules and an approximation with artificial neural networks as a scenario for testing optimization algorithms [O] . Carlos Robles-Algarín, Diego Restrepo-Leal, Adalberto Ospino Castro 2019

机译：来自基于光伏模块阵列的多峰函数的数据以及人工神经网络的近似值作为测试优化算法的场景
7. A comparison of several cluster algorithms on artificial binary data Part 2. Scenarios from travel market segmentation. Part 2 (Addition to Working Paper No. 7). [O] . Dolnicar Sara, Leisch Friedrich, Steiner Gottfried, 1998

机译：几种基于人工二进制数据的聚类算法的比较第2部分。来自旅游市场细分的场景。第2部分（第7号工作文件的补充）。
8. Comparison of Income, Expenditures, and Home Market Value Distributions Using Luxembourg Income Study Data from the 1990's Working paper [R] . Sierminska, E., Garner, T. I. 2005

机译：使用1990年工作文件中的卢森堡收入研究数据比较收入，支出和本地市场价值分布

A comparison of several cluster algorithms on artificial binary data Part 1. Scenarios from travel market segmentation Part 2: Working Paper 19.

摘要

著录项

相似文献

相关主题

期刊订阅