首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery >A New Approach for Cluster Detection for Large Datasets with High Dimensionality
【24h】

A New Approach for Cluster Detection for Large Datasets with High Dimensionality

机译:具有高维数的大型数据集的集群检测方法

获取原文

摘要

The study of the use of computers through human computer interfaces (HCI) is essential to improve the productivity in any computer application environment. HCI analysts use a number of techniques to build models that are faithful to actual computer use. A key technique is through eye tracking, in which the region of the screen being examined is recorded in order to determine key areas of use. Clustering techniques allow these regions to be grouped to help facilitate usability analysis. Historically, approaches such as the Expectation Maximization (EM) and K-Means algorithm have performed well. Unfortunately, these approaches require the number of clusters k to be known beforehand -in many real world situations, this hampers the effectiveness of the analysis of the data. We propose a novel algorithm that is well suited for cluster discovery for HCI data; we do not require the number of clusters to be specified a priori and our approach scales very well for both large datasets and high dimensionality. Experiments have demonstrated that our approach works well for real data from HCI applications.
机译:通过人机接口使用计算机(HCI)的研究对于提高任何计算机应用环境中的生产率至关重要。 HCI分析师使用许多技术来构建忠实于实际计算机使用的模型。键技术通过眼睛跟踪,记录被检查的屏幕区域以确定使用关键的使用区域。聚类技术允许分组这些区域以帮助促进可用性分析。从历史上看,诸如期望最大化(EM)和K均值算法的方法表现良好。遗憾的是,这些方法需要预先知道的群集K的数量 - 在许多真实世界情况中,这妨碍了数据分析的有效性。我们提出了一种新颖的算法,非常适合为HCI数据进行聚类发现;我们不需要指定群集数量先验,并且我们的方法对于大型数据集和高维度来说非常好。实验表明,我们的方法适用于来自HCI应用的实际数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号