首页> 外文会议>International Conference on Data Analytics >Automating Clustering Analysis of Ivory Coast Mobile Phone Data: Deriving Decision Support Models for Community Detection and Sensemaking
【24h】

Automating Clustering Analysis of Ivory Coast Mobile Phone Data: Deriving Decision Support Models for Community Detection and Sensemaking

机译:自动化象牙海岸移动电话数据的聚类分析:派生决策支持模型进行社区检测和传感

获取原文

摘要

Sensemaking involves numerous levels of processing and logic in order to achieve automated decision support. Many of these concepts derive from the realm of pattern recognition. The data under consideration frequently is observed in a noisy environment and so one of the first steps involves preprocessing the data to suppress noise and isolate the data signal. Patterns within the data are often used to improve signal detection and aid identification of the data in the quest to produce actionable information. A critical step of making sense from raw or partially processed data and other aspects of decision support is to organize information, which frequently involves grouping, partitioning, or clustering objects. However, there is typically an assumption that structure exists within the data, and the number of clusters is a required parameter for many of the clustering algorithms. A common approach to determine the best number of clusters is to iterate across a set of potential values the for number of clusters and evaluate the quality of the resulting clusters using some metric. In this paper, we present an automated approach to detect structure and improve automation of clustering algorithm parameters. We apply our approach to analyze a complex, dynamic multiple edge set network that was used to model call data from the Ivory Coast compiled from France Telecom/Orange anonymized call records over a 5 month period.
机译:传感涉及许多水平的处理和逻辑,以实现自动决策支持。许多这些概念来自模式识别的领域。在嘈杂的环境中经常观察所考虑的数据,因此第一步之一包括预处理数据以抑制噪声并隔离数据信号。数据内的模式通常用于改善寻址中的信号检测和辅助识别,以产生可操作的信息。从原始或部分处理的数据和决策支持的其他方面进行意义的关键步骤是组织频繁涉及分组,分区或群集对象的信息。然而,通常存在假设数据在数据内存存在,并且群集的数量是许多聚类算法的必需参数。确定最佳数量的群集的常见方法是跨越一组潜在值,用于使用一些度量来评估所得集群的质量。在本文中,我们提出了一种自动化方法来检测结构和改进聚类算法参数的自动化。我们应用我们的方法来分析一个复杂的动态多边集网络,用于在5个月内从法国电信/橙色匿名呼叫记录中编写的象牙海岸模拟呼叫数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号