首页> 外文期刊>Data & Knowledge Engineering >An approach for temporal analysis of email data based on segmentation
【24h】

An approach for temporal analysis of email data based on segmentation

机译:基于分割的电子邮件数据时态分析方法

获取原文
获取原文并翻译 | 示例
           

摘要

Many kinds of information are hidden in email data, such as the information being exchanged, the time of exchange, and the user IDs participating in the exchange. Analyzing the email data can reveal valuable information about the social networks of a single user or multiple users, the topics being discussed, and so on. In this paper, we describe a novel approach for temporally analyzing the communication patterns embedded in email data based on time series segmentation. The approach computes egocentric communication patterns of a single user, as well as sociocentric communication patterns involving multiple users. Time series segmentation is used to uncover patterns that may span multiple time points and to study how these patterns change over time. To find egocentric patterns, the email communication of a user is represented as an item-set time series. An optimal segmentation of the item-set time series is constructed, from which patterns are extracted. To find sociocentric patterns, the email data is represented as an item-setgroup time series Patterns involving multiple users are then extracted from an optimal segmentation of the item-setgroup time series. The proposed approach is applied to the Enron email data set, which produced very promising results.
机译:电子邮件数据中隐藏了许多信息,例如正在交换的信息,交换的时间以及参与交换的用户ID。分析电子邮件数据可以揭示有关单个用户或多个用户的社交网络,正在讨论的主题等有价值的信息。在本文中,我们描述了一种基于时间序列分割来临时分析嵌入在电子邮件数据中的通信模式的新颖方法。该方法计算单个用户的以自我为中心的通信模式,以及涉及多个用户的以社会为中心的通信模式。时间序列分段用于发现可能跨越多个时间点的模式,并研究这些模式如何随时间变化。为了找到以自我为中心的模式,将用户的电子邮件通信表示为项目设置的时间序列。构建项目设置时间序列的最佳分割,从中提取模式。为了找到以社会为中心的模式,将电子邮件数据表示为项目组时间序列。然后,从项目组时间序列的最佳分段中提取涉及多个用户的模式。提议的方法应用于Enron电子邮件数据集,该数据集产生了非常可喜的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号