首页> 外文会议>IEEE International Conference on Collaboration and Internet Computing >Mining the Best Observational Window to Model Social Phenomena
【24h】

Mining the Best Observational Window to Model Social Phenomena

机译:挖掘建立社会现象模型的最佳观察窗口

获取原文

摘要

The structure and behavior of organizations can be learned by mining the event logs of the information systems they manage. This supports numerous applications, such as inferring the structure of social relations, uncovering implicit workflows, and detecting illicit behavior. However, to date, no clear guidelines regarding how to select an appropriate time period to perform organizational modeling have been articulated. This is a significant concern because an inaccurately defined period can lead to incorrect models and poor performance in data-driven applications. In this paper, we introduce a data-driven approach to infer the optimal time period for organizational modeling. Our approach 1) represents the system as a social network, 2) decomposes it into its respective principal components, and 3) optimizes the signal-to-noise ratio over varying temporal observation windows. In doing so, we minimize the variance in the organizational structure while maximizing its patterns. We assess the capability of this approach using an anomaly detection scenario, which is based on the patterns learned from the interactions documented in audit logs. The classification performance of two known algorithms is investigated over a range of time periods in two representative datasets. First, we use the electronic health record access logs from Northwestern Memorial Hospital to demonstrate that our framework detects a period that coincides with the optimal performance of the anomaly detection algorithms. Second, we assess the generalizability of the framework through an analysis with a less clearly defined organization, in the form of the social network inferred from the DBLP co-authorship dataset. The results with this data further illustrate that our framework can discover the optimal time period in the context of a more loosely organized group.
机译:可以通过挖掘其管理的信息系统的事件日志来了解组织的结构和行为。这支持许多应用程序,例如推断社会关系的结构,发现隐式工作流以及检测非法行为。但是,迄今为止,尚未阐明有关如何选择合适的时间段来执行组织建模的明确指南。这是一个非常重要的问题,因为定义不正确的时间段可能导致数据驱动的应用程序中的模型不正确和性能不佳。在本文中,我们介绍了一种数据驱动的方法来推断组织建模的最佳时间段。我们的方法1)将系统表示为社交网络,2)将其分解成其各自的主要组件,并且3)在变化的时间观察窗口上优化信噪比。这样,我们可以最大程度地减少组织结构的差异,同时最大程度地提高组织结构的模式。我们使用异常检测方案评估这种方法的能力,该方案基于从审计日志中记录的交互中获悉的模式。在两个代表性数据集中的一段时间范围内研究了两种已知算法的分类性能。首先,我们使用了西北纪念医院的电子健康记录访问日志来证明我们的框架检测到的时间与异常检测算法的最佳性能相吻合。其次,我们通过对定义不那么明确的组织的分析,通过从DBLP合著者数据集推断出的社交网络的形式,评估了该框架的可推广性。这些数据的结果进一步说明,我们的框架可以在组织更松散的团队的背景下发现最佳时间段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号