首页> 外文会议>ACM SIGKDD international conference on Knowledge discovery in data mining >Modeling and predicting personal information dissemination behavior
【24h】

Modeling and predicting personal information dissemination behavior

机译:建模和预测个人信息传播行为

获取原文

摘要

In this paper, we propose a new way to automatically model and predict human behavior of receiving and disseminating information by analyzing the contact and content of personal communications. A personal profile, called CommunityNet, is established for each individual based on a novel algorithm incorporating contact, content, and time information simultaneously. It can be used for personal social capital management. Clusters of CommunityNets provide a view of informal networks for organization management. Our new algorithm is developed based on the combination of dynamic algorithms in the social network field and the semantic content classification methods in the natural language processing and machine learning literatures. We tested CommunityNets on the Enron Email corpus and report experimental results including filtering, prediction, and recommendation capabilities. We show that the personal behavior and intention are somewhat predictable based on these models. For instance, "to whom a personis going to send a specific email" can be predicted by one's personal social network and content analysis. Experimental results show the prediction accuracy of the proposed adaptive algorithm is 58% better than the social network-based predictions, and is 75% better than an aggregated model based on Latent Dirichlet Allocation with social network enhancement. Two online demo systems we developed that allow interactive exploration of CommunityNet are also discussed.
机译:在本文中,我们提出了一种通过分析个人通信的联系和内容来自动建模和预测人类接收和传播信息的行为的新方法。基于同时包含联系方式,内容和时间信息的新颖算法,为每个人建立了一个称为CommunityNet的个人资料。它可以用于个人社会资本管理。 CommunityNet集群提供了用于组织管理的非正式网络的视图。我们的新算法是基于社交网络领域中的动态算法与自然语言处理和机器学习文献中的语义内容分类方法相结合而开发的。我们在Enron电子邮件语料库上测试了CommunityNets,并报告了包括过滤,预测和推荐功能在内的实验结果。我们显示,基于这些模型,个人行为和意图在某种程度上是可以预测的。例如,可以通过一个人的个人社交网络和内容分析来预测“某人将向其发送特定电子邮件”。实验结果表明,所提出的自适应算法的预测精度比基于社交网络的预测高58%,比基于潜在Dirichlet分配和社交网络增强的聚集模型的预测精度高75%。还讨论了我们开发的两个在线演示系统,这些系统允许交互式浏览CommunityNet。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号