首页> 外文期刊>ACM Transactions on Internet Technology >Behavior-Based Modeling and Its Application to Email Analysis
【24h】

Behavior-Based Modeling and Its Application to Email Analysis

机译:基于行为的建模及其在电子邮件分析中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

The Email Mining Toolkit (EMT) is a data mining system that computes behavior profiles or models of user email accounts. These models may be used for a multitude of tasks including forensic analyses and detection tasks of value to law enforcement and intelligence agencies, as well for as other typical tasks such as virus and spam detection. To demonstrate the power of the methods, we focus on the application of these models to detect the early onset of a viral propagation without "content-based" (or signature-based) analysis in common use in virus scanners. We present several experiments using real email from 15 users with injected simulated viral emails and describe how the combination of different behavior models improves overall detection rates. The performance results vary depending upon parameter settings, approaching 99% true positive (TP) (percentage of viral emails caught) in general cases and with 0.38% false positive (FP) (percentage of emails with attachments that are mislabeled as viral). The models used for this study are based upon volume and velocity statistics of a user's email rate and an analysis of the user's (social) cliques revealed in the person's email behavior. We show by way of simulation that virus propagations are detectable since viruses may emit emails at rates different than human behavior suggests is normal, and email is directed to groups of recipients in ways that violate the users' typical communications with their social groups.
机译:电子邮件挖掘工具包(EMT)是一个数据挖掘系统,用于计算用户电子邮件帐户的行为配置文件或模型。这些模型可用于多种任务,包括法务分析和对执法和情报机构有价值的检测任务,以及其他典型任务,例如病毒和垃圾邮件检测。为了证明这些方法的强大功能,我们将重点放在这些模型的应用上,以检测病毒传播的早期发作,而无需使用病毒扫描程序中常用的“基于内容”(或基于特征)的分析。我们提出了几个实验,这些实验使用了来自15个用户的真实电子邮件以及注入的模拟病毒电子邮件,并描述了不同行为模型的组合如何提高整体检测率。性能结果因参数设置而异,一般情况下接近99%的真阳性(TP)(捕获的病毒电子邮件的百分比),而0.38%的假阳性(FP)(带有错误标签为附件的电子邮件的百分比)。本研究使用的模型基于用户电子邮件速率的数量和速度统计数据以及对用户电子邮件行为中揭示的用户(社会)集团的分析。通过仿真显示,由于病毒可能以与人类行为不同的速率发送电子邮件,因此病毒传播是可以检测到的,并且将电子邮件定向到收件人组的方式违反了用户与社交团体的典型通信方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号