首页> 外文会议>Cybercrime and Trustworthy Computing Workshop >Malicious Spam Emails Developments and Authorship Attribution
【24h】

Malicious Spam Emails Developments and Authorship Attribution

机译:恶意垃圾邮件的发展和作者身份归属

获取原文

摘要

The Internet is a decentralized structure that offers speedy communication, has a global reach and provides anonymity, a characteristic invaluable for committing illegal activities. In parallel with the spread of the Internet, cybercrime has rapidly evolved from a relatively low volume crime to a common high volume crime. A typical example of such a crime is the spreading of spam emails, where the content of the email tries to entice the recipient to click a URL linking to a malicious Web site or downloading a malicious attachment. Analysts attempting to provide intelligence on spam activities quickly find that the volume of spam circulating daily is overwhelming; therefore, any intelligence gathered is representative of only a small sample, not of the global picture. While past studies have looked at automating some of these analyses using topic-based models, i.e. separating email clusters into groups with similar topics, our preliminary research investigates the usefulness of applying authorship-based models for this purpose. In the first phase, we clustered a set of spam emails using an authorship-based clustering algorithm. In the second phase, we analysed those clusters using a set of linguistic, structural and syntactic features. These analyses reveal that emails within each cluster were likely written by the same author, but that it is unlikely we have managed to group together all spam produced by each group. This problem of high purity with low recall, has been faced in past authorship research. While it is also a limitation of our research, the clusters themselves are still useful for the purposes of automating analysis, because they reduce the work needing to be performed. Our second phase revealed useful information on the group that can be utilized in future research for further analysis of such groups, for example, identifying further linkages behind spam campaigns.
机译:互联网是一种分散的结构,可提供快速的通信,具有全球影响力并提供匿名性,这是进行非法活动不可估量的特征。随着互联网的普及,网络犯罪已从相对较少的犯罪迅速发展为常见的大量犯罪。这种犯罪的典型示例是散布垃圾邮件,其中电子邮件的内容试图诱使收件人单击链接到恶意网站的URL或下载恶意附件。试图提供有关垃圾邮件活动情报的分析人员很快发现,每天传播的垃圾邮件数量非常庞大。因此,收集到的任何情报仅代表一小部分样本,并不代表全球情况。虽然过去的研究着眼于使用基于主题的模型来自动化其中的一些分析,即将电子邮件集群分为具有相似主题的组,但我们的初步研究调查了为此目的应用基于作者身份的模型的有用性。在第一阶段,我们使用基于作者身份的聚类算法对一组垃圾邮件进行聚类。在第二阶段,我们使用一组语言,结构和句法特征来分析这些类。这些分析表明,每个群集中的电子邮件很可能是由同一位作者撰写的,但是我们不太可能将每个组产生的所有垃圾邮件归为一类。在过去的作者研究中已经遇到了这种纯度高,召回率低的问题。尽管这也是我们研究的局限性,但集群本身仍可用于自动化分析,因为它们减少了需要执行的工作。我们的第二阶段揭示了有关该组的有用信息,这些信息可用于将来的研究中,以进一步分析此类组,例如,确定垃圾邮件活动背后的进一步联系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号