首页> 外文会议>European Conference on Artificial Intelligence >GNUsmail: Open Framework for On-line Email Classification
【24h】

GNUsmail: Open Framework for On-line Email Classification

机译:Gnusmail:开放式电子邮件分类框架

获取原文

摘要

Real-time classification of massive email data is a challenging task that presents its own particular difficulties. Since email data presents an important temporal component, several problems arise: emails arrive continuously, and the criteria used to classify those emails can change, so the learning algorithms have to be able to deal with concept drift. Our problem is more general than spam detection, which has received much more attention in the literature. In this paper we present GNUsmail, an open-source extensible framework for email classification, which structure supports incremental and on-line learning. This framework enables the incorporation of algorithms developed by other researchers, such as those included in WEKA and MOA. We evaluate this framework, characterized by two overlapping phases (pre-processing and learning), using the ENRON dataset, and we compare the results achieved by WEKA and MOA algorithms.
机译:大规模电子邮件数据的实时分类是一个具有挑战性的任务,呈现了自己特殊的困难。由于电子邮件数据具有重要的时间组件,因此出现了几个问题:电子邮件持续到达,并且用于对这些电子邮件进行分类的标准可以改变,因此学习算法必须能够处理概念漂移。我们的问题比垃圾邮件检测更为一般,这在文献中得到了更多的关注。在本文中,我们呈现Gnusmail,一个用于电子邮件分类的开源可扩展框架,该结构支持增量和在线学习。该框架使得能够加入由其他研究人员开发的算法,例如Weka和MoA中包含的算法。我们评估此框架,其特征在于使用enron数据集的两个重叠阶段(预处理和学习),并比较Weka和MoA算法所实现的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号