首页> 外文会议>Second International Conference on Machine Learning and Computing (ICMLC 2010) >A Survey on Text Classification Techniques for E-mail Filtering
【24h】

A Survey on Text Classification Techniques for E-mail Filtering

机译:电子邮件过滤的文本分类技术概述

获取原文
获取原文并翻译 | 示例

摘要

The continuing explosive growth of textual content within the World Wide Web has given rise to the need for sophisticated Text Classification (TC) techniques that combine efficiency with high quality of results. E-mail filtering is one application that has the potential to affect every user of the internet. Even though a large body of research has delved into this problem, there is a paucity of survey that indicates trends and directions. This paper attempts to categorize the prevalent popular techniques for classifying email as spam or legitimate and suggest possible techniques to fill in the lacunae. Our findings suggest that context-based email filtering has the most potential in improving quality by learning various contexts such as n-gram phrases, linguistic constructs or usersȁ9; profile based context to tailor his/her filtering scheme.
机译:万维网内文本内容的持续爆炸性增长,引起了对复杂文本分类(TC)技术的需求,这些技术将效率与高质量结果结合在一起。电子邮件过滤是一种有可能影响互联网每个用户的应用程序。即使进行了大量的研究来研究这个问题,也缺乏指示趋势和方向的调查。本文尝试将将电子邮件分类为垃圾邮件或合法邮件的流行技术进行分类,并提出可能的技术来填补空白。我们的发现表明,通过学习各种上下文(例如n-gram短语,语言结构或用户),基于上下文的电子邮件过滤在提高质量方面最有潜力[9]。基于配置文件的上下文来定制他/她的过滤方案。

著录项

  • 来源
  • 会议地点 Bangalore(IN);Bangalore(IN)
  • 作者单位

    Issue Date: 9-11 Feb. 2010rnrntOn page(s): rnt32rnttrn- 36rnrnrnLocation: Bangalore, IndiarnrnPrint ISBN: 978-1-4244-6006-9rnrnrnrnttrnDigital Object Identifier: href='http://dx.doi.org/10.1109/ICMLC.2010.61' target='_blank'>10.1109/ICMLC.2010.61 rnrnDate of Current Version: trnrnt2010-05-06 14:33:58.0rnrnt rntt class="body-text">rntname="Abstract">>Abstractrn>The continuing explosive growth of textual content within the World Wide Web has given rise to the need for sophisticated Text Classification (TC) techniques that combine efficiency with high quality of results. E-mail filtering is one application that has the potential to affect every user of the internet. Even though a large body of research has delved into this problem;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动推理、机器学习;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号