首页> 外国专利> (More) advanced spam detection features

(More) advanced spam detection features

机译:(更多)高级垃圾邮件检测功能

摘要

The present invention involves a system and method that facilitate extracting data from messages for spam filtering. The extracted data can be in the form of features, which can be employed in connection with machine learning systems to build improved filters. Data associated with the subject line, timestamps, and the message body can be extracted and employed to generate one or more features. In particular, subject lines and message bodies can be examined for consecutive, repeating characters, blobs, the association or distance between such characters, blobs and non-blob portions of the message. The values or counts obtained can be broken down into one or more ranges corresponding to a degree of spaminess. Presence and type of attachments to messages, percentage of non-white-space and non-numeric characters of a message, and determining message delivery times can be used to identify spam. A time-based delta can be computed to facilitate determining the delivery time.
机译:本发明涉及有助于从消息中提取数据以进行垃圾邮件过滤的系统和方法。提取的数据可以采用特征的形式,可以与机器学习系统结合使用以构建改进的过滤器。可以提取与主题行,时间戳和消息正文关联的数据,并将其用于生成一个或多个功能。特别是,可以检查主题行和消息正文,以查找连续的重复字符,斑点,消息的此类字符,斑点和非斑点部分之间的关​​联或距离。可以将获得的值或计数分解为一个或多个对应于spaminess程度的范围。邮件附件的存在和类型,邮件的非空白字符和非数字字符的百分比以及确定邮件的传递时间可用于识别垃圾邮件。可以计算基于时间的增量,以便于确定交货时间。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号