(More) advanced spam detection features

Abstract

The present invention involves a system and method that facilitate extracting data from messages for spam filtering. The extracted data can be in the form of features, which can be employed in connection with machine learning systems to build improved filters. Data associated with the subject line, timestamps, and the message body can be extracted and employed to generate one or more features. In particular, subject lines and message bodies can be examined for consecutive, repeating characters, blobs, the association or distance between such characters, blobs and non-blob portions of the message. The values or counts obtained can be broken down into one or more ranges corresponding to a degree of spaminess. Presence and type of attachments to messages, percentage of non-white-space and non-numeric characters of a message, and determining message delivery times can be used to identify spam. A time-based delta can be computed to facilitate determining the delivery time.
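A few of the features named in the abstract can be sketched in Python. This is a minimal illustration under stated assumptions, not the patented method: the function names, the bucket edges, and the choice of two RFC 2822 date strings as the inputs to the time delta are all hypothetical, and blob detection is left out because the abstract does not define it precisely.

```python
import re
from email.utils import parsedate_to_datetime
from typing import Sequence


def longest_repeat_run(text: str) -> int:
    """Length of the longest run of one consecutive, repeating character."""
    longest = 0
    # (.)\1* matches a character followed by zero or more copies of itself.
    for match in re.finditer(r"(.)\1*", text, flags=re.DOTALL):
        longest = max(longest, len(match.group(0)))
    return longest


def non_space_non_digit_pct(text: str) -> float:
    """Percentage of characters that are neither white space nor digits.

    Assumption: this is one plausible reading of the abstract's wording.
    """
    if not text:
        return 0.0
    kept = sum(1 for ch in text if not ch.isspace() and not ch.isdigit())
    return 100.0 * kept / len(text)


def bucket(value: float, edges: Sequence[float]) -> int:
    """Index of the range a value falls into; edges must be ascending."""
    return sum(1 for edge in edges if value >= edge)


def delivery_delta_seconds(sent: str, received: str) -> float:
    """Time-based delta between two RFC 2822 timestamps, in seconds."""
    delta = parsedate_to_datetime(received) - parsedate_to_datetime(sent)
    return delta.total_seconds()


def extract_features(subject: str, body: str, sent: str, received: str) -> dict:
    """Bucketed features for a downstream learner (edges are illustrative)."""
    return {
        "subject_repeat_bucket": bucket(longest_repeat_run(subject), [3, 6, 10]),
        "body_char_pct_bucket": bucket(non_space_non_digit_pct(body), [25, 50, 75]),
        "delivery_delta_bucket": bucket(
            delivery_delta_seconds(sent, received), [60, 3600, 86400]
        ),
    }
```

Each raw count or value is mapped to a range index rather than passed through directly, mirroring the abstract's statement that values can be broken down into ranges corresponding to a degree of spaminess. A real filter would learn or tune those ranges from labeled mail.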

Bibliographic details

Publication number: US8214438B2

Publication date: 2012-07-03

Original format: PDF
Applicant/assignee: JOHN D. MEHR; NATHAN D. HOWELL; MICAH C. RUPERSBURG

Application number: US20040790574
Inventors: JOHN D. MEHR; NATHAN D. HOWELL; MICAH C. RUPERSBURG

Filing date: 2004-03-01
Classification: G06F15/16
Country: US