Bayesian surety check to reduce false positives in filtering of content in non-trained languages

Abstract

A Bayesian spam filter (101) determines an amount of content in incoming email messages (103) that it knows from training. If the filter is familiar with a threshold (107) amount of the content, then the filter (101) proceeds to classify the email message as being spam (102) or legitimate (104). On the other hand, if not enough of the words in the email are known to the filter (101) from training, then the filter (101) cannot accurately determine whether or not the message is spam. In this case, the filter classifies the message as being of type unknown (106). Different threshold (107) metrics can be used, such as the percentage of known words, and the percentage of maximum correction value used during processing. This greatly improves the processing of emails in languages on which the filter was not trained.
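
The known-word check described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the class name `SuretyBayesFilter`, the `known_threshold` parameter, whitespace tokenization, equal class priors, and add-one smoothing are all assumptions made for illustration, and only the "percentage of known words" metric is shown (the correction-value metric is not modelled).

```python
import math
from collections import Counter


class SuretyBayesFilter:
    """Toy Bayesian spam filter with a known-word surety check (illustrative only)."""

    def __init__(self, known_threshold=0.5):
        # Hypothetical parameter: minimum fraction of a message's tokens
        # that must have been seen in training before we classify it.
        self.known_threshold = known_threshold
        self.spam_counts = Counter()
        self.ham_counts = Counter()
        self.spam_total = 0
        self.ham_total = 0

    def train(self, text, is_spam):
        # Assumed tokenization: lowercase, split on whitespace.
        tokens = text.lower().split()
        if is_spam:
            self.spam_counts.update(tokens)
            self.spam_total += len(tokens)
        else:
            self.ham_counts.update(tokens)
            self.ham_total += len(tokens)

    def is_known(self, token):
        return token in self.spam_counts or token in self.ham_counts

    def known_fraction(self, tokens):
        # Share of tokens the filter has seen in training (0.0 for empty input).
        if not tokens:
            return 0.0
        return sum(1 for t in tokens if self.is_known(t)) / len(tokens)

    def classify(self, text):
        tokens = text.lower().split()
        # Surety check: too little familiar content means no reliable verdict.
        if self.known_fraction(tokens) < self.known_threshold:
            return "unknown"
        vocab = len(set(self.spam_counts) | set(self.ham_counts))
        log_spam = 0.0
        log_ham = 0.0
        for t in tokens:
            if not self.is_known(t):
                continue  # unseen tokens carry no evidence in this sketch
            # Add-one smoothed per-class token likelihoods, equal priors assumed.
            log_spam += math.log((self.spam_counts[t] + 1) / (self.spam_total + vocab))
            log_ham += math.log((self.ham_counts[t] + 1) / (self.ham_total + vocab))
        return "spam" if log_spam > log_ham else "legitimate"
```

With a filter trained only on English samples, a message written entirely in another language shares no tokens with the training data, so its known fraction falls below the threshold and `classify` returns "unknown" instead of forcing a spam or legitimate verdict.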

Bibliographic details

Publication number: EP2028806A1
Publication date: 2009-02-25

Original format: PDF
Applicant/patentee: SYMANTEC CORPORATION

Application number: EP20080014069
Inventor: COOLEY SHAUN

Filing date: 2008-08-06
Classification: H04L12/58
Country: EP
Date added to database: 2022-08-21 19:16:10
