Training procedure for N-gram-based statistical content classification

Abstract

A training procedure for n-gram-based statistical content classification is disclosed. In one embodiment, a subset of n-grams is selected from a set of n-grams, each n-gram being a sequence of N bytes, where N is an integer. A statistical content classification model is then generated based on occurrences of the selected n-grams in a set of training documents and a set of validation documents. The statistical content classification model is provided to content filters to classify content.
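The steps in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how n-grams are selected, what kind of statistical model is built, or how the validation documents are used, so everything specific here is an assumption made for illustration: selection by document frequency, a Bernoulli naive Bayes model with Laplace smoothing, and validation accuracy used to choose how many n-grams to keep. All function names (`byte_ngrams`, `select_ngrams`, `train`, `classify`, `build_model`) are hypothetical and do not come from the patent.

```python
import math
from collections import Counter


def byte_ngrams(data: bytes, n: int) -> set:
    """Return the distinct n-byte sequences that occur in data."""
    return {data[i:i + n] for i in range(len(data) - n + 1)}


def select_ngrams(docs, n, top_k):
    """Keep the top_k n-grams by document frequency (assumed criterion)."""
    df = Counter()
    for data, _label in docs:
        df.update(byte_ngrams(data, n))  # each n-gram counted once per document
    # Sort by descending frequency, then by byte value, for a deterministic order.
    ranked = sorted(df.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _count in ranked[:top_k]]


def train(docs, selected, n):
    """Fit a Bernoulli naive Bayes model over presence of the selected n-grams."""
    labels = sorted({label for _data, label in docs})
    doc_count = Counter(label for _data, label in docs)
    present = {label: Counter() for label in labels}
    wanted = set(selected)
    for data, label in docs:
        present[label].update(byte_ngrams(data, n) & wanted)
    model = {"n": n, "selected": selected, "prior": {}, "p": {}}
    for label in labels:
        model["prior"][label] = math.log(doc_count[label] / len(docs))
        # Laplace smoothing keeps every probability strictly between 0 and 1.
        model["p"][label] = {
            g: (present[label][g] + 1) / (doc_count[label] + 2) for g in selected
        }
    return model


def classify(model, data: bytes):
    """Return the label with the highest log-probability for data."""
    grams = byte_ngrams(data, model["n"])
    best_label, best_score = None, -math.inf
    for label, prior in model["prior"].items():
        score = prior
        for g in model["selected"]:
            p = model["p"][label][g]
            score += math.log(p if g in grams else 1.0 - p)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def accuracy(model, docs):
    """Fraction of (data, label) pairs that the model classifies correctly."""
    return sum(classify(model, d) == label for d, label in docs) / len(docs)


def build_model(train_docs, val_docs, n, candidate_ks):
    """Pick the n-gram count that scores best on the validation documents."""
    best = None
    for k in candidate_ks:
        model = train(train_docs, select_ngrams(train_docs, n, k), n)
        acc = accuracy(model, val_docs)
        if best is None or acc > best[0]:
            best = (acc, model)
    return best


if __name__ == "__main__":
    # Toy byte strings standing in for real documents.
    training = [(b"aaab", "A"), (b"aabb", "A"), (b"cccd", "B"), (b"ccdd", "B")]
    validation = [(b"aab", "A"), (b"ccd", "B")]
    acc, model = build_model(training, validation, n=2, candidate_ks=[2, 4])
    print("validation accuracy:", acc)
    print("selected n-grams:", model["selected"])
    print("label for b'aab':", classify(model, b"aab"))
```

Working on raw bytes rather than decoded characters means the same code applies to content in any encoding or language, which is consistent with the abstract's definition of an n-gram as a sequence of N bytes. The resulting `model` dictionary plays the role of the classification model that would be handed to a content filter.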

Bibliographic data

Publication number: US7917522B1

Patent type:
Publication date: 2011-03-29

Original format: PDF
Applicant/Assignee: THOMAS E. RAFFILL; SHUNHUI ZHU; ROMAN YANOVSKY; BORIS YANOVSKY; JOHN GMUENDER

Application number: US20100822439
Inventors: BORIS YANOVSKY; SHUNHUI ZHU; ROMAN YANOVSKY; THOMAS E. RAFFILL; JOHN GMUENDER

Filing date: 2010-06-24
Classification: G06F7/00; G06F17/30
Country: US
Added to database: 2022-08-21 18:08:01
