Training procedure for N-gram-based statistical content classification

Abstract

A training procedure for n-gram-based statistical content classification is disclosed. In one embodiment, a number of n-grams are selected from a set of n-grams, each n-gram being a sequence of N bytes, where N is an integer. A statistical content classification model is then generated based on occurrences of the selected n-grams in a set of training documents and a set of validation documents. The statistical content classification model is provided to a content filter to classify content.