DOCUMENT CLASSIFICATION PROGRAM, SERVER, AND METHOD BASED ON OUTLINE FEATURE OF DOCUMENT INFORMATION

Abstract

PROBLEM TO BE SOLVED: To provide a document classification program that classifies Web document information, such as illegal and harmful sites, at high speed without analyzing the text or image content of the document information.

SOLUTION: The document classification program causes a computer to function as: a target feature value extraction means for extracting target feature values, as k-dimensional vectors based on tag elements (m types), from the target document information to be analyzed; a feature value determination means for determining whether the target feature values corresponding to the target document information fall within a prescribed range of learned feature values obtained from a plurality of pieces of document information belonging to a specific category; and a category classification means for classifying target document information for which the feature value determination means returns true as belonging to the specific category.

COPYRIGHT: (C)2011, JPO&INPIT
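
The three means described in the abstract (extraction, range determination, classification) can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions, not the patented implementation: the abstract does not name the m tag types, does not say how the k-dimensional vector is derived from them, and does not define the "prescribed range". Here the tag list `TAG_TYPES`, the choice k = m (each tracked tag's share of all tags), the per-dimension min/max range with a `margin`, and all function names are hypothetical.

```python
from collections import Counter
from html.parser import HTMLParser

# Hypothetical choice of m = 5 tag types; the abstract does not name them.
TAG_TYPES = ("a", "img", "script", "iframe", "form")


class TagCounter(HTMLParser):
    """Counts start tags only; text and image content are never inspected."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        self.counts[tag] += 1


def extract_features(html):
    """Return a k-dimensional vector (assumed k = m): each tracked tag's share of all tags."""
    parser = TagCounter()
    parser.feed(html)
    parser.close()
    total = sum(parser.counts.values()) or 1  # avoid division by zero on tagless input
    return [parser.counts[t] / total for t in TAG_TYPES]


def learn_range(documents, margin=0.05):
    """Assumed 'prescribed range': per-dimension min/max over the learning set, widened by margin."""
    vectors = [extract_features(d) for d in documents]
    lows = [min(col) - margin for col in zip(*vectors)]
    highs = [max(col) + margin for col in zip(*vectors)]
    return lows, highs


def in_category(html, value_range):
    """Classify as in-category only if every feature lies inside the learned range."""
    lows, highs = value_range
    features = extract_features(html)
    return all(lo <= x <= hi for x, lo, hi in zip(features, lows, highs))
```

In use, `learn_range` would be called once on documents already known to belong to the specific category, and `in_category` would then be applied to each target document. Because only tag counts are computed, the cost per document is a single parsing pass, which is consistent with the abstract's goal of classifying quickly without analyzing content.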