首页> 外国专利> METHOD FOR AUTOMATIC CLASSIFICATION OF FORMALIZED TEXT DOCUMENTS AND AUTHORIZED USERS OF ELECTRONIC DOCUMENT MANAGEMENT SYSTEM

METHOD FOR AUTOMATIC CLASSIFICATION OF FORMALIZED TEXT DOCUMENTS AND AUTHORIZED USERS OF ELECTRONIC DOCUMENT MANAGEMENT SYSTEM

机译:电子文档管理系统形式化文本文档自动分类和授权用户的方法

摘要

FIELD: computer equipment.;SUBSTANCE: method includes: extraction of metadata and informative part of document, conversion of document from storage format into text, conversion of words into word forms, discarding non-significant words, counting word weights, generating a set of classification features, wherein at the training step, a system of predicates for identifying the confidentiality mark of the document is generated based on the set of classified documents; at the document classification step, based on the characteristics, a decision is made on the relevance of the document of each of the confidentiality marks, at the training stage, based on the set of manually classified authorized users, forming a predicate identification system of their confidentiality mark, wherein on the basis of confidentiality marks of incoming documents and access rights of authorized users of system to these documents form a set of classification features.;EFFECT: automatic classification of formalized text documents and authorized users of electronic document management system according to confidentiality marks.;1 cl, 1 dwg, 1 tbl
机译:领域:计算机设备;实体:方法包括:提取元数据和文档的信息部分,将文档从存储格式转换为文本,将单词转换为单词形式,丢弃不重要的单词,计算单词权重,生成一组分类特征,其中在训练步骤中,基于分类文件的集合生成用于识别文件的机密标记的谓词系统;在文档分类步骤中,基于特征,在训练阶段,基于一组手动分类的授权用户,来确定每个机密标记的文档的相关性,从而形成他们的谓词识别系统机密性标记,其中基于传入文档的机密性标记和系统的授权用户对这些文档的访问权,形成一组分类功能。效果:根据机密标记。; 1 cl,1 dwg,1 tbl

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号