
System and method for separation and classification of unstructured documents


Abstract

A classification system is provided that separates unclassified pages into unclassified, separated documents and classifies the separated documents. The classification system applies a page-level recognition model to the unclassified pages to recognize the logical boundaries between documents and, based on the logical boundaries, separates the pages into unclassified, separated documents. The classification system further applies a document-level recognition model to classify the separated documents.
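
The two-stage flow described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the abstract does not say how either recognition model works, so both are passed in as plain callables. The names `separate_pages`, `classify_documents`, `toy_starts_new_document`, and `toy_classify`, and the choice to represent a page as a text string, are assumptions made for this example.

```python
from typing import Callable, List, Sequence, Tuple

Page = str              # assumption: a page is represented by its extracted text
Document = List[Page]   # a separated document is an ordered run of pages


def separate_pages(
    pages: Sequence[Page],
    starts_new_document: Callable[[Page], bool],
) -> List[Document]:
    """Split a page stream at the boundaries flagged by a page-level model.

    `starts_new_document` stands in for the page-level recognition model:
    it returns True when a page is judged to begin a new document.
    """
    documents: List[Document] = []
    for index, page in enumerate(pages):
        # The first page always opens a document, whatever the model says.
        if index == 0 or starts_new_document(page):
            documents.append([page])
        else:
            documents[-1].append(page)
    return documents


def classify_documents(
    documents: Sequence[Document],
    classify: Callable[[Document], str],
) -> List[Tuple[str, Document]]:
    """Label each separated document with a document-level model.

    `classify` stands in for the document-level recognition model.
    """
    return [(classify(document), document) for document in documents]


# Toy stand-ins for the two models (hypothetical keyword rules, not real models).
def toy_starts_new_document(page: Page) -> bool:
    return page.startswith(("INVOICE", "CONTRACT"))


def toy_classify(document: Document) -> str:
    first_page = document[0]
    if first_page.startswith("INVOICE"):
        return "invoice"
    if first_page.startswith("CONTRACT"):
        return "contract"
    return "unknown"


if __name__ == "__main__":
    pages = ["INVOICE 1", "line items", "CONTRACT A", "terms", "signatures"]
    separated = separate_pages(pages, toy_starts_new_document)
    for label, document in classify_documents(separated, toy_classify):
        print(label, len(document))
```

Keeping separation and classification as two independent functions mirrors the abstract's split between a page-level model and a document-level model; in practice each callable would wrap a trained model rather than a keyword rule.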
