首页> 外国专利> Document title tree construction method, device, electronic equipment, storage medium, and program

Document title tree construction method, device, electronic equipment, storage medium, and program

机译:文档标题树施工方法,设备,电子设备,存储介质和程序

摘要

Problem to be solved: to provide a method for identifying a title of an unstructured document and a method of constructing a document title tree capable of constructing a document title tree, an electronic equipment, a storage medium and a program.How to build a document title treeRule matching the text features of each paragraph in a document to be processed based on predefined rules and paragraph features in a predefined rule, andReturnsDetermining the paragraph level of each paragraph in the document to be processed based on the result of the rule matchingReturnsDetermining the paragraph level of each paragraph in a document to be processed using a machine learning modelBuilding the document title tree for the document to be processed based on the paragraph level of each paragraph.Diagram
机译:要解决的问题:提供一种用于识别非结构化文件的标题的方法和构建能够构建文档标题树,电子设备,存储介质和程序的文档标题树的方法。如何构建符合要根据预定义规则和段落特征在预定义规则中进行处理的文档中的文本标题交易,并在预定义规则中的段落特征,并将每个段落的段落级别基于要根据的结果进行处理该规则将在要使用机器学习模型的文件中进行处理的文档中每个段落的段落级别,该规则使用ModelsBuildBuild为每个段落的段落级别进行处理的文件来处理文档标题树。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号