首页> 外国专利> Method and arrangement for prior annotation of documents and for creating a summary based on document image data

Method and arrangement for prior annotation of documents and for creating a summary based on document image data

机译:用于文档的先前注释以及用于基于文档图像数据创建摘要的方法和装置

摘要

A target document in a document processing system is annotated on the basis of annotations made previously to a source document. A source document (either a scanned image of a paper document or an electronic document) is annotated by a user to identify words or phrases of interest. The annotated words are extracted for use as keywords or phrases to search in future document. When a target document is processed, the target document is searched to locate any of the keywords of interest to the user. If any of the keywords are located, electronic annotations are applied to these in the target document for display or printing out and/or registered as keywords to the project. The automatically annotated words or phrases enable the user to locate regions of interest more quickly. A summary of a captured document image is produced on the basis of detected annotations made to a document prior to image capture. The scanned (or otherwise captured) image is processed to detect annotations made to the document prior to scanning. The detected annotations can be used to identify features, or text, for use to summarize that document. Additionally, or alternatively, the detected annotations in one document can be used to identify features, or text, for use to summarize a different document. The summary may be displayed in expandable detail levels. IMAGE
机译:基于先前对源文档进行的注释,对文档处理系统中的目标文档进行注释。用户注释源文档(纸质文档或电子文档的扫描图像)以标识感兴趣的单词或短语。提取带注释的单词,以用作关键字或短语以在将来的文档中搜索。在处理目标文档时,将搜索目标文档以找到用户感兴趣的任何关键字。如果找到任何关键字,则将电子注释应用于目标文档中的这些注释,以显示或打印和/或注册为项目的关键字。自动注释的单词或短语使用户可以更快地找到感兴趣的区域。根据在图像捕获之前检测到的对文档所做的注释,生成捕获的文档图像的摘要。在扫描之前,对扫描的(或以其他方式捕获的)图像进行处理以检测对文档所做的注释。检测到的注释可用于标识要素或文本,以用于汇总该文档。另外地或可替代地,在一个文档中检测到的注释可以用于识别特征或文本,以用于总结不同的文档。该摘要可以以可扩展的详细程度显示。 <图像>

著录项

  • 公开/公告号DE60217450T2

    专利类型

  • 公开/公告日2007-10-11

    原文格式PDF

  • 申请/专利权人 XEROX CORP.;

    申请/专利号DE2002617450T

  • 发明设计人

    申请日2002-10-11

  • 分类号G06F17/24;G06F17/30;

  • 国家 DE

  • 入库时间 2022-08-21 20:27:52

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号