首页> 外文会议>Document Recognition III >Document image decoding in the UC Berkeley Digital Library
【24h】

Document image decoding in the UC Berkeley Digital Library

机译:加州大学伯克利分校数字图书馆中的文档图像解码

获取原文

摘要

Abstract: The UC Berkeley Environmental Digital Library Project is one of six university-led projects that were initiated in the fall of 1994 as part of a four-year digital library initiative sponsored by the NSF, NASA, and ARPA. The Berkeley project is particularly interesting from a document image analysis perspective because its testbed collection consists almost entirely of scanned materials. As a result, the Berkeley project is making extensive use of document recognition and other image analysis technology to provide content-based access to the collection. The Document Image Decoding (DID) group at Xerox PARC is a member of the Berkeley team and is investigating the application of DID techniques to providing high-quality (accurate and properly structured) transcriptions of scanned documents in the collection. This paper briefly describes the Berkeley project, discusses some of its recognition requirements and presents examples of online structured documents created using DID technology. !10
机译:摘要:加州大学伯克利分校的环境数字图书馆项目是由大学领导的六个项目之一,该项目是由NSF,NASA和ARPA赞助的四年数字图书馆计划的一部分,于1994年秋季启动。从文档图像分析的角度来看,伯克利项目特别有趣,因为它的测试台集合几乎完全由扫描的材料组成。因此,伯克利项目正在广泛使用文档识别和其他图像分析技术,以提供基于内容的馆藏访问。 Xerox PARC的文档图像解码(DID)小组是伯克利团队的成员,并且正在研究DID技术的应用,以提供馆藏中已扫描文档的高质量(准确且结构正确的)转录。本文简要介绍了伯克利项目,讨论了其识别要求,并提供了使用DID技术创建的在线结构化文档的示例。 !10

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号