首页> 外国专利> Document retrieval apparatus retrieving document data using calculated record identifier

Document retrieval apparatus retrieving document data using calculated record identifier

机译:文档检索设备使用计算出的记录标识符来检索文档数据

摘要

There is provided a document retrieval apparatus in which signatures can be easily extracted from document data, and false drop probability is reduced even for a long document so as to reduce a burden of eliminating the false drop. A processing unit converts the document data and the character string into character codes, respectively. The processing unit extracts signatures from each of the character codes, and calculates a record identifier of the document data to be stored based on a storing position of the document data in a record file. A data storing unit stores the document data to be registered in the record file, and stores the signature corresponding to the document data to be registered in a signature file. The signature is stored in a storing position in the signature file, the storing position being designated by the record identifier of corresponding document data stored in the record file. The processing unit retrieves the document data containing a character string identical to the character string to be searched for by referring to a record identifier calculated based on a storing position of the signature in a signature file.
机译:提供了一种文档检索设备,其中可以容易地从文档数据中提取签名,并且即使对于长文档,也减少了误丢的可能性,从而减轻了消除误丢的负担。处理单元将文档数据和字符串分别转换为字符代码。处理单元从每个字符代码中提取签名,并且基于文档数据在记录文件中的存储位置来计算要存储的文档数据的记录标识符。数据存储单元将要注册的文档数据存储在记录文件中,并且将与要注册的文档数据相对应的签名存储在签名文件中。签名被存储在签名文件中的存储位置中,该存储位置由存储在记录文件中的相应文档数据的记录标识符来指定。处理单元通过参考基于签名在签名文件中的存储位置而计算出的记录标识符来检索包含与要搜索的字符串相同的字符串的文档数据。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号