首页> 外国专利> DOCUMENT INDEXING DEVICE, DOCUMENT INDEXING METHOD AND DOCUMENT INDEXING PROGRAM

DOCUMENT INDEXING DEVICE, DOCUMENT INDEXING METHOD AND DOCUMENT INDEXING PROGRAM

机译:文件索引装置,文件索引方法和文件索引程序

摘要

PROBLEM TO BE SOLVED: To facilitate document text retrieval by a user by easily and automatically extracting keywords for large amounts of document texts, especially, existing Japanese document texts, and applying those keywords to the document texts.;SOLUTION: The document indexing device is provided with: a character code identifying part (131) for identifying the character type of characters configuring a Japanese document text based on a character code from the text, and for respectively extracting a Kanji character string and a Katakana character string; character string appearance frequency counting parts (132, 134) for counting the appearance frequency of the extracted character string; and keyword generating parts (133, 135) for acquiring the character string whose appearance frequency is a predetermined rate or more to the total number of respective character strings in the Japanese document text as keywords.;COPYRIGHT: (C)2007,JPO&INPIT
机译:解决的问题:为了方便用户检索文档文本,方法是轻松,自动地提取大量文档文本(尤其是现有的日语文档文本)的关键字,并将这些关键字应用于文档文本。具备:字符代码识别部(131),用于根据文字中的字符代码来识别构成日语文档文本的字符的字符类型,并分别提取汉字字符串和片假名字符串;字符串出现频率计数部分(132、134),用于对所提取的字符串的出现频率进行计数;关键字产生部分(133、135),用于获取出现频率为日语文档文本中各个字符串总数的预定比率以上的字符串作为关键字。COPYRIGHT:(C)2007,JPO&INPIT

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号