首页> 外国专利> KEYWORD EXTRACTION DEVICE, SIMILAR DOCUMENT RETRIEVAL DEVICE USING THE SAME, KEYWORD EXTRACTION METHOD AND RECORD MEDIUM

KEYWORD EXTRACTION DEVICE, SIMILAR DOCUMENT RETRIEVAL DEVICE USING THE SAME, KEYWORD EXTRACTION METHOD AND RECORD MEDIUM

机译:关键字提取设备,使用该关键字提取类似文档的设备,关键字提取方法和记录介质

摘要

PROBLEM TO BE SOLVED: To highly precisely extract a keyword considering respective documents in a data base from a text given as a keyword extraction object without executing a troublesome processing such as morpheme analysis on the respective documents in the data base. ;SOLUTION: A word extraction part 12b extracts a word from a keyword extraction object text. Intra-text appearing frequency is obtained at every extracted word and is stored in a word management table 13b. A word retrieval execution part 12c searches the full texts of the respective documents in a document data base storage part 11b at every extracted word. Intra-data base appearing frequency is obtained and is stored in the word management table 13b. A significance calculation part 12d calculates the significance of the respective words based on intra-text appearing frequency and intra-data base appearing frequency, which are stored in the word management table 13b. A keyword deciding part 12e decides a keyword based on the significance of the respective words.;COPYRIGHT: (C)2000,JPO
机译:解决的问题:在不对数据库中的各个文档执行诸如词素分析之类的麻烦处理的情况下,从作为关键字提取对象的文本中高精度地考虑数据库中的各个文档来提取关键字。 ;解决方案:单词提取部分12b从关键字提取对象文本中提取单词。在每个提取的单词处获得文本内出现频率,并将其存储在单词管理表13b中。单词检索执行部分12c在每个提取的单词处在文档数据库存储部分11b中搜索各个文档的全文。获得数据库内出现频率并将其存储在单词管理表13b中。重要性计算部分12d基于存储在单词管理表13b中的文本内出现频率和数据库内出现频率来计算各个单词的重要性。关键字确定部分12e基于各个单词的含义来确定关键字。版权所有:(C)2000,JPO

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号