首页> 外国专利> SEQUENTIAL IMPORTANT WORD EXTRACTION DEVICE, SEQUENTIAL IMPORTANT WORD EXTRACTION METHOD AND PROGRAM

SEQUENTIAL IMPORTANT WORD EXTRACTION DEVICE, SEQUENTIAL IMPORTANT WORD EXTRACTION METHOD AND PROGRAM

机译:序贯重要单词提取装置,序贯重要单词提取方法和程序

摘要

PROBLEM TO BE SOLVED: To provide a technology, in discriminating an important word from a word set, capable of preventing an increase in a storage capacity and updating TFIDF upon arrival of a packet.;SOLUTION: A HTTP data assembly section 22 links fragmented HTTP data stored in a packet received from a packet reception section 21 through a terminal 10 and restores the HTTP data to an original state thereof. A keyword extraction section 23 extracts a keyword by applying a morphological analysis to the original HTTP data. A calculation section 24 calculates a level of importance by acquiring a parameter for each received word necessary for a calculation from a keyword parameter DB25. The level of importance is expressed in a form of a recurrence formula which requires only a last value for the calculation and thereby preventing an increase in a storage capacity and making a real time calculation possible. An important word transmission section 26 packetizes the calculated levels of importance or words having high levels of importance and transmits the packetized data to a service device 40.;COPYRIGHT: (C)2012,JPO&INPIT
机译:解决的问题:提供一种从单词集中区分重要单词的技术,该技术能够防止存储容量的增加并在数据包到达时更新TFIDF。;解决方案:HTTP数据组装部分22链接分段的HTTP存储在通过终端10从分组接收部分21接收的分组中的数据,并将HTTP数据恢复到其原始状态。关键字提取部分23通过对原始HTTP数据进行形态分析来提取关键字。计算部分24通过从关键字参数DB25获取用于计算所需的每个接收到的单词的参数来计算重要度。重要程度以递归公式的形式表示,该递归公式只需要最后一个值即可进行计算,从而可以防止存储容量的增加并使实时计算成为可能。重要单词发送部分26将计算出的重要性级别或具有高重要性级别的单词打包,并将打包的数据发送到服务设备40。版权所有:(C)2012,JPO&INPIT

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号