METHOD, DEVICE AND COMPUTER PROGRAM FOR IDENTIFYING ITEMS HAVING HIGH FREQUENCY OF OCCURRENCE AMONG ITEMS INCLUDED IN TEXT DATA STREAM

Abstract

PROBLEM TO BE SOLVED: To provide a method, device, and computer program for efficiently identifying items that occur with high frequency among the items included in a large-volume text data stream.

SOLUTION: Identification information for an item and a count for that item are stored in a higher-level memory, and only the identification information is stored in a memory of a lower level than that higher level. When text data stream input is received, it is divided into buckets, and each item in a bucket is handled as follows: if the item's identification information is stored in the higher-level memory, the count for the item is incremented; if it is stored in the lower-level memory, the identification information is transferred to the higher-level memory with an initial count; and if it is not stored at any level, the identification information is newly stored in the higher-level memory with the initial count.
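The three-way update rule in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `TwoLevelCounter`, its methods, and the parameters `initial_count` and `demote_below` are hypothetical, and both memory levels are modelled as ordinary in-process containers. The abstract does not say how identification information reaches the lower-level memory, so the demotion step in `end_bucket` is purely an assumption added to make the example exercise all three cases.

```python
from collections.abc import Iterable


class TwoLevelCounter:
    """Illustrative two-level counter; all names here are hypothetical."""

    def __init__(self, initial_count: int = 1, demote_below: int = 2) -> None:
        self.upper: dict[str, int] = {}  # higher-level memory: item ID -> count
        self.lower: set[str] = set()     # lower-level memory: item IDs only
        self.initial_count = initial_count
        # ASSUMPTION: threshold for the demotion policy, not in the abstract.
        self.demote_below = demote_below

    def observe(self, item: str) -> None:
        """Apply the three cases described in the abstract to one item."""
        if item in self.upper:
            # Case 1: ID is in the higher-level memory, so increment its count.
            self.upper[item] += 1
        elif item in self.lower:
            # Case 2: ID is in the lower-level memory, so transfer it upward
            # with the initial count.
            self.lower.discard(item)
            self.upper[item] = self.initial_count
        else:
            # Case 3: ID is stored nowhere, so store it in the higher-level
            # memory with the initial count.
            self.upper[item] = self.initial_count

    def end_bucket(self) -> None:
        """ASSUMPTION: at each bucket boundary, move low-count items down,
        keeping only their IDs (the count is discarded)."""
        for item in [k for k, c in self.upper.items() if c < self.demote_below]:
            del self.upper[item]
            self.lower.add(item)

    def process(self, stream: Iterable[str], bucket_size: int) -> None:
        """Divide the stream into fixed-size buckets and update per item."""
        filled = 0
        for item in stream:
            self.observe(item)
            filled += 1
            if filled == bucket_size:
                self.end_bucket()
                filled = 0


counter = TwoLevelCounter(initial_count=1, demote_below=2)
counter.process("a b a c a b d a".split(), bucket_size=4)
print(counter.upper)          # {'a': 4}
print(sorted(counter.lower))  # ['b', 'c', 'd']
```

In this sketch the transfer case and the new-item case assign the same `initial_count`, because the abstract does not state whether the two initial counts differ. With identical values the lower level only records that an item was seen before; a real design would presumably use that information, or a slower and larger storage tier, in ways the abstract does not describe.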