METHOD, DEVICE AND COMPUTER PROGRAM FOR IDENTIFYING ITEMS HAVING HIGH FREQUENCY OF OCCURRENCE AMONG ITEMS INCLUDED IN TEXT DATA STREAM
Abstract
PROBLEM TO BE SOLVED: To provide a method, device and computer program for efficiently identifying items having a high frequency of occurrence among items included in a large-volume text data stream.

SOLUTION: A higher-level memory stores, for each item, identification information identifying the item and a count for the item; a lower-level memory stores identification information only. When text data stream input is received, it is divided into buckets, and each item included in a bucket is handled as follows: if identification information for the item is stored in the higher-level memory, the count for the item is incremented; if it is stored in the lower-level memory, the identification information is transferred to the higher-level memory with an initial count; and if it is stored in neither level, the identification information is newly stored in the higher-level memory with the initial count.
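The three cases in the SOLUTION can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `TwoLevelCounter`, the helper `split_into_buckets`, the fixed bucket size, the initial count of 1, and the use of an in-process `dict` and `set` to stand in for the two memory levels are all assumptions made for illustration. The abstract does not say how identification information reaches the lower-level memory, so the `demote` method is a purely hypothetical helper included only so that the lower-level case can be exercised.

```python
from typing import Dict, Iterable, Iterator, List, Set


def split_into_buckets(items: Iterable[str], bucket_size: int) -> Iterator[List[str]]:
    """Divide a stream of items into consecutive buckets (assumed fixed size)."""
    bucket: List[str] = []
    for item in items:
        bucket.append(item)
        if len(bucket) == bucket_size:
            yield bucket
            bucket = []
    if bucket:  # emit the final, possibly shorter, bucket
        yield bucket


class TwoLevelCounter:
    """Illustrative two-level store: IDs with counts above, IDs only below."""

    def __init__(self, initial_count: int = 1) -> None:
        # Assumption: the initial count is 1; the abstract does not give a value.
        self.initial_count = initial_count
        self.higher: Dict[str, int] = {}  # higher-level memory: ID -> count
        self.lower: Set[str] = set()      # lower-level memory: ID only

    def process_bucket(self, bucket: Iterable[str]) -> None:
        for item in bucket:
            if item in self.higher:
                # Case 1: ID is in the higher level, so increment its count.
                self.higher[item] += 1
            elif item in self.lower:
                # Case 2: ID is in the lower level, so transfer it upward
                # with the initial count (assumed to remove it from below).
                self.lower.remove(item)
                self.higher[item] = self.initial_count
            else:
                # Case 3: ID is in neither level, so store it newly in the
                # higher level with the initial count.
                self.higher[item] = self.initial_count

    def demote(self, item: str) -> None:
        """Hypothetical helper, not described in the abstract: move an ID
        to the lower level and drop its count."""
        if item in self.higher:
            del self.higher[item]
            self.lower.add(item)


if __name__ == "__main__":
    counter = TwoLevelCounter()
    stream = ["a", "b", "a", "c", "a", "b"]
    for bucket in split_into_buckets(stream, bucket_size=3):
        counter.process_bucket(bucket)
    print(counter.higher)
```

In this sketch the count is kept only for items in the higher level, which mirrors the abstract's point that the lower level holds identification information alone; an item returning from the lower level therefore restarts at the initial count.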