首页>
外国专利>
Method and system for configuring presence bitmaps identifying records with unique keys in a large data set
Method and system for configuring presence bitmaps identifying records with unique keys in a large data set
展开▼
机译:用于配置存在位图的方法和系统,该存在位图使用大型数据集中的唯一键来标识记录
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system, method, and apparatus are provided for supporting and/or executing count-distinct queries. A large set of data (e.g., tens or hundreds of millions of event records) is condensed daily to generate presence bitmaps to reflect the distinctiveness of a selected data dimension S (e.g., user ID) for one or more key dimensions g1, g2, . . . (e.g., advertisement ID, campaign ID, advertiser ID). The condensation process eliminates duplication and yields a single value (e.g., 1 or 0) for each tuple [S, g1, . . . ] to represent the distinctiveness of each value in the S dimension to each combination of values in the grouping dimensions. On a monthly basis, the daily values are condensed to yield a single value for the month, and a similar process is applied on any other desired time granularities (e.g., year). The condensed data may be generated for any combination of selected dimension(s) and grouping dimension(s).
展开▼