首页> 外国专利> SYSTEM AND METHOD FOR PICK-AND-DROP SAMPLING

SYSTEM AND METHOD FOR PICK-AND-DROP SAMPLING

机译:拾取和抽取采样的系统和方法

摘要

A database system includes an input to a database server configured to deliver a data stream formed of a sequence of elements, D={p1, p2, . . . , pm} of size m of numbers from {1, . . . , n} to the database server. The system further includes a computer program that causes a processor to approximate frequency moments (Fk) in the data stream, such that a frequency of an element (i) is defined as fi=|{j:pj=i}| and a k-th frequency moment of D is defined as; <math overflow="scroll"><mrow><msub><mi>F</mi><mi>k</mi></msub><mo>=</mo><mrow><munderover><mo>∑</mo><mrow><mi>i</mi><mo>=</mo><mn>1</mn></mrow><mi>n</mi></munderover><mo></mo><msubsup><mi>m</mi><mi>i</mi><mi>k</mi></msubsup></mrow></mrow></math> ;in a single pass through the data stream. The processor is caused to carry out the steps of locating elements (i) with a frequency ΩFk in the data stream as heavy elements and approximating fi as ≧ a fraction of fi to limit memory resources used by the processor to estimate Fk to O(n1−2/k log(n)) bits.
机译:数据库系统包括到数据库服务器的输入,该数据库服务器被配置为传递由一系列元素D = {p 1 ,p 2 ,...组成的数据流。 。 。 ,大小为m的{<,> p },{1,。 。 。 ,n}到数据库服务器。该系统还包括计算机程序,该计算机程序使处理器近似数据流中的频率矩(F k ),从而将元素(i)的频率定义为f i < / Sub> = | {j:p j = i} | D的第k个频率矩定义为: <![CDATA [<数学溢出=“ scroll”> F k = i = 1 n < / mi> m i k ]]> ;一次通过数据流。使处理器执行以下步骤:定位数据流中频率为ΩF k 的元素(i)作为重元素,并将f i 近似为f i 将处理器用来将F k 估计为O(n 1-2 / k log(n))位的内存资源进行限制。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号