首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >On data sparsification and a recursive algorithm for estimating a kernel-based measure of independence
【24h】

On data sparsification and a recursive algorithm for estimating a kernel-based measure of independence

机译:关于数据稀疏化和用于估计基于核的独立性的递归算法

获取原文

摘要

Technological improvements have led to situations where data sets are sufficiently rich that in the interests of processing speed it is desirable to throw away samples that provide little additional information. This is referred to here as data sparsification. The first contribution is a study of a recently proposed data sparsification scheme; ideas from vector quantisation are used to assess its performance. Informed by this study, a modification of the data sparsification algorithm is proposed and applied to the problem of estimating a kernel-based measure of independence of two datasets. (Given i.i.d. observations from two random variables, x and y, the underlying problem is to determine whether or not x and y are independent of each other.) The second contribution of this paper is to make recursive an existing algorithm for measuring independence and able to operate on both raw data and on sparsified data generated by the aforementioned data sparsification algorithm. Compared with the original algorithm, the recursive algorithm is significantly faster due to its lower memory and computational requirements.
机译:技术上的进步导致了这样一种情况,即数据集足够丰富,以至于为了提高处理速度,需要丢弃很少提供额外信息的样本。这在这里称为数据稀疏化。第一项贡献是对最近提出的数据稀疏化方案的研究;矢量量化的思想被用来评估其性能。在这项研究的指导下,提出了对数据稀疏化算法的改进,并将其应用于估计基于核的两个数据集独立性度量的问题。 (鉴于对x和y这两个随机变量的独立观察,根本的问题是确定x和y是否彼此独立。)本文的第二个贡献是使递归成为现有的测量独立性和能力的算法。对由上述数据稀疏化算法生成的原始数据和稀疏化数据进行操作。与原始算法相比,递归算法由于其较低的内存和计算要求而明显更快。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号