首页> 外文会议>Annual Allerton Conference on Communication, Control, and Computing >Unsupervised nonparametric anomaly detection: A kernel method
【24h】

Unsupervised nonparametric anomaly detection: A kernel method

机译:无监督的非参数异常检测:一种核方法

获取原文

摘要

An anomaly detection problem is investigated, in which s out of n sequences are anomalous and need to be detected. Each sequence consists of m independent and identically distributed (i.i.d.) samples drawn either from a nominal distribution p or from an anomalous distribution q that is distinct from p. Neither p nor q is known a priori. Two scenarios respectively with s known and unknown are studied. Distribution-free tests are constructed based on the metric of the maximum mean discrepancy (MMD). It is shown that if the value of s is known, as n goes to infinity, the number m of samples in each sequence should be of order O(log n) or larger to guarantee that the constructed test is exponentially consistent. On the other hand, if the value of s is unknown, the number m of samples in each sequence should be of the order strictly greater than O(log n) to guarantee the constructed test is consistent. The computational complexity of all tests are shown to be polynomial. Numerical results are provided to confirm the theoretic characterization of the performance. Further numerical results on both synthetic data sets and real data sets demonstrate that the MMD-based tests outperform or perform as well as other approaches.
机译:研究了一个异常检测问题,其中n个序列中的s个是异常的,需要检测。每个序列由m个独立的且分布均匀的(i.i.d.)样本组成,这些样本是从标称分布p或不同于p的异常分布q中提取的。 p和q都不是先验的。研究了分别已知和未知的两种情况。基于最大平均差异(MMD)的指标构建无分布测试。结果表明,如果s的值是已知的,则随着n趋于无穷大,每个序列中的样本数量m应当为O(log n)或更大,以确保所构造的测试是指数一致的。另一方面,如果s的值未知,则每个序列中的样本数m的数量级应严格大于O(log n),以确保构造的测试是一致的。所有测试的计算复杂度均显示为多项式。提供数值结果以确认性能的理论特征。综合数据集和真实数据集上的进一步数值结果表明,基于MMD的测试的性能或性能优于其他方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号