首页> 美国卫生研究院文献>Journal of the Royal Society Interface >Data smashing: uncovering lurking order in data
【2h】

Data smashing: uncovering lurking order in data

机译:数据粉碎:发现数据中的潜伏次序

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

From automatic speech recognition to discovering unusual stars, underlying almost all automated discovery tasks is the ability to compare and contrast data streams with each other, to identify connections and spot outliers. Despite the prevalence of data, however, automated methods are not keeping pace. A key bottleneck is that most data comparison algorithms today rely on a human expert to specify what ‘features' of the data are relevant for comparison. Here, we propose a new principle for estimating the similarity between the sources of arbitrary data streams, using neither domain knowledge nor learning. We demonstrate the application of this principle to the analysis of data from a number of real-world challenging problems, including the disambiguation of electro-encephalograph patterns pertaining to epileptic seizures, detection of anomalous cardiac activity from heart sound recordings and classification of astronomical objects from raw photometry. In all these cases and without access to any domain knowledge, we demonstrate performance on a par with the accuracy achieved by specialized algorithms and heuristics devised by domain experts. We suggest that data smashing principles may open the door to understanding increasingly complex observations, especially when experts do not know what to look for.
机译:从自动语音识别到发现不寻常的恒星,几乎所有自动发现任务的基础都是能够相互比较和对比数据流,识别连接和发现异常值。尽管数据盛行,但是自动化方法并没有跟上步伐。一个关键瓶颈在于,当今大多数数据比较算法都依赖人类专家来指定数据的哪些“特征”与比较相关。在这里,我们提出了一种新的原理,即不使用领域知识也不使用学习来估计任意数据流的源之间的相似性。我们证明了该原理在许多现实世界中具有挑战性的问题的数据分析中的应用,包括消除与癫痫发作有关的脑电图模式的歧义,从心音记录中检测异常的心脏活动以及从中提取天文物体的分类原始光度法。在所有这些情况下,在没有任何领域知识的情况下,我们展示的性能与领域专家设计的专用算法和启发式方法所达到的准确性相当。我们建议数据粉碎原则可能为理解日益复杂的观察结果打开大门,尤其是当专家不知道要寻找什么时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号