【24h】

Computational Science and Data Mining

机译:计算科学与数据挖掘

获取原文
获取原文并翻译 | 示例

摘要

The last decade has witnessed an impressive growth of Data Mining through algorithms and applications. Despite the advances, a computational theory of Data Mining is still largely outstanding. This paper discusses some aspects relevant to computation in Data Mining from the point of view of the Machine Learning theoretician. Computational techniques used in other fields that deal with learning from data, such as Statistics and Machine Learning, are potentially very relevant. However, the specifics of Data Mining are such that most often those techniques are not directly applicable but require to be re-cast and re-analysed within Data Mining starting from first principles. We illustrate this with a PAC-learnability analysis for a Data Mining-like task. We show that accounting for Data Mining specific requirements, such as inference of weak predictors and agnosticity assumptions, requires the generalisation of the classical PAC framework in novel ways.
机译:在过去的十年中,通过算法和应用程序见证了数据挖掘的惊人增长。尽管取得了进步,但是数据挖掘的计算理论仍然非常出色。本文从机器学习理论家的角度讨论了与数据挖掘中的计算相关的一些方面。在其他领域中用于处理数据学习的计算技术(例如统计和机器学习)可能非常相关。但是,数据挖掘的具体细节使得大多数情况下,这些技术不能直接应用,而是需要从第一原理开始在数据挖掘中进行重新广播和重新分析。我们通过针对数据挖掘类任务的PAC可学习性分析来说明这一点。我们表明,考虑到数据挖掘的特定要求,例如推断弱预测变量和不可知论假设,需要以新颖的方式对经典PAC框架进行概括。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号