【24h】

Computational Aspects of Data Mining

机译:数据挖掘的计算方面

获取原文

摘要

The last decade has witnessed an impressive growth of Data Mining through algorithms and applications. Despite the advances, a computational theory of Data Mining is still largely outstanding. This paper discusses some aspects relevant to computation in Data Mining from the point of view of the Machine Learning theoretician. Computational techniques used in other fields that deal with learning from data such as Statistics and Machine Learning, are potentially very relevant. However, the specifics of Data Mining are such that most often those techniques are not directly applicable but require to be re-cast and reanalysed within Data Mining starting from first principles. We illustrate this with a PAC-learnability analysis for a Data Mining-like task. We show that accounting for Data Mining specific requirements, such as inference of weak predictors and agnosticity assumptions, requires the generalisation of the classical PAC framework in novel ways.
机译:过去十年目睹了通过算法和应用程序的数据挖掘令人印象深刻的增长。尽管有所进展,但数据挖掘的计算理论仍然很大程度上。本文讨论了与机器学习理论域的数据挖掘中的计算有关的一些方面。在统计和机器学习等数据中处理学习的其他领域中使用的计算技术可能非常相关。然而,数据挖掘的具体细节是,大多数人通常都不适用,但从第一个原则开始,需要在数据挖掘中重新投射和重新分析。我们以PAC-Instemity分析说明了数据挖掘任务。我们展示了数据挖掘特定要求的核算,例如弱预测因子和不可知论性假设的推断,需要以新颖的方式概括古典PAC框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号