首页> 外文会议>International Conference on Data Mining >Generating clusters' explanations in just one data scan
【24h】

Generating clusters' explanations in just one data scan

机译:在只有一个数据扫描中生成群集的解释

获取原文

摘要

Knowledge discovery from unlabeled data comprises two main tasks: identification of "natural groups" and analysis of these groups in order to interpret their meaning. These tasks are accomplished by unsupervised and supervised learning, respectively, and correspond to the phases of the discovery process described by Langley. The efforts of Knowledge Discovery from Databases (KDD) research field have addressed these two processes into two main dimensions: (1) scaling up the learning algorithms to very large databases and (2) improving the efficiency of the KDD process. In this paper we argue that the advances achieved in scaling up supervised and unsupervised learning algorithms allow us to combine these two processes in just one stream, providing extensional and intensional descriptions of unlabeled data. We propose a framework, called Orpheo, which enables the integration of any two unsupervised and supervised algorithms to compose the stream. A particular advantage of our approach is that, if the two algorithms are incremental, the system will build the model in just one data scan. This characteristic satisfies important desiderata for clustering in data mining posed by Bradley et al. The framework is instantiated using as building blocks two incremental neural networks: the ART1 model and the Combinatorial Neural Model. An application related to the agricultural research is presented to illustrate the approach.
机译:从标签数据的知识点发现包括两个主要任务:“自然群体”和这些组的分析鉴定,以解释其含义。这些任务由无监督和监督学习,分别和对应实现由兰利描述的发现过程的各个阶段。知识发现的从数据库(KDD)研究领域的努力,这两个过程涉及到两个主要方面:(1)扩大学习算法非常大的数据库和(2)提高KDD过程的效率。在本文中,我们认为,在扩大取得的进展监督和无监督的学习算法,使我们能够在短短的一个流这两个过程结合起来,提供标签数据的外延和内涵的描述。我们提出了一个框架,叫做Orpheo,这使得任何两个无监督和监督的算法整合,组成流。我们的方法的一个特别的优点是,如果两个算法是增量,系统将建立在短短的一个数据扫描模型。这种特性满足了在布拉德利等人提出的数据挖掘聚类重要的必要条件。在ART1模型和组合神经元模型:该框架是使用如积木两个增量式神经网络实例化。有关农业研究中的应用,提出了该方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号