首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery(DaWaK 2006) >Learning Classifiers from Distributed, Ontology-Extended Data Sources
【24h】

Learning Classifiers from Distributed, Ontology-Extended Data Sources

机译:从分布式,本体 - 扩展数据源的学习分类器

获取原文

摘要

There is an urgent need for sound approaches to integrative and collaborative analysis of large, autonomous (and hence, inevitably semantically heterogeneous) data sources in several increasingly data-rich application domains. In this paper, we precisely formulate and solve the problem of learning classifiers from such data sources, in a setting where each data source has a hierarchical ontology associated with it and semantic correspondences between data source ontologies and a user ontology are supplied. The proposed approach yields algorithms for learning a broad class of classifiers (including Bayesian networks, decision trees, etc.) from semantically heterogeneous distributed data with strong performance guarantees relative to their centralized counterparts. We illustrate the application of the proposed approach in the case of learning Naive Bayes classifiers from distributed, ontology-extended data sources.
机译:迫切需要在几个越来越多的数据丰富的应用域中的大型,自主(以及因此,不可避免地,不可避免地)数据源的综合和协作分析途径迫切需要。在本文中,我们精确地制定和解决这些数据源的学习分类器的问题,其中每个数据源具有与它相关联的分层本体和数据源本体和用户本体之间的语义对应。所提出的方法产生用于从语义异构分布式数据学习广泛类别的分类器(包括贝叶斯网络,决策树等)的算法,其具有相对于其集中式对应物的强烈性能保证。我们说明了在从分布式的本体 - 扩展数据源学习Naive Bayes分类器的情况下提出的方法的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号