首页> 外文会议>International Joint Conferences on Web Intelligent and Intelligent Agent Technologies >CosDic: Towards a Comprehensive System for Knowledge Discovery in Large-Scale Data: Architecture, Implementation and Case Studies
【24h】

CosDic: Towards a Comprehensive System for Knowledge Discovery in Large-Scale Data: Architecture, Implementation and Case Studies

机译:COSDIC:在大规模数据中迈向知识发现的综合系统:架构,实施和案例研究

获取原文

摘要

The continued exponential growth in both the volume and the complexity of information is giving birth to a new challenge to the specific requirements of analysts, researchers and intelligence providers. In this paper, to move the scientific activity forward to practice, we elaborate a prototype of our on-going constructed system, CosDic, for knowledge discovery from extremely large-scale datasets. The major infrastructure of CosDic is deployed on a distributed cluster environment using MapReduce platform. To undertake the mining tasks from gigabytes to petabytes, we carefully devised our system, from architecture to particular algorithms, from under layer construction to upper layer public service interface, from effectiveness to efficiency. Moreover, to illustrate its functionality, we employ CosDic to a real-world huge dataset and demonstrate an integrated analysis procedure from initial raw data preprocessing to finally knowledge discovering. We show that CosDic has a good performance in such cloud-scale data computing.
机译:卷的持续指数增长以及信息的复杂性正在为分析师,研究人员和情报提供者的具体要求产生新的挑战。在本文中,为了使科学活动前进到实践中,我们详细阐述了我们正在进行的构建系统,COSDIC的原型,用于了解来自极大的数据集的知识发现。 Cosdic的主要基础架构在使用MapReduce平台上部署在分布式群集环境中。要从千兆字节进行挖掘任务到Petabytes,我们仔细设计了我们的系统,从体系结构到特定算法,从层构建到上层公共服务接口,从有效地实现效率。此外,为了说明其功能,我们将COSDIC雇用了真实世界的巨大数据集,并从初始原始数据预处理中展示了一个综合分析程序,以最后​​知识发现。我们表明COSDIC在这种云级数据计算中具有良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号