...
首页> 外文期刊>The Computer journal >Orange4WS Environment for Service-Oriented Data Mining
【24h】

Orange4WS Environment for Service-Oriented Data Mining

机译:用于面向服务的数据挖掘的Orange4WS环境

获取原文
获取原文并翻译 | 示例

摘要

Novel data-mining tasks in e-science involve mining of distributed, highly heterogeneous data and knowledge sources. However, standard data mining platforms, such as Weka and Orange, involve only their own data mining algorithms in the process of knowledge discovery from local data sources. In contrast, next generation data mining technologies should enable processing of distributed data sources, the use of data mining algorithms implemented as web services, as well as the use of formal descriptions of data sources and knowledge discovery tools in the form of ontologies, enabling automated composition of complex knowledge discovery workflows for a given data mining task. This paper proposes a novel Service-oriented Knowledge Discovery framework and its implementation in a service-oriented data mining environment Orange4WS (Orange for Web Services), based on the existing Orange data mining toolbox and its visual programming environment, which enables manual composition of data mining workflows. The new service-oriented data mining environment Orange4WS includes the following new features: simple use of web services as remote components that can be included into a data mining workflow; simple incorporation of relational data mining algorithms; a knowledge discovery ontology to describe workflow components (data, knowledge and data mining services) in an abstract and machine-interpretable way, and its use by a planner that enables automated composition of data mining workflows. These new features are showcased in three real-world scenarios.
机译:电子科学中的新型数据挖掘任务涉及对分布式,高度异构的数据和知识源的挖掘。但是,标准数据挖掘平台(例如Weka和Orange)在从本地数据源进行知识发现的过程中仅涉及其自己的数据挖掘算法。相反,下一代数据挖掘技术应能够处理分布式数据源,使用实现为Web服务的数据挖掘算法以及使用本体形式的数据源形式描述和知识发现工具,从而实现自动化给定数据挖掘任务的复杂知识发现工作流程的组成。本文基于现有的Orange数据挖掘工具箱及其可视化编程环境,提出了一种新颖的面向服务的知识发现框架及其在面向服务的数据挖掘环境Orange4WS(Orange for Web Services)中的实现,该框架能够手动组合数据挖掘工作流程。新的面向服务的数据挖掘环境Orange4WS包括以下新功能:简单地将Web服务用作可以包含在数据挖掘工作流中的远程组件;关系数据挖掘算法的简单合并;知识发现本体,以抽象的和机器可解释的方式描述工作流程的组成部分(数据,知识和数据挖掘服务),并由计划人员使用,以实现数据挖掘工作流程的自动组合。这些新功能在三种现实情况中得以展示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号