首页> 外文会议>International Congress on Automation in Mining >Data stream mining for knowledge discovery in real time
【24h】

Data stream mining for knowledge discovery in real time

机译:数据流挖掘实时知识发现

获取原文

摘要

Data stream mining is the process of extracting knowledge from continuous, rapid data records where the goal is to predict the class or value of new instances in the data stream. Real time data and events software are one of the applications most widely used by industries. The huge amount of data collected and the need to control and predict industrial business processes behavior in real time, makes that kind of software a suitable data source for applying machine learning techniques to discover processes' hidden knowledge. The goal of this work is to adapt machine learning algorithms in order to process real time data and obtain useful information for operational decision making. We will concentrate on adaptive sliding window algorithm for detecting change and keeping updated statistics from a data stream, very fast decision trees as a classification scheme, unsupervised k-means clustering for discovering data groups and cascade trained neural networks for pattern recognition. Data and knowledge visualization is made through mobile devices, desktops and tablets computers. The benefits of applying this kind of technology are: Visualize in advance events that could affect plant operation, predict and forecast equipment behavior, early fault detection, energy consumption pattern recognition, among others. Compared to traditional business intelligence software, this application has better performance for a large volume of high speed data streams, bringing real time knowledge for decision support. In order to apply the techniques exposed in this paper, a business project must be conducted to identify process knowledge needs, gaps and improvement opportunities.
机译:数据流挖掘是从持续的快速数据记录中提取知识的过程,其中目标是预测数据流中的新实例的类或值。实时数据和事件软件是行业最广泛使用的应用程序之一。收集的大量数据以及需要实时控制和预测工业业务流行的需求,使得这种软件适用于应用机器学习技术来发现流程的隐藏知识的数据源。这项工作的目标是调整机器学习算法,以便处理实时数据并获得操作决策的有用信息。我们将专注于自适应滑动窗口算法,用于检测数据流的变化,使更新的统计数据,非常快的决策树作为分类方案,无监督的K-Means群集用于发现数据组和级联培训的神经网络进行模式识别。通过移动设备,台式机和平板电脑计算机进行数据和知识可视化。应用这种技术的好处是:预先可视化可能影响植物运行,预测和预测设备行为,早期故障检测,能量消耗模式识别的事件。与传统商业智能软件相比,此应用程序对大量高速数据流具有更好的性能,为决策支持带来了实时知识。为了应用本文所公开的技术,必须进行商业项目以确定流程知识需求,差距和改进机会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号