Combining Machine Learning and Semantics for Anomaly Detection

Abstract

The emergence of the Internet of Things and stream processing forces large-scale organizations to treat anomaly detection as a key component of their business. Using machine learning to solve such complex use cases is generally a cumbersome, costly, time-consuming, and error-prone process. It involves many tasks, from data cleansing to dimensionality reduction, algorithm selection, and fine-tuning. It also requires the involvement of various experts, such as statisticians, programmers, and testers. With RAMSSES, we remove the burden of this pipeline and demonstrate that these tasks can be automated. Our system relies on a Lambda architecture based on Apache Spark to analyze historical data, perform cleansing, and deal with the curse of dimensionality. It then identifies the most interesting attributes and uses a continuous semantic query generator executed over streams. The sampled data are processed by self-selected machine learning methods to detect anomalies, and an iterative process using end-user annotations significantly improves the accuracy of the system. After describing RAMSSES's main components, we demonstrate the performance and relevance of the system through a thorough evaluation over real-world and synthetic datasets.

Bibliographic record

Source
International Conference on Knowledge Engineering and Knowledge Management | 2018 | 586p | 16 pages
Authors
Badre Belabbess; Musab Bairat; Jeremy Lhez; Olivier Cure
Original format: PDF
Chinese Library Classification: TP18-53
Keywords
Real time; Streaming; Machine learning; Knowledge extraction; Anomaly detection; Smart water management

Similar documents

1. Healthcare and anomaly detection: using machine learning to predict anomalies in heart rate data [J]. Edin Sabic, David Keeley, Bailey Henderson, AI & society. 2021, No. 1
2. Architecting Smart City Digital Twins: Combined Semantic Model and Machine Learning Approach [J]. Austin Mark, Delgoshaei Parastoo, Coelho Maria, Journal of Management in Engineering. 2020, No. 4
3. Combining Machine Learning and Semantic Features in the Classification of Corporate Disclosures [J]. Evert Stefan, Heinrich Philipp, Henselmann Klaus, Journal of logic, language and information. 2019, No. 2
4. Combining Machine Learning and Semantics for Anomaly Detection [C]. Badre Belabbess, Musab Bairat, Jeremy Lhez, International conference on knowledge engineering and knowledge management. 2018
5. Explainability with Semantic Concept Composition and Zero-Shot Learning for Anomaly Detection [D]. Bendre, Nihar Shrikant. 2021
6. Automatic Anomaly Detection on In-Production Manufacturing Machines Using Statistical Learning Methods [O]. Federico Pittino, Michael Puggl, Thomas Moldaschl. 2020
7. Development of Gamma Background Radiation Digital Twin with Machine Learning Algorithms: Application of Unsupervised Machine Learning to Detection of Anomalies and Nuisances in Gamma Background Radiation Environmental Screening Data [O]. Allen Herrera, Eugene Moore, Alexander Heifetz. 2020
