首页> 外文期刊>Knowledge-Based Systems >Graph-based approach for outlier detection in sequential data and its application on stock market and weather data
【24h】

Graph-based approach for outlier detection in sequential data and its application on stock market and weather data

机译:基于图的序列数据离群值检测方法及其在股市和天气数据中的应用

获取原文
获取原文并翻译 | 示例

摘要

Outlier detection has a large variety of applications ranging from detecting intrusion in a computer network, to forecasting hurricanes and tornados in weather data, to identifying indicators of potential crisis in stock market data, etc. The problem of finding outliers in sequential data has been widely studied in the data mining literature and many techniques have been developed to tackle the problem in various application domains. However, many of these techniques rely on the peculiar characteristics of a specific type of data to detect the outliers. As a result, they cannot be easily applied to different types of data in other application domains; they should at least be tuned and customized to adapt to the new domain. They also may need certain amount of training data to build their models. This makes them hard to apply especially when only a limited amount of data is available. The work described in this paper tackle the problem by proposing a graph-based approach for the discovery of contextual outliers in sequential data. The developed algorithm offers a higher degree of flexibility and requires less amount of information about the nature of the analyzed data compared to the previous approaches described in the literature. In order to validate our approach, we conducted experiments on stock market and weather data; we compared the results with the results from our previous work. Our analysis of the results demonstrate that the algorithm proposed in this paper is successful and effective in detecting outliers in data from different domains, one financial and the other meteorological.
机译:离群值检测具有广泛的应用范围,从检测计算机网络中的入侵,预测天气数据中的飓风和龙卷风,识别股市数据中潜在危机的指标等。在序列数据中寻找离群值的问题已经广泛存在。在数据挖掘文献中进行了研究,并开发了许多技术来解决各种应用领域中的问题。但是,这些技术中的许多技术都依赖于特定类型数据的特殊特征来检测异常值。结果,它们不能轻易地应用于其他应用程序域中的不同类型的数据。至少应对其进行调整和定制以适应新领域。他们还可能需要一定数量的训练数据才能建立模型。这使得它们难以应用,尤其是在只有有限数量的数据可用时。本文描述的工作通过提出一种基于图的方法来发现顺序数据中的上下文离群值的方法来解决该问题。与文献中描述的先前方法相比,所开发的算法具有更高的灵活性,并且需要较少的有关分析数据性质的信息。为了验证我们的方法,我们对股票市场和天气数据进行了实验;我们将结果与之前的工作进行了比较。我们对结果的分析表明,本文提出的算法在检测来自不同领域,一种金融领域和另一种气象领域的数据中的异常值方面是成功且有效的。

著录项

  • 来源
    《Knowledge-Based Systems》 |2014年第5期|89-97|共9页
  • 作者单位

    Department of Computer Science, Global University, Beirut, Lebanon;

    Department of Computer Science, Global University, Beirut, Lebanon;

    Department of Computer Science, Global University, Beirut, Lebanon;

    Department of Computer Science, Global University, Beirut, Lebanon;

    Department of Computer Science, Global University, Beirut, Lebanon;

    Department of Computer Science, Global University, Beirut, Lebanon;

    Department of Computer Science, Global University, Beirut, Lebanon,Department of Computer Science, University of Calgary, Calgary, Alberta, Canada;

    Department of Computer Science, Global University, Beirut, Lebanon;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Data mining; Graph-based algorithm; Outlier detection; Weather data; Stock market;

    机译:数据挖掘;基于图的算法;离群值检测;天气数据;股市;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号