首页> 外文期刊>Information Processing & Management >SSTS: A syntactic tool for pattern search on time series
【24h】

SSTS: A syntactic tool for pattern search on time series

机译:SSTS:用于按时间序列进行模式搜索的语法工具

获取原文
获取原文并翻译 | 示例
       

摘要

Nowadays, data scientists are capable of manipulating and extracting complex information from time series data, given the current diversity of tools at their disposal. However, the plethora of tools that target data exploration and pattern search may require an extensive amount of time to develop methods that correspond to the data scientist's reasoning, in order to solve their queries. The development of new methods, tightly related with the reasoning and visual analysis of time series data, is of great relevance to improving complexity and productivity of pattern and query search tasks. In this work, we propose a novel tool, capable of exploring time series data for pattern and query search tasks in a set of 3 symbolic steps: Pre-Processing, Symbolic Connotation and Search. The framework is called SSTS (Symbolic Search in Time Series) and uses regular expression queries to search the desired patterns in a symbolic representation of the signal. By adopting a set of symbolic methods, this approach has the purpose of increasing the expressiveness in solving standard pattern and query tasks, enabling the creation of queries more closely related to the reasoning and visual analysis of the signal. We demonstrate the tool's effectiveness by presenting 9 examples with several types of queries on time series. The SSTS queries were compared with standard code developed in Python, in terms of cognitive effort, vocabulary required, code length, volume, interpretation and difficulty metrics based on the Halstead complexity measures. The results demonstrate that this methodology is a valid approach and delivers a new abstraction layer on data analysis of time series.
机译:如今,鉴于当前可使用的各种工具,数据科学家能够从时间序列数据中处理和提取复杂的信息。但是,针对数据探索和模式搜索的大量工具可能需要大量时间来开发与数据科学家的推理相对应的方法,以解决他们的查询问题。与时间序列数据的推理和可视化分析紧密相关的新方法的开发与提高模式和查询搜索任务的复杂性和生产力密切相关。在这项工作中,我们提出了一种新颖的工具,该工具能够通过3个符号步骤来探索模式和查询搜索任务的时间序列数据:预处理,符号含义和搜索。该框架称为SSTS(时间序列中的符号搜索),并使用正则表达式查询以信号的符号表示形式搜索所需的模式。通过采用一组符号方法,此方法的目的是提高解决标准模式和查询任务时的表现力,从而使查询的创建与信号的推理和视觉分析更加紧密相关。我们通过提供9个示例以及几种按时间序列进行的查询来演示该工具的有效性。将SSTS查询与Python开发的标准代码进行了比较,包括基于Halstead复杂性度量的认知努力,所需词汇量,代码长度,数量,解释和难度指标。结果表明,该方法是一种有效的方法,并为时间序列的数据分析提供了新的抽象层。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号