首页> 外文会议>American Control Conference >Alphabet size selection for symbolization of dynamic data-driven systems: An information-theoretic approach
【24h】

Alphabet size selection for symbolization of dynamic data-driven systems: An information-theoretic approach

机译:动态数据驱动系统符号化的字母大小选择:一种信息理论方法

获取原文

摘要

Symbolic time series analysis (STSA) is built upon the concept of symbolic dynamics that deals with discretization of dynamical systems in both space and time. The notion of STSA has led to the development of a pattern recognition tool in the paradigm of dynamic data-driven application systems (DDDAS), where a time series of sensor signals is partitioned to obtain a symbol sequence that, in turn, leads to the construction of probabilistic finite state automata (PFSA). Although modeling of PFSA from symbol sequences has been widely reported, similar efforts have not been expended to investigate how to find an appropriate alphabet size for partitioning of time series so that the symbol sequences can be optimally generated. This paper addresses this critical issue and proposes an information-theoretic procedure of data partitioning to extract low-dimensional features from time series. The key idea lies in optimal partitioning of the time series via maximization of the mutual information between the input state probability vector and pattern classes. The proposed procedure has been validated by two examples. The first example elucidates the underlying concept of data partitioning for parameter identification in a Duffing system with a sinusoidal input excitation. The second example is built upon time series of chemiluminescence data to predict lean blow-out (LBO) phenomena in a laboratory-scale combustor. Classification performance of data partitioning is analyzed in each of the two examples.
机译:符号时间序列分析(STSA)建立在符号动力学概念的基础上,该概念涉及时空动力系统的离散化。 STSA的概念已导致在动态数据驱动的应用系统(DDDAS)范式中开发了模式识别工具,其中对传感器信号的时间序列进行了划分,以获得一个符号序列,进而产生了符号序列。概率有限状态自动机(PFSA)的构造。尽管已经广泛报道了根据符号序列对PFSA进行建模的方法,但尚未进行类似的工作来研究如何找到合适的字母大小来划分时间序列,以便可以最佳地生成符号序列。本文解决了这一关键问题,并提出了一种信息理论的数据分区程序,以从时间序列中提取低维特征。关键思想在于通过最大化输入状态概率向量和模式类别之间的互信息来对时间序列进行最佳划分。所提出的程序已通过两个示例验证。第一个示例阐明了在具有正弦输入激励的Duffing系统中用于参数识别的数据分区的基本概念。第二个示例建立在化学发光数据的时间序列上,以预测实验室规模燃烧器中的稀薄吹出(LBO)现象。在两个示例中的每个示例中,都对数据分区的分类性能进行了分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号