首页> 外文期刊>Pattern recognition and image analysis: advances in mathematical theory and applications in the USSR >A Polynomial-Time Approximation Algorithm for One Problem Simulating the Search in a Time Series for the Largest Subsequence of Similar Elements
【24h】

A Polynomial-Time Approximation Algorithm for One Problem Simulating the Search in a Time Series for the Largest Subsequence of Similar Elements

机译:一个问题的多项式逼近算法模拟时间序列搜索的最大随后的应用

获取原文
获取原文并翻译 | 示例
           

摘要

We analyze the mathematical aspects of the data analysis problem consisting in the search (selection) for a subset of similar elements in a group of objects. The problem arises, in particular, in connection with the analysis of data in the form of time series (discrete signals). One of the problems in modeling this challenge is considered, namely, the problem of searching in a finite sequence of points from the Euclidean space for the subsequence with the greatest number of terms such that the quadratic spread of the elements of this subsequence with respect to its unknown centroid does not exceed a given percentage of the quadratic spread of elements of the input sequence with respect to its centroid. It is shown that the problem is strongly NP-hard. A polynomial-time approximation algorithm is proposed. This algorithm either establishes that the problem has no solution or finds a 1/2-approximate solution if the length M _(*)of the optimal subsequence is even, or it yields a $$rac{1}{2}left(egin{array}{c}1-rac{1}{M^*} end{array}ight)$$ 1 2 ( 1 ? 1 M * ) -approximate solution if M * is odd. The time complexity of the algorithm is O ( N _(3)( N _(2)+ q )), where N is the number of points in the input sequence and q is the space dimension. The results of numerical simulation that demonstrate the effectiveness of the algorithm are presented.
机译:我们分析了在一组对象中的类似元素的搜索(选择)中组成的数据分析问题的数学方面。特别是与时间序列(离散信号)形式的数据分析有关的问题。考虑建模这种挑战的问题之一,即,以欧几里德空间的有限点搜索有限序列的问题,以获得最大的术语,使得该子序列的元素的二次传播相对于其未知的质心不超过输入序列的元素与其质心的元素二次传播的给定百分比。结果表明,问题是强烈的np-suld。提出了多项式时间近似算法。该算法建立问题没有解决方案,或者如果最佳子序列的长度m _(*)均匀,或者它会产生$$ frac {1} {2} left ( begin {array} {c} 1- frac {1} {m ^ *} end {array}右)$$ 1 2(1?1 m *) - 如果m *是奇数。算法的时间复杂度是O(n _(3)(n _(2)+ q)),其中n是输入序列中的点数,q是空间尺寸。介绍了数值模拟的结果,证明了算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号