首页> 美国政府科技报告 >Anomaly Detection in Large Sets of High-Dimensional Symbol Sequences

【24h】

Anomaly Detection in Large Sets of High-Dimensional Symbol Sequences

机译：大型高维符号序列的异常检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper addresses the problem of detecting and describing anomalies in large sets of high-dimensional symbol sequences. The approach taken uses unsupervised clustering of sequences using the normalized longest common subsequence (LCS) as a similarity measure, followed by detailed analysis of outliers to detect anomalies. As the LCS measure is expensive to compute, the first part of the paper discusses existing algorithms, such as the Hunt-Szymanski algorithm, that have low time-complexity. We then discuss why these algorithms often do not work well in practice and present a new hybrid algorithm for computing the LCS that, in our tests, outperforms the Hunt-Szymanski algorithm by a factor of five. The second part of the paper presents new algorithms for outlier analysis that provide comprehensible indicators as to why a particular sequence was deemed to be an outlier. The algorithms provide a coherent description to an analyst of the anomalies in the sequence, compared to more normal sequences. The algorithms we present are general and domain-independent, so we discuss applications in related areas such as anomaly detection.

著录项

作者
Budalakoti, Suratna; Srivastava, Ashok N.; Akella, Ram; Turkov, Eugene;
展开▼
作者单位

展开▼
年度 2006
页码 1-18
总页数 18
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
ANOMALIES; DETECTION; SYMBOLS; MACHINE LEARNING; SEQUENTIAL ANALYSIS; DATA MINING; GREEDY ALGORITHMS; FLIGHT TESTS; GRAPHS (CHARTS); RUN TIME (COMPUTERS);

机译：异常;检测;符号;机器学习;顺序分析;数据挖掘;贪心算法;飞行测试;图表（图表）;运行时间（计算机）;

相似文献

外文文献
中文文献
专利

1. Anomaly Detection in Large Sets of High-Dimensional Symbol Sequences (sequenceMiner) [J] . NASA Tech Briefs . 2015,第12期

机译：大套高维符号序列（sequenceMiner）中的异常检测
2. Anomaly Detection and Diagnosis Algorithms for Discrete Symbol Sequences with Applications to Airline Safety [J] . Budalakoti S., Srivastava A.N., Otey M.E. IEEE transactions on systems, man and cybernetics. Part C, Applications and reviews . 2009,第1期

机译：离散符号序列的异常检测与诊断算法及其在航空安全中的应用
3. Symbol-by-symbol CDMA spreading gain adaptation and detection using OVSF sequences [J] . Lee D., Lih-feng Tsaur, Naeem M. Wireless Communications, IEEE Transactions on . 2009,第9期

机译：使用OVSF序列的逐符号CDMA扩频增益自适应和检测
4. Ordinal Outlier Algorithm for Anomaly Detection of High-Dimensional Data Sets [C] . Gang Chen, Linlin Du, Baoran An Chinese Control and Decision Conference . 2020

机译：高维数据集异常检测的序数离群值算法
5. Distribution-free Fault Identification and Anomaly Detection in High-dimensional Data [D] . Turkoz, Mehmet. 2018

机译：高维数据的无分布故障识别与异常检测
6. A Hybrid Semi-Supervised Anomaly Detection Model for High-Dimensional Data [O] . Hongchao Song, Zhuqing Jiang, Aidong Men, 2017

机译：高维数据的混合半监督异常检测模型
7. Optimal sequence detection and optimal symbol-by-symbol detection: similar algorithms [O] . Jeremiah F. Hayes, Senior Member, Juan, 1982

机译：最佳序列检测和最佳符号逐个检测：类似的算法

Anomaly Detection in Large Sets of High-Dimensional Symbol Sequences

摘要

著录项

相似文献

相关主题

期刊订阅