Unsupervised Learning of Patterns in Data Streams Using Compression and Edit Distance

机译：使用压缩和编辑距离在数据流中进行无监督模式学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many unsupervised learning methods for recognising patterns in data streams are based on fixed length data sequences, which makes them unsuitable for applications where the data sequences are of variable length such as in speech recognition, behaviour recognition and text classification. In order to use these methods on variable length data sequences, a pre-processing step is required to manually segment the data and select the appropriate features, which is often not practical in real-world applications. In this paper we suggest an unsupervised learning method that handles variable length data sequences by identifying structure in the data stream using text compression and the edit distance between 'words'. We demonstrate that using this method we can automatically cluster unlabelled data in a data stream and perform segmentation. We evaluate the effectiveness of our proposed method using both fixed length and variable length benchmark datasets, comparing it to the Self-Organising Map in the first case. The results show a promising improvement over baseline recognition systems.

机译：用于识别数据流中模式的许多无监督学习方法都是基于固定长度的数据序列，这使其不适用于数据序列具有可变长度的应用，例如语音识别，行为识别和文本分类。为了在可变长度的数据序列上使用这些方法，需要一个预处理步骤来手动分割数据并选择适当的功能，这在实际应用中通常是不实际的。在本文中，我们提出了一种无监督学习方法，该方法通过使用文本压缩和“单词”之间的编辑距离来识别数据流中的结构来处理可变长度的数据序列。我们证明了使用此方法，我们可以自动将数据流中未标记的数据聚类并执行分段。我们使用固定长度和可变长度基准数据集评估了我们提出的方法的有效性，并将其与第一种情况下的自组织图进行了比较。结果表明，与基线识别系统相比，有希望的改进。

著录项

来源
《International joint conference on artificial intelligence;IJCAI-11》|2012年|p.1231-1236|共6页
会议地点
作者
Sook-Ling Chua; Stephen Marsland; Hans W. Guesgen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. A metapattern-based automated discovery loop for integrated data mining-unsupervised learning of relational patterns [J] . Wei-Min Shen, Bing Leng IEEE Transactions on Knowledge and Data Engineering . 1996,第6期

机译：基于元模式的自动发现循环，用于集成数据挖掘-关系模式的无监督学习
2. Unsupervised Learning and Pattern Recognition of Biological Data Structures with Density Functional Theory and Machine Learning [J] . Chien-Chang Chen, Hung-Hui Juan, Meng-Yuan Tsai, Scientific reports. . 2018,第1期

机译：基于密度泛函理论和机器学习的生物数据结构的无监督学习和模式识别
3. Unsupervised deep learning and analysis of harmonic variation patterns using big data from multiple locations [J] . Ge Chenjie, Oliveira Roger A. D., Gu Irene Y. H., Electric power systems research . 2021,第May期

机译：多个位置大数据的无监督深度学习与谐波变化模式分析
4. Unsupervised Learning of Patterns in Data Streams Using Compression and Edit Distance [C] . Sook-Ling Chua, Stephen Marsland, Hans W. Guesgen International Joint Conference on Artificial Intelligence . 2011

机译：使用压缩和编辑距离无监督数据流中的模式的模式
5. Pattern Learning in Smart Homes and Offices Using Motion Sensor and Mind Wave Data: Unsupervised Approaches [D] . Zhang, Tongda. 2016

机译：使用运动传感器和脑力波数据的智能家庭和办公室的模式学习：无监督的方法
6. FuseAD: Unsupervised Anomaly Detection in Streaming Sensors Data by Fusing Statistical and Deep Learning Models [O] . Mohsin Munir, Shoaib Ahmed Siddiqui, Muhammad Ali Chattha, 2019

机译：FuseAD：通过融合统计和深度学习模型在流传感器数据中进行无监督异常检测
7. A Metapattern-Based Automated Discovery Loop for Integrated Data Mining - Unsupervised Learning of Relational Patterns [O] . Wei-Min Shen, Bing Leng 1996

机译：基于元模式的集成数据挖掘自动发现环-关系模式的无监督学习

Unsupervised Learning of Patterns in Data Streams Using Compression and Edit Distance

摘要

著录项

相似文献

相关主题

期刊订阅