Pattern Discovery for Large Mixed-Mode Database

机译：大型混合模式数据库的模式发现

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In business and industry today, large databases with mixed data types (continuous and categorical) are very common. There are great needs to discover patterns from them for knowledge interpretation and understanding. In the past, for classification, this problem is solved as a discrete data problem by first discretizing the continuous data based on the class-attribute interdependence relationship. However, so far no proper solution exists when class information is unavailable. Hence, important pattern post-processing tasks such as pattern clustering and summarization cannot be applied to mixed-mode data. This paper presents a new method for solving the problem. It is based on two essential concepts. (1) Though class information is absent, yet for a correlated dataset, the attribute with the strongest interdependence with others in the group can be used to drive the discretization of the continuous data. (2) For a large database, correlated attribute groups must first be obtained by attribute clustering before (1) can be applied. Based on (1) and (2), pattern discovery methods are developed for mixed-mode data. Extensive experiments using synthetic and real world data were conducted to validate the usefulness and effectiveness of the proposed method.

机译：在当今的商业和工业，混合数据类型（连续和分类）大型数据库是非常普遍的。有很大的需求，发现从他们的模式对知识的解释和理解。在过去，对于分类，这个问题是由第一离散基于类属性的相互依赖关系的连续数据解决作为离散数据的问题。然而，当类信息不可用至今没有妥善解决存在。因此，重要的图案后处理任务，如模式聚类和总结不能被应用到混合模式的数据。本文提出了解决问题的新方法。它是基于两个基本概念。（1）虽然类的信息不存在的，但对于相关的数据集，可以使用具有与其他组中的最强的相互依赖的属性来驱动连续的数据的离散化。（2）对于一个大的数据库，相关属性组必须首先通过属性聚类（1）可应用于之前获得。基于（1）和（2）中，模式发现方法可用于混合模式数据开发的。使用合成的和现实世界的广泛数据进行了实验，以验证所提出的方法的有用性和有效性。

著录项

来源
《ACM conference on information and knowledge management》|2010年||共10页
会议地点
作者
Andrew K.C. Wong; Bin Wu; Gene P.K. Wu; Keith C.C. Chan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
data mining; pattern discovery; unsupervised discretization; attribute clustering; mixed mode data; mutual information;

机译：数据挖掘;模式发现;无监督的离散化;属性群集;混合模式数据;相互信息;

相似文献

外文文献
中文文献
专利

1. Efficient discovery of periodic-frequent patterns in very large databases [J] . R. Uday Kiran, Masaru Kitsuregawa, P. Krishna Reddy The Journal of Systems and Software . 2016,第FEBa期

机译：在超大型数据库中高效发现周期性模式
2. Knowledge discovery of weighted RFM sequential patterns from customer sequence databases [J] . Ya-Han Hu, Tony Cheng-Kui Huang, Yu-Hua Kao The Journal of Systems and Software . 2013,第3期

机译：从客户序列数据库中发现加权RFM序列模式的知识
3. Fast discovery of sequential patterns in large databases using effective time-indexing [J] . Lin MY, Hsueh SC, Chang CW Information Sciences: An International Journal . 2008,第22期

机译：使用有效的时间索引在大型数据库中快速发现顺序模式
4. Pattern Discovery for Large Mixed-Mode Database [C] . Andrew K.C. Wong, Bin Wu, Gene P.K. Wu, CIKM 10;ACM conference on information and knowledge management . 2011

机译：大型混合模式数据库的模式发现
5. Efficient correlated pattern discovery in databases [D] . Ke, Yiping 2008

机译：数据库中的高效相关模式发现
6. Comparison of response patterns in different survey designs: a longitudinal panel with mixed-mode and online-only design [O] . Nicole Rübsamen, Manas K. Akmatov, Stefanie Castell, 2017

机译：比较不同调查设计中的响应模式：带有混合模式和仅在线设计的纵向面板
7. Pattern discovery for large mixed-mode database [O] . Wong AKC, Wu B, Wu GPK, 2010

机译：大型混合模式数据库的模式发现
8. Collected Notes on the Workshop for Pattern Discovery in Large Databases [R] . Buntine, W., Delalto, M. 1991

机译：关于大型数据库中模式发现研讨会的收集说明

Pattern Discovery for Large Mixed-Mode Database

摘要

著录项

相似文献

相关主题

期刊订阅