首页> 外文会议>ACM SIGKDD workshop on useful patterns 2010 >Point-Distribution Algorithm for Mining Vector-Item Patterns
【24h】

Point-Distribution Algorithm for Mining Vector-Item Patterns

机译:向量项模式挖掘的点分布算法

获取原文
获取原文并翻译 | 示例

摘要

An algorithm is presented for rinding patterns between sets of continuous attributes and item sets. In contrast to most pattern mining approaches, the algorithm considers multiple continuous attributes as a single vector attribute. This approach results in a separate abstraction level and allows multiple vector attributes to be considered. We show that the pattern mining process can uncover relationships between the vector data and item sets. Filtering according to these patterns can be seen as feature selection at the level of the vector attributes as opposed to individual continuous attributes. In the evaluation, we show that the pattern mining algorithm can more effectively and efficiently achieve this filtering than a direct application of classification algorithms. Patterns are identified by relating item data to the distribution of objects within the vector space that is spanned by the sets of continuous attributes. The Kullback-Leibler divergence provides a quantitative measure that establishes whether the subset defined by an item set differs from the overall distribution of data points. The set-subset relationship of data points, which violates i.i.d assumptions, requires an adaptation of standard algorithms for computing the Kullback-Leibler divergence. The algorithm is evaluated on gene expression data and on a classification example problem that is constructed from time series data.
机译:提出了一种用于在连续属性集和项目集之间插入模式的算法。与大多数模式挖掘方法相比,该算法将多个连续属性视为单个矢量属性。这种方法导致一个单独的抽象级别,并允许考虑多个矢量属性。我们表明,模式挖掘过程可以发现矢量数据和项目集之间的关系。根据这些模式进行的过滤可以看作是矢量属性级别的特征选择,而不是单个连续属性。在评估中,我们表明,与直接应用分类算法相比,模式挖掘算法可以更有效地实现此过滤。通过将项目数据与矢量空间内对象的分布相关联来识别模式,该对象空间由连续属性集跨越。 Kullback-Leibler散度提供了一种定量度量,可以确定由项目集定义的子集是否与数据点的整体分布不同。违反i.i.d假设的数据点的集-子集关系需要对用于计算Kullback-Leibler散度的标准算法进行调整。根据基因表达数据和根据时间序列数据构建的分类示例问题对算法进行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号