【24h】

Pattern-based similarity search for microarray data

机译:基于模式的相似性搜索微阵列数据

获取原文

摘要

One fundamental task in near-neighbor search as well as other similarity matching efforts is to find a distance function that can efficiently quantify the similarity between two objects in a meaningful way. In DNA microarray analysis, the expression levels of two closely related genes may rise and fall synchronously in response to a set of experimental stimuli. Although the magnitude of their expression levels may not be close, the patterns they exhibit can be very similar. Unfortunately, none of the conventional distance metrics such as the Lp norm can model this similarity effectively. In this paper, we study the near-neighbor search problem based on this new type of similarity. We propose to measure the distance between two genes by subspace pattern similarity, i.e., whether they exhibit a synchronous pattern of rise and fall on a subset of dimensions. We then present an efficient algorithm for subspace near-neighbor search based on pattern similarity distance, and we perform tests on various data sets to show its effectiveness.
机译:邻近搜索以及其他相似性匹配工作中的一项基本任务是找到一种距离函数,该距离函数可以以有意义的方式有效地量化两个对象之间的相似性。在DNA芯片分析中,两个紧密相关的基因的表达水平可能会响应一组实验刺激而同步上升和下降。尽管它们表达水平的大小可能不接近,但它们表现出的模式可能非常相似。不幸的是,诸如L p 范数之类的常规距离度量标准都无法有效地对这种相似性进行建模。在本文中,我们研究了基于这种新型相似性的近邻搜索问题。我们建议通过子空间模式相似性来测量两个基因之间的距离,即它们是否在维度的子集上呈现出上升和下降的同步模式。然后,我们提出了一种基于模式相似距离的有效子空间近邻搜索算法,并对各种数据集进行了测试以证明其有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号