Pattern-based similarity search for microarray data

机译：基于模式的相似性搜索微阵列数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

One fundamental task in near-neighbor search as well as other similarity matching efforts is to find a distance function that can efficiently quantify the similarity between two objects in a meaningful way. In DNA microarray analysis, the expression levels of two closely related genes may rise and fall synchronously in response to a set of experimental stimuli. Although the magnitude of their expression levels may not be close, the patterns they exhibit can be very similar. Unfortunately, none of the conventional distance metrics such as the Lp norm can model this similarity effectively. In this paper, we study the near-neighbor search problem based on this new type of similarity. We propose to measure the distance between two genes by subspace pattern similarity, i.e., whether they exhibit a synchronous pattern of rise and fall on a subset of dimensions. We then present an efficient algorithm for subspace near-neighbor search based on pattern similarity distance, and we perform tests on various data sets to show its effectiveness.

机译：邻近搜索以及其他相似性匹配工作中的一项基本任务是找到一种距离函数，该距离函数可以以有意义的方式有效地量化两个对象之间的相似性。在DNA芯片分析中，两个紧密相关的基因的表达水平可能会响应一组实验刺激而同步上升和下降。尽管它们表达水平的大小可能不接近，但它们表现出的模式可能非常相似。不幸的是，诸如L p 范数之类的常规距离度量标准都无法有效地对这种相似性进行建模。在本文中，我们研究了基于这种新型相似性的近邻搜索问题。我们建议通过子空间模式相似性来测量两个基因之间的距离，即它们是否在维度的子集上呈现出上升和下降的同步模式。然后，我们提出了一种基于模式相似距离的有效子空间近邻搜索算法，并对各种数据集进行了测试以证明其有效性。

著录项

来源
《ACM SIGKDD international conference on Knowledge discovery in data mining》|2005年|P.814-819|共6页
会议地点
作者
Haixun Wang; Jian Pei; Philip S. Yu; PHaixun Wang; PJian Pei;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
pattern recognition;

机译：模式识别;

相似文献

外文文献
中文文献
专利

1. Automated protein sequence database classification.I.Integration of compositional similarity search,local similarity search,and multiple sequence alignment [J] . Jerome Gracy... Bioinformatics . 1998,第2期

机译：自动化蛋白质序列数据库分类.I。组成相似性搜索，局部相似性搜索和多序列比对的整合
2. Similarity search in patents databases. The evaluations of the search quality [J] . B.L. Genin, D.S. Zolkin World Patent Information . 2021,第Mara期

机译：Patents数据库中的相似性搜索。搜索质量的评估
3. Novel DOCK clique driven 3D similarity database search tools for molecule shape matching and beyond: Adding flexibility to the search for ligand kin [J] . Good AC Journal of molecular graphics & modelling . 2007,第3期

机译：新颖的DOCK派系驱动的3D相似性数据库搜索工具，用于分子形状匹配及其他功能：增强了配体亲属搜索的灵活性
4. Pattern-based Similarity Search for Microarray Data [C] . Haixun Wang, Jian Pei, Philip S. Yu . 2005

机译：基于模式的芯片数据相似度搜索
5. Similarity Search on High Dimensional Data [D] . Liu, Yingfan. 2019

机译：相似性搜索高维数据
6. Pattern-based Search of Epigenomic Data Using GeNemo [O] . Alvin Zheng, Xiaoyi Cao, Sheng Zhong 2017

机译：使用GeNemo基于模式的表观基因组数据搜索
7. Automated protein sequence database classification. I. Integration of compositional similarity search, local similarity search, and multiple sequence alignment [O] . J. Gracy, P. Argos 1998

机译：自动蛋白质序列数据库分类。 I.集成组成相似性搜索，局部相似性搜索和多个序列对齐的集成

Pattern-based similarity search for microarray data

摘要

著录项

相似文献

相关主题

期刊订阅