DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach

Akdes Serin; Martin Vingron

首页> 外文期刊>Algorithms for Molecular Biology >DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach

【24h】

DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach

机译：DeBi：使用频繁项集方法发现差异表达的Biclusters

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Background The analysis of massive high throughput data via clustering algorithms is very important for elucidating gene functions in biological systems. However, traditional clustering methods have several drawbacks. Biclustering overcomes these limitations by grouping genes and samples simultaneously. It discovers subsets of genes that are co-expressed in certain samples. Recent studies showed that biclustering has a great potential in detecting marker genes that are associated with certain tissues or diseases. Several biclustering algorithms have been proposed. However, it is still a challenge to find biclusters that are significant based on biological validation measures. Besides that, there is a need for a biclustering algorithm that is capable of analyzing very large datasets in reasonable time. Results Here we present a fast biclustering algorithm called DeBi (Differentially Expressed BIclusters). The algorithm is based on a well known data mining approach called frequent itemset. It discovers maximum size homogeneous biclusters in which each gene is strongly associated with a subset of samples. We evaluate the performance of DeBi on a yeast dataset, on synthetic datasets and on human datasets. Conclusions We demonstrate that the DeBi algorithm provides functionally more coherent gene sets compared to standard clustering or biclustering algorithms using biological validation measures such as Gene Ontology term and Transcription Factor Binding Site enrichment. We show that DeBi is a computationally efficient and powerful tool in analyzing large datasets. The method is also applicable on multiple gene expression datasets coming from different labs or platforms.

机译：背景技术通过聚类算法分析大量的高通量数据对于阐明生物系统中的基因功能非常重要。但是，传统的聚类方法有几个缺点。通过同时对基因和样本进行分组，聚类克服了这些限制。它发现在某些样品中共表达的基因子集。最近的研究表明，双聚类技术在检测与某些组织或疾病相关的标记基因方面具有巨大潜力。已经提出了几种双簇算法。但是，基于生物学验证方法来找到重要的双聚类仍然是一个挑战。除此之外，还需要一种能够在合理的时间内分析非常大的数据集的双重聚类算法。结果在这里，我们提出了一种称为DeBi（差异表达的BIclusters）的快速双聚类算法。该算法基于一种称为频繁项集的众所周知的数据挖掘方法。它发现最大尺寸的均质双聚簇，其中每个基因与样品的一个子集紧密相关。我们评估DeBi在酵母数据集，合成数据集和人类数据集上的性能。结论我们证明，与使用生物学验证方法（例如基因本体论术语和转录因子结合位点富集）的标准聚类或双聚类算法相比，DeBi算法在功能上提供了更一致的基因集。我们证明DeBi是分析大型数据集的一种计算有效且功能强大的工具。该方法还适用于来自不同实验室或平台的多个基因表达数据集。

著录项

来源
《Algorithms for Molecular Biology》 |2011年第1期|共页
作者
Akdes Serin; Martin Vingron;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类分子生物学;
关键词

相似文献

外文文献
中文文献
专利

1. Discovering frequent itemsets over transactional data streams through an efficient and stable approximate approach [J] . Kuen-Fang Jea, Chao-Wei Li Expert systems with applications . 2009,第10期

机译：通过高效，稳定的近似方法发现交易数据流上的频繁项目集
2. HighPU: a high privacy-utility approach to mining frequent itemset with differential privacy [J] . Yabin Wang, Yi Qiao, Zhaobin Liu, International Journal of Embedded Systems . 2019,第5期

机译：HIGHPU：具有差异隐私的频繁替代项目集的高隐私实用方法
3. A categorical network approach for discovering differentially expressed regulations in cancer [J] . Nikolay Balov BMC Medical Genomics . 2013,第SUPPLEMENTa3期

机译：发现癌症中差异表达调控的分类网络方法
4. A Frequent Item Graph Approach for Discovering Frequent Itemsets [C] . Kumar A. V. Senthil, Wahidabanu R. S. D. International Conference on Advanced Computer Theory and Engineering . 2008

机译：频繁的项目图形方法，用于发现频繁的项目集
5. Frequent Itemset Hiding Algorithm Using Frequent Pattern Tree Approach. [D] . Alnatsheh, Rami. 2012

机译：使用频繁模式树方法的频繁项集隐藏算法。
6. DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach [O] . Akdes Serin, Martin Vingron 2011

机译：DeBi：使用频繁项集方法发现差异表达的Biclusters
7. DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach [O] . Akdes Serin, Martin Vingron 2011

机译：DeBi：使用频繁项集方法发现差异表达的Biclusters

DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅