
Biclustering of Linear Patterns In Gene Expression Data


Abstract

Identifying a bicluster, a submatrix of a gene expression dataset in which the genes behave similarly across the selected columns, is useful for discovering novel functional gene interactions. In this article, we introduce a new algorithm for finding biClusters with Linear Patterns (CLiP). Instead of solely maximizing Pearson correlation, we introduce a fitness function that also considers the correlation of complementary genes and conditions. This eliminates the need for a priori determination of the bicluster size. We employ both greedy search and a genetic algorithm for optimization, incorporating resampling for more robust discovery. When applied to both real and simulated datasets, our results show that CLiP outperforms existing methods. In analyzing RNA-seq fly and worm time-course data from modENCODE, we uncover a set of similarly expressed genes suggesting maternal dependence. Supplementary Material is available online (at www.liebertonline.com/cmb).
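
The abstract does not give the CLiP fitness function, so the following is only a minimal sketch of the general idea, under stated assumptions: a score that rewards high absolute Pearson correlation inside the candidate bicluster and penalizes correlation involving the complementary genes and conditions, optimized by a simple greedy toggle search. The function names (`row_corr`, `fitness`, `greedy_search`), the 0.5 penalty weight, and the minimum-size thresholds are hypothetical choices made for illustration, not taken from the paper. The genetic algorithm and resampling steps mentioned in the abstract are not shown.

```python
import numpy as np


def row_corr(a, b):
    """Pearson correlation between every row of `a` and every row of `b`."""
    a = a - a.mean(axis=1, keepdims=True)
    b = b - b.mean(axis=1, keepdims=True)
    denom = np.outer(np.linalg.norm(a, axis=1), np.linalg.norm(b, axis=1))
    with np.errstate(divide="ignore", invalid="ignore"):
        r = (a @ b.T) / denom
    return np.nan_to_num(r)  # constant rows are given correlation 0


def mean_pairwise_abs_corr(block):
    """Mean |r| over all distinct pairs of rows in `block`."""
    n = block.shape[0]
    r = np.abs(row_corr(block, block))
    return (r.sum() - np.trace(r)) / (n * (n - 1))


def fitness(X, rows, cols):
    """Illustrative score (an assumption, not the CLiP formula)."""
    rows = np.asarray(rows, dtype=bool)
    cols = np.asarray(cols, dtype=bool)
    if rows.sum() < 2 or cols.sum() < 3:
        return -np.inf  # too small for a meaningful correlation
    inside = X[np.ix_(rows, cols)]
    score_in = mean_pairwise_abs_corr(inside)
    # Penalty 1: selected genes vs. complementary genes, on selected conditions.
    out_genes = 0.0
    if (~rows).any():
        out_genes = np.abs(row_corr(inside, X[np.ix_(~rows, cols)])).mean()
    # Penalty 2: selected genes on the complementary conditions.
    out_conds = 0.0
    if (~cols).sum() >= 3:
        out_conds = mean_pairwise_abs_corr(X[np.ix_(rows, ~cols)])
    return score_in - 0.5 * (out_genes + out_conds)


def greedy_search(X, rows, cols, max_iter=100):
    """Toggle one gene or condition at a time, keeping only improving moves."""
    rows = np.array(rows, dtype=bool)
    cols = np.array(cols, dtype=bool)
    best = fitness(X, rows, cols)
    for _ in range(max_iter):
        improved = False
        for mask in (rows, cols):
            for i in range(mask.size):
                mask[i] = not mask[i]
                score = fitness(X, rows, cols)
                if score > best + 1e-12:
                    best, improved = score, True
                else:
                    mask[i] = not mask[i]  # revert a non-improving move
        if not improved:
            break
    return rows, cols, best
```

Because the score compares the bicluster against its complement rather than maximizing correlation alone, the search is free to grow or shrink the selection, which is one way to read the abstract's remark that the bicluster size need not be fixed in advance.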
