...
首页> 外文期刊>PLoS One >Characterization of histone modification patterns and prediction of novel promoters using functional principal component analysis
【24h】

Characterization of histone modification patterns and prediction of novel promoters using functional principal component analysis

机译:用功能主成分分析表征组蛋白修饰模式和新型启动子预测

获取原文
           

摘要

Characterization of distinct histone methylation and acetylation binding patterns in promoters and prediction of novel regulatory regions remains an important area of genomic research, as it is hypothesized that distinct chromatin signatures may specify unique genomic functions. However, methods that have been proposed in the literature are either descriptive in nature or are fully parametric and hence more restrictive in pattern discovery. In this article, we propose a two-step non-parametric statistical inference procedure to characterize unique histone modification patterns and apply it to analyzing the binding patterns of four histone marks, H3K4me2, H3K4me3, H3K9ac, and H4K20me1, in human B-lymphoblastoid cells. In the first step, we used a functional principal component analysis method to represent the concatenated binding patterns of these four histone marks around the transcription start sites as smooth curves. In the second step, we clustered these curves to reveal several unique classes of binding patterns. These uncovered patterns were used in turn to scan the whole-genome to predict novel and alternative promoters. Our analyses show that there are three distinct promoter binding patterns of active genes. Further, 19654 regions not within known gene promoters were found to overlap with human ESTs, CpG islands, or common SNPs, indicative of their potential role in gene regulation, including being potential novel promoter regions.
机译:促进剂中不同组蛋白甲基化和乙酰化结合模式的表征仍然是基因组研究的重要领域,因为它被假设,不同的染色质签名可以指定独特的基因组功能。然而,在文献中提出的方法在自然界中描述或者是完全参数的,因此在模式发现中更具限制性。在本文中,我们提出了一种两步的非参数统计推理程序,以表征独特的组蛋白修饰模式,并将其应用于人B淋巴细胞细胞中的四个组蛋白标记,H3K4ME2,H3K4ME3,H3K9AC和H4K20ME1的结合模式。 。在第一步中,我们使用了功能性主成分分析方法,以表示转录起始位点周围的这四个组蛋标记的连接绑定模式作为平滑曲线。在第二步中,我们聚集了这些曲线以揭示几种独特的绑定模式。这些未涂覆的模式反过来使用扫描全基因组以预测新颖和替代的启动子。我们的分析表明,存在有三种不同的活性基因的启动子结合模式。此外,发现19654年不在已知的基因启动子内与人们,CpG岛或常见的SNP重叠,这表明它们在基因调节中的潜在作用,包括潜在的新型启动子区域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号