首页> 外文期刊>Bioinformatics >Hierarchical hidden Markov model with application to joint analysis of ChIP-chip and ChIP-seq data
【24h】

Hierarchical hidden Markov model with application to joint analysis of ChIP-chip and ChIP-seq data

机译:分层隐马尔可夫模型在ChIP-chip和ChIP-seq数据联合分析中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Chromatin immunoprecipitation (ChIP) experiments followed by array hybridization, or ChIP-chip, is a powerful approach for identifying transcription factor binding sites (TFBS) and has been widely used. Recently, massively parallel sequencing coupled with ChIP experiments (ChIP-seq) has been increasingly used as an alternative to ChIP-chip, offering cost-effective genome-wide coverage and resolution up to a single base pair. For many well-studied TFs, both ChIP-seq and ChIP-chip experiments have been applied and their data are publicly available. Previous analyses have revealed substantial technology-specific binding signals despite strong correlation between the two sets of results. Therefore, it is of interest to see whether the two data sources can be combined to enhance the detection of TFBS.Results: In this work, hierarchical hidden Markov model (HHMM) is proposed for combining data from ChIP-seq and ChIP-chip. In HHMM, inference results from individual HMMs in ChIP-seq and ChIP-chip experiments are summarized by a higher level HMM. Simulation studies show the advantage of HHMM when data from both technologies co-exist. Analysis of two well-studied TFs, NRSF and CCCTC-binding factor (CTCF), also suggests that HHMM yields improved TFBS identification in comparison to analyses using individual data sources or a simple merger of the two.
机译:动机:染色质免疫沉淀(ChIP)实验,然后进行阵列杂交,或称为ChIP芯片,是鉴定转录因子结合位点(TFBS)的强大方法,已被广泛使用。最近,大规模并行测序与ChIP实验(ChIP-seq)结合已被越来越多地用作ChIP芯片的替代品,可提供经济高效的全基因组覆盖范围和高达单个碱基对的分辨率。对于许多经过充分研究的TF,ChIP-seq和ChIP芯片实验均已应用,其数据可公开获得。尽管两组结果之间有很强的相关性,但先前的分析显示出了实质性的技术特异性结合信号。因此,值得关注的是,是否可以将两个数据源组合在一起以增强对TFBS的检测。结果:在这项工作中,提出了分层的隐马尔可夫模型(HHMM),用于组合来自ChIP-seq和ChIP-chip的数据。在HHMM中,较高级别的HMM总结了ChIP-seq和ChIP芯片实验中各个HMM的推断结果。仿真研究表明,当两种技术的数据共存时,HHMM的优势。对两个经过充分研究的TFs,NRSF和CCCTC结合因子(CTCF)的分析也表明,与使用单个数据源或将两者简单合并的分析相比,HHMM可以提高TFBS的识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号