首页> 美国卫生研究院文献>other >OccuPeak: ChIP-Seq Peak Calling Based on Internal Background Modelling
【2h】

OccuPeak: ChIP-Seq Peak Calling Based on Internal Background Modelling

机译:OccuPeak:基于内部背景建模的ChIP-Seq峰调用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

ChIP-seq has become a major tool for the genome-wide identification of transcription factor binding or histone modification sites. Most peak-calling algorithms require input control datasets to model the occurrence of background reads to account for local sequencing and GC bias. However, the GC-content of reads in Input-seq datasets deviates significantly from that in ChIP-seq datasets. Moreover, we observed that a commonly used peak calling program performed equally well when the use of a simulated uniform background set was compared to an Input-seq dataset. This contradicts the assumption that input control datasets are necessary to fatefully reflect the background read distribution. Because the GC-content of the abundant single reads in ChIP-seq datasets is similar to those of randomly sampled regions we designed a peak-calling algorithm with a background model based on overlapping single reads. The application, OccuPeak, uses the abundant low frequency tags present in each ChIP-seq dataset to model the background, thereby avoiding the need for additional datasets. Analysis of the performance of OccuPeak showed robust model parameters. Its measure of peak significance, the excess ratio, is only dependent on the tag density of a peak and the global noise levels. Compared to the commonly used peak-calling applications MACS and CisGenome, OccuPeak had the highest sensitivity in an enhancer identification benchmark test, and performed similar in an overlap tests of transcription factor occupation with DNase I hypersensitive sites and H3K27ac sites. Moreover, peaks called by OccuPeak were significantly enriched with cardiac disease-associated SNPs. OccuPeak runs as a standalone application and does not require extensive tweaking of parameters, making its use straightforward and user friendly. Availability:
机译:ChIP-seq已成为在全基因组范围内识别转录因子结合或组蛋白修饰位点的主要工具。大多数调峰算法都需要输入控制数据集来模拟背景读数的发生,以解决本地测序和GC偏倚问题。但是,Input-seq数据集中的读数的GC含量与ChIP-seq数据集中的读数明显不同。此外,我们观察到,当将模拟的统一背景集与Input-seq数据集进行比较时,常用的峰调用程序表现同样出色。这与输入控制数据集对于确定性地反映背景读取分布所必需的假设相矛盾。由于ChIP-seq数据集中大量单次读取的GC含量与随机采样区域的GC含量相似,因此我们设计了一种基于背景模型的峰调用算法,该算法基于重叠的单次读取。 OccuPeak应用程序使用每个ChIP-seq数据集中存在的大量低频标签来对背景进行建模,从而避免了对其他数据集的需求。 OccuPeak性能的分析显示了健壮的模型参数。其对峰的显着性(即过量比)的度量仅取决于峰的标签密度和整体噪声水平。与常用的峰调用应用程序MACS和CisGenome相比,OccuPeak在增强子识别基准测试中具有最高的灵敏度,并且在转录因子占用与DNase I超敏位点和H3K27ac位点的重叠测试中表现出相似的性能。此外,OccuPeak调用的峰显着富集了与心脏病相关的SNP。 OccuPeak作为一个独立的应用程序运行,不需要大量的参数调整,从而使其使用直接且用户友好。可用性:

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号