...
首页> 外文期刊>Frontiers in Genetics >Identifying differential transcription factor binding in ChIP-seq
【24h】

Identifying differential transcription factor binding in ChIP-seq

机译:识别ChIP-seq中的差异转录因子结合

获取原文
   

获取外文期刊封面封底 >>

       

摘要

ChIP seq is a widely used assay to measure genome-wide protein binding. The decrease in costs associated with sequencing has led to a rise in the number of studies that investigate protein binding across treatment conditions or cell lines. In addition to the identification of binding sites, new studies evaluate the variation in protein binding between conditions. A number of approaches to study differential transcription factor binding have recently been developed. Several of these methods build upon established methods from RNA-seq to quantify differences in read counts. We compare how these new approaches perform on different data sets from the ENCODE project to illustrate the impact of data processing pipelines under different study designs. The performance of normalization methods for differential ChIP-seq depends strongly on the variation in total amount of protein bound between conditions, with total read count outperforming effective library size, or variants thereof, when a large variation in binding was studied. Use of input subtraction to correct for non-specific binding showed a relatively modest impact on the number of differential peaks found and the fold change accuracy to biological validation, however a larger impact might be expected for samples with more extreme copy number variations between them. Still, it did identify a small subset of novel differential regions while excluding some differential peaks in regions with high background signal. These results highlight proper scaling for between-sample data normalization as critical for differential transcription factor binding analysis and suggest bioinformaticians need to know about the variation in level of total protein binding between conditions to select the best analysis method. At the same time, validation using fold-change estimates from qRT-PCR suggests there is still room for further method improvement.
机译:ChIP seq是一种广泛用于测定全基因组蛋白结合的检测方法。与测序相关的成本降低导致研究跨治疗条件或细胞系的蛋白质结合的研究数量增加。除了鉴定结合位点,新的研究还评估了条件之间蛋白质结合的变化。最近已经开发出许多研究差异转录因子结合的方法。这些方法中的几种基于RNA-seq的既定方法来量化读取计数的差异。我们比较了这些新方法如何对ENCODE项目的不同数据集执行效果,以说明不同研究设计下数据处理管道的影响。当研究结合的较大变化时,差异化ChIP-seq的标准化方法的性能在很大程度上取决于条件之间结合的蛋白质总量的变化,总读取数胜过有效文库大小或其变异。使用输入减法校正非特异性结合对发现的差异峰数和生物学验证的倍数变化准确性影响相对较小,但是对于它们之间存在更大拷贝数差异的样品,可能会有更大影响。尽管如此,它确实能识别出一小部分新颖的差分区域,同时排除了背景信号较高区域中的一些差分峰。这些结果强调了样本间数据归一化的适当缩放比例,这对于差异转录因子结合分析至关重要,并建议生物信息学家需要了解条件之间总蛋白质结合水平的变化,以选择最佳分析方法。同时,使用来自qRT-PCR的倍数变化估计值进行的验证表明,仍有进一步改进方法的空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号