...
首页> 外文期刊>Bioinformatics >CpGFilter: model-based CpG probe filtering with replicates for epigenome-wide association studies
【24h】

CpGFilter: model-based CpG probe filtering with replicates for epigenome-wide association studies

机译:CpGFilter:基于模型的CpG探针过滤,带有复制,用于表观基因组范围的关联研究

获取原文
获取原文并翻译 | 示例
           

摘要

The development of the Infinium HumanMethylation450 BeadChip enables epigenome-wide association studies at a reduced cost. One observation of the 450K data is that many CpG sites the beadchip interrogates have very large measurement errors. Including these noisy CpGs will decrease the statistical power of detecting relevant associations due to multiple testing correction. We propose to use intra-class correlation coefficient (ICC), which characterizes the relative contribution of the biological variability to the total variability, to filter CpGs when technical replicates are available. We estimate the ICC based on a linear mixed effects model by pooling all the samples instead of using the technical replicates only. An ultra-fast algorithm has been developed to address the computational complexity and CpG filtering can be completed in minutes on a desktop computer for a 450K data set of over 1000 samples. Our method is very flexible and can accommodate any replicate design. Simulations and a real data application demonstrate that our whole-sample ICC method performs better than replicate-sample ICC or variance-based method.
机译:Infinium HumanMethylation450 BeadChip的开发可以降低成本进行表观基因组范围的关联研究。对450K数据的一项观察是,beadchip询问的许多CpG位置都具有非常大的测量误差。由于多次测试校正,包括这些嘈杂的CpG将降低检测相关联的统计能力。我们建议使用类内相关系数(ICC)来表征生物变异性对总变异性的相对贡献,以在技术复制可用时过滤CpG。我们通过合并所有样本而不是仅使用技术复制品,基于线性混合效应模型估算ICC。已经开发了一种超快速算法来解决计算复杂性,并且可以在台式计算机上在几分钟内完成CpG过滤,以处理超过1000个样本的450K数据集。我们的方法非常灵活,可以适应任何重复设计。仿真和实际数据应用表明,我们的整体样本ICC方法的性能优于重复样本ICC或基于方差的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号