Molecular Biology and Evolution

Legacy Data Confound Genomics Studies


Abstract

Recent reports have identified differences in the mutational spectra across human populations. Although some of these reports have been replicated in other cohorts, most have been reported only in the 1000 Genomes Project (1kGP) data. While investigating an intriguing putative population stratification within the Japanese population, we identified a previously unreported batch effect leading to spurious mutation calls in the 1kGP data and to the apparent population stratification. Because the 1kGP data are used extensively, we find that the batch effects also lead to incorrect imputation by leading imputation servers and a small number of suspicious GWAS associations. Lower quality data from the early phases of the 1kGP thus continue to contaminate modern studies in hidden ways. It may be time to retire or upgrade such legacy sequencing data.
