首页> 外文期刊>Genes and genomics >SNP genotype calling and quality control for multi-batch-based studies
【24h】

SNP genotype calling and quality control for multi-batch-based studies

机译:基于批次研究的SNP基因型呼叫和质量控制

获取原文
获取原文并翻译 | 示例
           

摘要

BackgroundIn genetic analyses, the term batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required.MethodsIn this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately.ResultsThe proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer's disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects.ConclusionsWe proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.
机译:背景遗传分析,术语批量效应是指由批料异质性引起的系统差异。控制这种意想不到的效果是在分析之前的质量控制(QC)过程中最重要的一步。目前,批量效应不受统计数据控制,并且需要更新的方法。本报告,我们提出了一种新方法,以检测不同批次之间探针强度的异质性和在批次存在下调用基因型和QC的过程。影响。首先,我们对方差(MANOVA)进行了多元分析,以测试批次之间探针强度的差异。如果检测到异质性,则应使用使用每批探针强度测量的平均值的K-METIMOI算法聚类受试者,并且应分别调用不同簇中的受试者的基因型。拟议方法用于评估3619的基因分型数据由1074例阿尔茨海默病患者组成的受试者,296例,具有轻度认知障碍(MCI)和1153个对照。所提出的方法提高了所谓的基因型的准确性,无需过滤许多受试者和SNP,因此是控制批量效应的合理方法.Conclusionswe提出了一种以探针强度测量检测批量效应的新策略,并呼叫基因型存在批量效果。所提出的方法对实际数据的应用表明它产生了平衡的方法。此外,所提出的方法可以扩展到具有简单修改的​​各种场景。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号