Journal: Model Assisted Statistics and Applications

Repurposing kinship coefficients as a sample integrity method for next generation sequencing data in a clinical setting



Abstract

BACKGROUND AND OBJECTIVES: Kinship coefficients measure the relatedness between two individuals and are widely used in genetic applications. In this study, we repurpose the kinship coefficient to facilitate sample tracking and identify potential sample swaps. Such sample integrity metrics are particularly important in two scenarios that arise in large-scale clinical studies. In the first, multiple biological samples from the same individual are routinely processed as unique samples or technical replicates; querying the relatedness of the genomic data of two samples can identify sample swaps before they are inappropriately included in data analysis. In the second, different biological analytes from the same samples are run across multiple platforms, and it is critical to establish the correct mapping for each individual sample, linking genomic information derived from multiple platforms to the same sample. In both cases, all downstream inferences rely on this mapping being correct. Kinship coefficients can directly measure mapping accuracy and help ensure the required sample integrity.

MATERIALS AND METHODS: We first describe the general concept of kinship coefficients, then focus on novel adaptations to feature selection (i.e. of variants and/or SNPs) that use expressed variants to make the method suitable for the clinical setting.

RESULTS: We illustrate the adapted kinship coefficient estimates in two studies: one on lung fibrosis, in which multiple samples were routinely collected from each patient, and one on thyroid cancers, in which a cohort of samples was run on different platforms.

CONCLUSION: We demonstrate the effectiveness of using kinship coefficients to improve sample integrity and discuss potential improvements to the methodology.
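The abstract does not say which kinship estimator the authors use, so the following is only a minimal sketch of the general idea, not their method. It assumes a KING-robust-style estimator computed from biallelic genotypes coded as alternate-allele counts (0, 1, 2, with `None` for missing calls). The function names, the genotype coding, and the 0.354 cutoff for "same individual" (a conventional duplicate/monozygotic-twin threshold from the KING software, not from this paper) are all assumptions made for illustration.

```python
from typing import Optional, Sequence

Genotype = Optional[int]  # 0, 1 or 2 alternate alleles; None = missing call


def kinship_estimate(g1: Sequence[Genotype], g2: Sequence[Genotype]) -> float:
    """KING-robust-style kinship estimate between two samples (assumed estimator).

    phi = (N_het_both - 2 * N_opposite_hom) / (N_het_1 + N_het_2),
    counted over variants called in both samples.
    """
    het_both = opposite_hom = het1 = het2 = 0
    for a, b in zip(g1, g2):
        if a is None or b is None:
            continue  # skip variants missing in either sample
        if a == 1:
            het1 += 1
        if b == 1:
            het2 += 1
        if a == 1 and b == 1:
            het_both += 1
        if {a, b} == {0, 2}:
            opposite_hom += 1  # opposite homozygotes share no allele
    denom = het1 + het2
    if denom == 0:
        raise ValueError("no heterozygous calls; estimate is undefined")
    return (het_both - 2 * opposite_hom) / denom


def looks_like_same_individual(phi: float, threshold: float = 0.354) -> bool:
    """Hypothetical helper: treat phi above the assumed cutoff as one individual."""
    return phi > threshold


if __name__ == "__main__":
    sample_a = [0, 1, 2, 1, 0, 1]
    replicate_of_a = [0, 1, 2, 1, 0, 1]   # same individual, e.g. second platform
    unrelated = [2, 1, 0, 0, 2, 1]        # toy genotypes, not real data
    for label, other in [("replicate", replicate_of_a), ("other", unrelated)]:
        phi = kinship_estimate(sample_a, other)
        print(label, round(phi, 3), looks_like_same_individual(phi))
```

Under this estimator, two samples from the same individual give a value near 0.5, so a pair that is labelled as the same patient but scores far below that would be a candidate sample swap. In practice the variant set matters a great deal; the abstract indicates that the authors select expressed variants, which this toy example does not attempt to reproduce.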
