首页> 外文期刊>Genes and genomics >SM-RCNV: a statistical method to detect recurrent copy number variations in sequenced samples
【24h】

SM-RCNV: a statistical method to detect recurrent copy number variations in sequenced samples

机译:SM-RCNV:检测测序样本中复制拷贝数变异的统计方法

获取原文
获取原文并翻译 | 示例
           

摘要

BackgroundCopy number variation (CNV) is an important form of genomic structural variation and is linked to dozens of human diseases. Using next-generation sequencing (NGS) data and developing computational methods to characterize such structural variants is significant for understanding the mechanisms of diseases.ObjectiveThe objective of this study is to develop a new statistical method of detection recurrent CNVs across multiple samples from genomic sequences.MethodsA statistical method is carried out to detect recurrent CNVs, referred to as SM-RCNV. This method uses a statistic associated with each location by combining the frequency of variation at one location across whole samples and the correlation among consecutive locations. The weights of the frequency and correlation are trained using real datasets with known CNVs. P-value is assessed for each location on the genome by permutation testing.ResultsCompared with six peer methods, SM-RCNV outperforms the peer methods under receiver operating characteristic curves. SM-RCNV successfully identifies many consistent recurrent CNVs, most of which are known to be of biological significance and associated with diseased genes. The validation rate of SM-RCNV in the CEU call set and YRI call set with Database of Genomic Variants are 258/328 (79%) and (157/309) 51%, respectively.ConclusionSM-RCNV is a well-grounded statistical framework for detecting recurrent CNVs from multiple genomic sequences, providing valuable information to study genomes in human diseases. The source code is freely available at https://sourceforge.net/projects/sm-rcnv/.
机译:BackgroundCopy数变异(CNV)是基因组结构变异的重要形式,与数十种人类疾病联系在一起。使用下一代测序(NGS)数据和开发计算方法表征这种结构变体对理解疾病的机制来说是重要的。目的本研究的目的是在来自基因组序列的多个样品上开发一种检测复发性CNV的新统计方法。方法进行统计方法以检测复发性CNV,称为SM-RCNV。该方法通过将跨整个样本的一个位置处的变化频率和连续位置之间的相关性相结合,使用与每个位置相关联的统计。使用具有已知CNV的真实数据集进行频率和相关性的权重。通过置换测试对基因组上的每个位置进行评估p值。与六个对等方法进行评分,SM-RCNV优于接收器操作特性曲线下的对等方法。 SM-RCNV成功地识别许多一致的复发性CNV,其中大部分是已知的生物学意义和与患病基因相关的。 CEU呼叫集中的SM-RCNV的验证率和具有基因组变体数据库的YRI呼叫集是258/328(79%)和(157/309)分别为51%.ConclusionsM-RCNV是一个良好接地的统计框架用于检测来自多种基因组序列的复发性CNV,提供有价值的信息,以研究人类疾病中的基因组。源代码在https://sourceforge.net/projects/sm-rcnv/上自由使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号