...
首页> 外文期刊>BMC Bioinformatics >Gene characteristics predicting missense, nonsense and frameshift mutations in tumor samples
【24h】

Gene characteristics predicting missense, nonsense and frameshift mutations in tumor samples

机译:基因特征预测肿瘤样本中的畸形,废话和框架突变

获取原文
           

摘要

Because driver mutations provide selective advantage to the mutant clone, they tend to occur at a higher frequency in tumor samples compared to selectively neutral (passenger) mutations. However, mutation frequency alone is insufficient to identify cancer genes because mutability is influenced by many gene characteristics, such as size, nucleotide composition, etc. The goal of this study was to identify gene characteristics associated with the frequency of somatic mutations in the gene in tumor samples. We used data on somatic mutations detected by genome wide screens from the Catalog of Somatic Mutations in Cancer (COSMIC). Gene size, nucleotide composition, expression level of the gene, relative replication time in the cell cycle, level of evolutionary conservation and other gene characteristics (totaling 11) were used as predictors of the number of somatic mutations. We applied stepwise multiple linear regression to predict the number of mutations per gene. Because missense, nonsense, and frameshift mutations are associated with different sets of gene characteristics, they were modeled separately. Gene characteristics explain 88% of the variation in the number of missense, 40% of nonsense, and 23% of frameshift mutations. Comparisons of the observed and expected numbers of mutations identified genes with a higher than expected number of mutations- positive outliers. Many of these are known driver genes. A number of novel candidate driver genes was also identified. By comparing the observed and predicted number of mutations in a gene, we have identified known cancer-associated genes as well as 111 novel cancer associated genes. We also showed that adding the number of silent mutations per gene reported by genome/exome wide screens across all cancer type (COSMIC data) as a predictor substantially exceeds predicting accuracy of the most popular cancer gene predicting tool - MutsigCV.
机译:因为驾驶员突变为突变体克隆提供选择性的优势,所以与选择性中性(乘客)突变相比,它们倾向于在肿瘤样品中的较高频率发生。然而,单独的突变频率不足以鉴定癌症基因,因为可变性受到许多基因特征的影响,例如尺寸,核苷酸组合物等。本研究的目的是鉴定与基因中的体细胞突变频率相关的基因特性肿瘤样品。我们使用来自癌症(宇宙)的体细胞突变目录的基因组宽屏幕检测到的体细胞突变数据。基因尺寸,核苷酸组合物,基因的表达水平,细胞周期中的相对复制时间,进化守恒水平和其他基因特征(总共11)用作体细胞突变数量的预测因子。我们应用逐步多次线性回归来预测每个基因的突变数。因为畸形,废话和颤音突变与不同的基因特征组相关,所以它们被分开进行了建模。基因特征解释了88%的畸形次数,占废话的40%和23%的颤音突变。观察和预期突变数的比较鉴定了高于预期的突变阳性异常值的基因。其中许多是已知的驱动基因。还确定了许多新型候选司机基因。通过比较基因中的观察和预测的突变数,我们已经确定了已知的癌症相关基因以及111个新的癌症相关基因。我们还表明,通过基因组/外壳宽屏幕在所有癌症类型(宇宙数据)上增加了每种基因的沉默突变的数量,作为预测器,基本上超过了最受欢迎的癌症基因预测工具的预测精度 - mutsigcv。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号