首页> 美国卫生研究院文献>other >Read-mapping using personalized diploid reference genome for RNA sequencing data reduced bias for detecting allele-specific expression
【2h】

Read-mapping using personalized diploid reference genome for RNA sequencing data reduced bias for detecting allele-specific expression

机译:使用个性化二倍体参考基因组进行RNA测序数据的读取映射减少了检测等位基因特异性表达的偏差

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Next generation sequencing (NGS) technologies have been applied extensively in many areas of genetics and genomics research. A fundamental problem when comes to analyzing NGS data is mapping short sequencing reads back to the reference genome. Most of existing software packages rely on a single uniform reference genome and do not automatically take into the consideration of genetic variants. On the other hand, large proportions of incorrectly mapped reads affect the correct interpretation of the NGS experimental results. As an example, Degner et al. showed that detecting allele-specific expression from RNA sequencing data was biased toward the reference allele. In this study, we developed a method that utilize DirectX 11 enabled graphics processing unit (GPU)’s parallel computing power to produces a personalized diploid reference genome based on all known genetic variants of that particular individual. We show that using such a personalized diploid reference genome can improve mapping accuracy and significantly reduce the bias toward reference allele in allele-specific expression analysis. Our method can be applied to any individual that has genotype information obtained either from array-based genotyping or resequencing. Besides the reference genome, no additional changes to alignment algorithm are needed for performing read mapping therefore one can utilize any of the existing read mapping tools and achieve the improved read mapping result. C++ and GPU compute shader source code of the software program is available at: .
机译:下一代测序(NGS)技术已广泛应用于遗传学和基因组学研究的许多领域。分析NGS数据时的一个基本问题是将短测序读图映射回参考基因组。现有的大多数软件包都依赖一个统一的参考基因组,并且不会自动考虑遗传变异。另一方面,错误映射的读段的很大一部分会影响NGS实验结果的正确解释。例如,Degner等人。结果表明,从RNA测序数据中检测等位基因特异性表达偏向于参考等位基因。在这项研究中,我们开发了一种方法,该方法利用具有DirectX 11功能的图形处理单元(GPU)的并行计算能力,根据特定个体的所有已知遗传变异来生成个性化的二倍体参考基因组。我们显示,使用这种个性化的二倍体参考基因组可以提高定位精度,并显着降低等位基因特异性表达分析中对参考等位基因的偏倚。我们的方法可以应用于具有通过基于阵列的基因分型或重新测序获得基因型信息的任何个体。除参考基因组外,无需对比对算法进行任何其他更改即可执行读取映射,因此可以利用任何现有的读取映射工具来获得改进的读取映射结果。该软件程序的C ++和GPU计算着色器源代码位于:。

著录项

相似文献

  • 外文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号