首页> 美国卫生研究院文献>Journal of Biomolecular Techniques : JBT >Multi-Sample Pooling and Illumina Genome Analyzer Sequencing Methods to Determine Gene Sequence Variation for Database Development
【2h】

Multi-Sample Pooling and Illumina Genome Analyzer Sequencing Methods to Determine Gene Sequence Variation for Database Development

机译:用于确定数据库开发基因序列变异的多样本池和Illumina基因组分析仪测序方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Determination of sequence variation within a genetic locus to develop clinically relevant databases is critical for molecular assay design and clinical test interpretation, so multisample pooling for Illumina genome analyzer (GA) sequencing was investigated using the RET proto-oncogene as a model. Samples were Sanger-sequenced for RET exons 10, 11, and 13–16. Ten samples with 13 known unique variants (“singleton variants” within the pool) and seven common changes were amplified and then equimolar-pooled before sequencing on a single flow cell lane, generating 36 base reads. For comparison, a single “control” sample was run in a different lane. After alignment, a 24-base quality score-screening threshold and 3` read end trimming of three bases yielded low background error rates with a 27% decrease in aligned read coverage. Sequencing data were evaluated using an established variant detection method (percent variant reads), by the presented subtractive correction method, and with SNPSeeker software. In total, 41 variants (of which 23 were singleton variants) were detected in the 10 pool data, which included all Sanger-identified variants. The 23 singleton variants were detected near the expected 5% allele frequency (average 5.17%±0.90% variant reads), well above the highest background error (1.25%). Based on background error rates, read coverage, simulated 30, 40, and 50 sample pool data, expected singleton allele frequencies within pools, and variant detection methods; ≥30 samples (which demonstrated a minimum 1% variant reads for singletons) could be pooled to reliably detect singleton variants by GA sequencing.
机译:确定基因座中的序列变异以开发临床相关数据库对于分子测定设计和临床测试解释至关重要,因此,以RET原癌基因为模型,研究了Illumina基因组分析仪(GA)测序的多样品合并。对RET外显子10、11和13-16进行Sanger测序。扩增具有10个已知的13个独特变体(池中“单个变体”)和7个常见变化的样品,然后等摩尔汇集,然后在单个流通池通道上测序,产生36个碱基读数。为了进行比较,在不同泳道中运行了一个“对照”样品。对齐后,以24个碱基为基础的质量得分筛选阈值和三个碱基的3末端读取修整产生了较低的背景错误率,对齐的读取覆盖率降低了27%。使用已建立的变异检测方法(变异读数百分比),提出的减除校正方法以及SNPSeeker软件评估测序数据。在10个库数据中总共检测到41个变体(其中23个为单例变体),其中包括所有Sanger识别的变体。在预期的5%等位基因频率(平均读数为5.17%±0.90%变异)附近检测到23个单例变异,远高于最高背景误差(1.25%)。基于背景错误率,读取覆盖率,模拟的30、40和50个样本池数据,池中预期的单例等位基因频率以及变异检测方法;可以合并≥30个样本(证明单例的最小1%变异读取),以通过GA测序可靠地检测单例变异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号