首页> 外文期刊>Analytical and Bioanalytical Chemistry >Automated single-nucleotide polymorphism analysis using fluorescence excitation–emission spectroscopy and one-class classifiers
【24h】

Automated single-nucleotide polymorphism analysis using fluorescence excitation–emission spectroscopy and one-class classifiers

机译:使用荧光激发-发射光谱和一类分类器自动进行单核苷酸多态性分析

获取原文
获取原文并翻译 | 示例
           

摘要

We have developed a new method of highly automated SNP (single nucleotide polymorphism) analysis for identification of genotypes. The data were generated by the Taqman reaction. A total of 18 half-plates were analysed for different genes, each consisting of 48 wells, including six synthetic DNA samples, three background samples, and 39 human DNA samples. Fluorescence spectra were obtained from each well. The characteristics of the spectra depended on whether the genotype originated from one of three classes—homozygotic wild-type, mutant, or heterozygote. The main problems are: (1) spectral variation from one half-plate to another is sometimes very substantial; (2) the spectra of heterozygotic samples vary substantially; (3) outliers are common; and (4) not all possible alleles are represented on each half-plate so the number of types of spectra can vary, depending on the gene being analysed. We solved these problems by using a signal-standardisation technique (piecewise direct standardisation, PDS) and then built two one-class classifiers based on PCA models (PCA data description) to identify the two types of homozygote. The remaining samples were tested to see whether they could be approximated well by a linear combination of the spectra of two types of homozygote. If they could, they were identified as heterozygotic; if not, they were identified as outliers. The results are characterised by very low false-positive errors and 2 to 6% overall false-negative errors.
机译:我们已经开发出一种用于鉴定基因型的高度自动化的SNP(单核苷酸多态性)分析的新方法。数据是通过塔克曼反应产生的。分析了总共18个半板的不同基因,每个半板由48个孔组成,包括6个合成DNA样品,3个背景样品和39个人DNA样品。从每个孔获得荧光光谱。光谱的特征取决于基因型是否来自三类之一:纯合野生型,突变型或杂合子。主要问题是:(1)从一个半板到另一半板的光谱变化有时非常大; (2)杂合子样品的光谱差异很大; (3)异常值很常见; (4)并非所有可能的等位基因都显示在每个半板上,因此光谱类型的数量可以变化,具体取决于所分析的基因。我们通过使用信号标准化技术(逐段直接标准化,PDS)解决了这些问题,然后基于PCA模型(PCA数据描述)建立了两个一类分类器,以识别两种纯合子。测试其余样品,看是否可以通过两种纯合子光谱的线性组合很好地近似它们。如果可以的话,它们被鉴定为杂合的。如果不是,则将它们识别为离群值。结果的特征在于极低的假阳性误差和2%至6%的总假阴性误差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号