DNA base-calling using polynomial classifiers

机译：使用多项式分类器进行DNA碱基检出

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Base-calling is one of many problems that can be solved using pattern recognition, the act of classifying raw data based on prior or statistical information extracted from the data into various classes. In this paper, we propose a new framework using polynomial classifiers to model electropherogram traces obtained from ABI sequencing machines to perform base-calling. Initially, pre-processing, which includes segmented normalization and peak sharpening, needs to be performed to reduce the imperfections caused in a trace as a result of the chemistry involved. Discriminative feature vectors are then extracted from the chromatogram traces and are expanded to a higher dimensional space by second order polynomial expansion. A linear classifier is then trained and bases are classified respectively. Chromatogram traces that were chosen for analysis belong to Homo sapiens, Saccharomyces mikatae and Drosophila melanogaster. Simulation results indicated an accuracy of up to 99.2% upon testing three different chromatogram traces consisting of about 600 to 800 bases each. The proposed model's performance was compared to the existing standards: ABI and PHRED in terms of insertion, deletion and substitution errors. Simulation evidence indicated that the designed model performs comparably or slightly better than ABI in terms of deletion and insertion errors. Moreover, polynomial classifier resulted in negligible substitution errors compared to ABI. Polynomial classifier was also observed to perform comparable to PHRED in terms of deletion error and substitution errors. The results obtained demonstrate the potential of this model to perform base-calling.

机译：碱基检出是可以使用模式识别解决的许多问题之一，模式识别是基于从数据中提取的先验或统计信息将原始数据分类为各种类别的行为。在本文中，我们提出了一个使用多项式分类器的新框架，以对从ABI测序仪获得的电泳图进行建模以执行碱基检出。最初，需要进行预处理，包括分段归一化和峰锐化，以减少痕量由于所涉及的化学反应而引起的缺陷。然后从色谱图中提取出可区分的特征向量，并通过二阶多项式展开将其展开到更高维的空间。然后训练线性分类器，并分别对基础进行分类。选择进行分析的色谱图痕迹属于智人，米酒酵母和果蝇。仿真结果表明，测试三种不同的色谱图痕迹（每条约600至800个碱基）时，准确度高达99.2％。将拟议模型的性能与现有标准（ABI和PHRED）在插入，删除和替换错误方面进行了比较。仿真证据表明，在删除和插入错误方面，设计模型的性能与ABI相当或稍好。此外，与ABI相比，多项式分类器产生的替代误差可忽略不计。在删除错误和替换错误方面，还观察到多项式分类器的性能可与PHRED媲美。获得的结果证明了该模型执行碱基检出的潜力。

著录项

来源
《The 2010 International Joint Conference on Neural Networks》|2010年|1-5|共5页
会议地点
作者
Mohammed Omniyah G.; Assaleh Khaled T.; Husseini Ghaleb A.; Majdalawieh Amin F.; Woodward Scott R.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工神经网络与计算;
关键词

相似文献

外文文献
中文文献
专利

1. Highly Conductive Nucleotide Analogue Facilitates Base-Calling in Quantum-Tunneling-Based DNA Sequencing [J] . Furuhata Takafumi, Ohshiro Takahito, Akimoto Gaku, ACS nano . 2019,第5期

机译：高导电核苷酸类似物促进量子隧道基DNA测序中的基础呼叫
2. An adaptive decorrelation method removes Illumina DNA base-calling errors caused by crosstalk between adjacent clusters [J] . Bo Wang, Lin Wan, Anqi Wang, Scientific reports. . 2017,第1期

机译：自适应去相关方法除去由相邻簇之间的串扰引起的Illumina DNA键呼叫误差
3. Novel algorithms for accurate DNA base-calling [J] . Omniyah G. Mohammed, Khaled T. Assaleh, Ghaleb A. Husseini, Journal of Biomedical Science and Engineering . 2013,第2期

机译：精确的DNA碱基调用的新算法
4. DNA base-calling using polynomial classifiers [C] . Mohammed Omniyah G., Assaleh Khaled T., Husseini Ghaleb A., The 2010 International Joint Conference on Neural Networks . 2010

机译：使用多项式分类器进行DNA碱基检出
5. Advances in SCA and RF-DNA fingerprinting through enhanced linear regression attacks and application of random forest classifiers. [D] . Patel, Hiren J. 2014

机译：通过增强的线性回归攻击和随机森林分类器的应用，在SCA和RF-DNA指纹识别方面取得了进展。
6. DNA Base-Calling from a Nanopore Using a Viterbi Algorithm [O] . Winston Timp, Jeffrey Comer, Aleksei Aksimentiev 2012

机译：使用维特比算法从纳米孔进行DNA碱基检出
7. Novel algorithms for accurate DNA base-calling [O] . Omniyah G. Mohammed, Khaled T. Assaleh, Ghaleb A. Husseini, 2013

机译：精确的DNA碱基调用的新算法

DNA base-calling using polynomial classifiers

摘要

著录项

相似文献

相关主题

期刊订阅