首页> 外文会议>IEEE International Conference on Bioinformatics and Bioengineering >Species Identification using Partial DNA Sequence: A Machine Learning Approach
【24h】

Species Identification using Partial DNA Sequence: A Machine Learning Approach

机译:使用部分DNA序列的物种鉴定:机器学习方法

获取原文

摘要

Species identification with partial DNA sequences has proved effective for different organisms. DNA barcode is a short genetic marker in an organism's DNA to identify which species it belongs to. In this work, we analyze the effectiveness of supervised machine learning methods to classify species with DNA barcode. We choose specimens from phylogenetically diverse species belonging to the animal, plant and fungus kingdoms. We consider the supervised machine learning methods, simple logistic function, random forest, PART, instance-based k-nearest neighbor, attribute-based classifier, and bagging. The analysis of results on various datasets shows that the classification performances of the selected methods are encouraging, and has an accuracy of 93.66% on average. This result shows 6% improvement compared to the state-of-art DNA barcode classification methods, which have 88.37% accuracy on average.
机译:用部分DNA序列的物种鉴定已证明对不同的生物有效。 DNA条形码是生物体DNA中的短遗传标记,以确定其所属的物种。在这项工作中,我们分析了监督机器学习方法的有效性与DNA条形码分类物种。我们选择属于动物,植物和真菌王国的系统源性不同物种的标本。我们考虑监督机器学习方法,简单的逻辑函数,随机林,部分,基于实例的基于邻居,基于属性的分类器和袋装。各种数据集的结果分析表明所选方法的分类性能令人鼓舞,平均精度为93.66%。与最先进的DNA条形码分类方法相比,该结果显示出6%的改进,平均具有88.37%的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号