首页> 美国卫生研究院文献>Scientific Reports >HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
【2h】

HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine

机译:HRGPred:具有k-mer核苷酸组成特征和支持向量机的除草剂抗性基因预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Herbicide resistance (HR) is a major concern for the agricultural producers as well as environmentalists. Resistance to commonly used herbicides are conferred due to mutation(s) in the genes encoding herbicide target sites/proteins (GETS). Identification of these genes through wet-lab experiments is time consuming and expensive. Thus, a supervised learning-based computational model has been proposed in this study, which is first of its kind for the prediction of seven classes of GETS. The cDNA sequences of the genes were initially transformed into numeric features based on the k-mer compositions and then supplied as input to the support vector machine. In the proposed SVM-based model, the prediction occurs in two stages, where a binary classifier in the first stage discriminates the genes involved in conferring the resistance to herbicides from other genes, followed by a multi-class classifier in the second stage that categorizes the predicted herbicide resistant genes in the first stage into any one of the seven resistant classes. Overall classification accuracies were observed to be ~89% and >97% for binary and multi-class classifications respectively. The proposed model confirmed higher accuracy than the homology-based algorithms viz., BLAST and Hidden Markov Model. Besides, the developed computational model achieved ~87% accuracy, while tested with an independent dataset. An online prediction server HRGPred () has also been established to facilitate the prediction of GETS by the scientific community.
机译:抗除草剂性(HR)是农业生产者和环保主义者的主要关注点。由于编码除草剂靶位/蛋白质(GETS)的基因的突变,赋予了对常用除草剂的抗性。通过湿实验室实验鉴定这些基因既费时又昂贵。因此,本研究提出了一种基于监督学习的计算模型,该模型首次用于预测7类GETS。该基因的cDNA序列首先根据k-mer组成转化为数字特征,然后作为输入提供给支持向量机。在提出的基于SVM的模型中,预测分两个阶段进行,第一阶段的二元分类器将涉及赋予除草剂抗性的基因与其他基因区分开,第二阶段的多分类器将其分类在第一阶段将预测的除草剂抗性基因分为七个抗性类别中的任何一个。对于二分类分类和多分类分类,整体分类精度分别为〜89%和> 97%。与基于同源性的算法,即BLAST和Hidden Markov模型相比,该模型的准确性更高。此外,在使用独立数据集进行测试时,开发的计算模型达到了约87%的准确性。还建立了在线预测服务器HRGPred(),以促进科学界对GETS的预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号