...
首页> 外文期刊>Neural computing & applications >A neural network based multi-classifier system for gene identification in DNA sequences
【24h】

A neural network based multi-classifier system for gene identification in DNA sequences

机译:基于神经网络的用于DNA序列基因识别的多分类器系统

获取原文
获取原文并翻译 | 示例
           

摘要

The paper presents a neural network based multi-classifier system for the identification of Escheri-chia coli promoter sequences in strings of DNA. As each gene in DNA is preceded by a promoter sequence, the successful location of an E. coli promoter leads to the identification of the corresponding E. coli gene in the DNA sequence. A set of 324 known E. coli promoters and a set of 429 known non-promoter sequences were encoded using four different encoding methods. The encoded sequences were then used to train four different neural networks. The classification results of the four individual neural networks were then combined through an aggregation function, which used a variation of the logarithmic opinion pool method. The weights of this function were determined by a genetic algorithm. The multi-classifier system was then tested on 159 known promoter sequences and 171 non-promoter sequences not contained in the training set. The results obtained through this study proved that the same data set, when presented to neural networks in different forms, can provide slightly varying results. It also proves that when different opinions of more classifiers on the same input data are integrated within a multi-classifier system, we can obtain results that are better than the individual performances of the neural networks. The performances of our multi-classifier system outperform the results of other prediction systems for E. coli promoters developed so far.
机译:本文提出了一种基于神经网络的多分类器系统,用于鉴定DNA串中的大肠杆菌启动子序列。由于DNA中的每个基因都带有启动子序列,因此,大肠杆菌启动子的成功定位导致DNA序列中相应大肠杆菌基因的鉴定。使用四种不同的编码方法编码了一组324个已知的大肠杆菌启动子和一组429个已知的非启动子序列。然后将编码的序列用于训练四个不同的神经网络。然后,通过使用对数意见库方法的变体的聚合函数,将四个单独的神经网络的分类结果进行合并。该功能的权重通过遗传算法确定。然后在训练集中未包含的159个已知启动子序列和171个非启动子序列上测试了多分类器系统。通过这项研究获得的结果证明,当以不同形式将相同的数据集呈现给神经网络时,可以提供略有不同的结果。这也证明了,当将相同输入数据上更多分类器的不同意见整合到一个多分类器系统中时,我们可以获得的结果要优于神经网络的单个性能。我们的多分类器系统的性能优于迄今为止开发的其他针对大肠杆菌启动子的预测系统的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号