首页> 外文学位 >Integration of gene predictions using artificial neural networks.
【24h】

Integration of gene predictions using artificial neural networks.

机译:使用人工神经网络整合基因预测。

获取原文
获取原文并翻译 | 示例

摘要

In eukaryotes, especially in human beings, only a small proportion of genomic DNA sequence consists of functional fragments that are called exons. Exons are separated by non-functional fragments called introns. Many whole-genome sequences are available in the public domain and accessible over the World Wide Web. The challenge to scientists today is to correctly identify genes and their functions from these genomic sequences. Although many prediction engines are available over the Web to facilitate such identification, their scope of application and their capacity for prediction are limited. This thesis integrates three of the most prominent such engines, GrailExp, GenScan, and MZEF using a Multilayer Perceptron and a Mixture of Experts neural networks, in order to improve the capability and confidence of prediction.; The system was trained using 575 predictions that are mapped to known target values from 33 human genomic sequences, and tested using the prediction results from another unrelated set of 28 human genomic sequences. This thesis has identified two major drawbacks to the accuracy of prediction by individual engines, (i) contradictory predictions even within an engine, and (ii) inconsistency of prediction between the engines. Analysis of variance was performed over the result, and demonstrates that the integration system has significantly better recovery, by 25% on average, than individual prediction engines. The system based on a multilayers perceptron is available for exon prediction of human genomic DNA at http://www.cbr.nrc.ca/pany/integ.html.
机译:在真核生物中,特别是在人类中,只有一小部分的基因组DNA序列由称为外显子的功能片段组成。外显子被称为内含子的非功能性片段隔开。许多全基因组序列可在公共领域获得,并可通过万维网访问。今天,科学家面临的挑战是从这些基因组序列中正确识别基因及其功能。尽管可以通过Web使用许多预测引擎来促进这种识别,但是它们的应用范围和预测能力受到限制。本文使用多层感知器专家混合神经网络集成了三个最著名的此类引擎GrailExp,GenScan和MZEF,以提高性能和预测的信心。该系统使用575个预测进行了训练,这些预测被映射到来自33个人类基因组序列的已知目标值,并使用另一组不相关的28个人类基因组序列的预测结果进行了测试。本论文确定了单个引擎的预测准确性的两个主要缺点,(i)即使在引擎内部也存在矛盾的预测,以及(ii)引擎之间的预测不一致。对结果进行了方差分析,结果表明,与单个预测引擎相比,集成系统的回收率平均明显好于25%。基于多层感知器的系统可用于http://www.cbr.nrc.ca/pany/integ.html上的人类基因组DNA外显子预测。

著录项

  • 作者

    Pan, Youlian.;

  • 作者单位

    Dalhousie University (Canada).;

  • 授予单位 Dalhousie University (Canada).;
  • 学科 Computer Science.; Artificial Intelligence.; Biology Molecular.
  • 学位 M.C.Sc.
  • 年度 2002
  • 页码 86 p.
  • 总页数 86
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;人工智能理论;分子遗传学;
  • 关键词

  • 入库时间 2022-08-17 11:46:08

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号