首页> 美国政府科技报告 >Pattern recognition in DNA sequences: The intron-exon junction problem.
【24h】

Pattern recognition in DNA sequences: The intron-exon junction problem.

机译:DNa序列中的模式识别:内含子 - 外显子连接问题。

获取原文

摘要

One of the fundamental problems facing the field of genomic sequence analysis is the difficulty in locating relatively small coding regions of DNA within the much larger non-coding regions. Neural networks, linguistic analysis and various types of expert systems have been used with various degrees of success to address this problem. We have developed several methods for recognizing the presence of splice junctions and coding DNA which are based on artificial intelligence, linguistic and statistical approaches. The triplet vocabulary in and around splice junctions has been analyzed for primates, and the occurrences of preferred triplets in potential junctions seems to be a very selective method for distinguishing true junctions from otherwise similar sequences. given a 50% mix of true and false junctions, this method scores 93%--95% correct. Several approaches have been used to identify exons. These include a frame bias matrix algorithm and an algorithm which estimates the fractal dimension of dinucleotide usage. Attempts are underway to combine the outputs of the various methods using a rule-based approach to improve the overall performance of these predictors. 13 refs., 4 figs.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号