首页> 外文会议>International Conference on Genome Informatics >Detection of the Splicing Sites With Kernel Method Approaches Dealing With Nucleotide Doublets
【24h】

Detection of the Splicing Sites With Kernel Method Approaches Dealing With Nucleotide Doublets

机译:用核苷酸法处理核苷酸双峰的拼接位点检测拼接点

获取原文

摘要

The RNA-splicing is a very important phenomenon in eukaryotic cells to make pre mRNAs into mature forms. Despite its importance, biochemical details of the mechanism of RNA-splicing have not been understood completely. In this research, using machine learning approaches, we aimed to discriminate the true splice sites from pseudo sites which wouldn't be spliced. Support Vector Machine (SVM) is known as an efficient machine learning method used in many scientific disciplines including bioinformatics [3]. As several researchers have shown, DNA sequence data can be treated with adequate design of kernel functions i.e. core parts of SVM [1, 2, 4]. We focused on patterns of nucleotide doublets of DNA sequence and designed a kernel which might better represent the feature of sequence around splice sites. By examining more than 10,000 human splicing donor (5' ends of introns) and acceptor (3' ends of introns) sites, we found that the accuracies of this methods were 94.49% and 93.94% respectively. These results imply higher efficiency of this method than those observed with other kernel methods and methods based on first- and second-order Markov models. This method would reflect properties of sequence data and would give us insights for the mechanisms of the RNA-splicing.
机译:RNA剪接是真核细胞中的一个非常重要的现象,使前MRNA成熟形成。尽管其重要性,但尚未完全理解RNA剪接机制的生化细节。在这项研究中,使用机器学习方法,我们旨在区分来自伪站点的真正的剪接位点,这些位点不会被拼接。支持向量机(SVM)被称为在许多科学学科中使用的有效机器学习方法,包括生物信息学[3]。正如几个研究人员所示,DNA序列数据可以用足够的内核功能设计,即SVM [1,2,4]的核心部分。我们专注于DNA序列的核苷酸双重的模式,并设计了一个可能更好地代表剪接位点周围的序列特征的核。通过检查10,000多个人剪接供体(内含子的5'末端)和受体(内含子的3个'末端),我们发现该方法的精度分别为94.49%和93.94%。这些结果意味着这种方法的效率更高,而不是基于第一和二阶马尔可夫模型的其他内核方法和方法观察到的方法。该方法将反映序列数据的属性,并将为我们提供RNA拼接机制的见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号