首页> 美国卫生研究院文献>Biology Direct >A high-performance approach for predicting donor splice sites based on short window size and imbalanced large samples
【2h】

A high-performance approach for predicting donor splice sites based on short window size and imbalanced large samples

机译:一种基于短窗口大小和不平衡大样本来预测供体剪接位点的高性能方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundSplice sites prediction has been a long-standing problem in bioinformatics. Although many computational approaches developed for splice site prediction have achieved satisfactory accuracy, further improvement in predictive accuracy is significant, for it is contributing to predict gene structure more accurately. Determining a proper window size before prediction is necessary. Overly long window size may introduce some irrelevant features, which would reduce predictive accuracy, while the use of short window size with maximum information may performs better in terms of predictive accuracy and time cost. Furthermore, the number of false splice sites following the GT–AG rule far exceeds that of true splice sites, accurate and rapid prediction of splice sites using imbalanced large samples has always been a challenge. Therefore, based on the short window size and imbalanced large samples, we developed a new computational method named chi-square decision table (χ2-DT) for donor splice site prediction.
机译:背景拼接位点预测一直是生物信息学中的一个长期存在的问题。尽管开发了许多用于剪接位点预测的计算方法,均获得了令人满意的准确性,但预测准确性的进一步提高是显着的,因为它有助于更​​准确地预测基因结构。在预测之前确定适当的窗口大小是必要的。太长的窗口大小可能会引入一些不相关的功能,这会降低预测精度,而将短窗口大小与最大信息结合使用可能会在预测精度和时间成本方面表现更好。此外,遵循GT–AG规则的错误剪接位点的数量远远超过了真正的剪接位点,使用不平衡的大样本准确,快速地预测剪接位点一直是一个挑战。因此,基于短窗口大小和大样本不平衡的情况,我们开发了一种新的计算方法,称为卡方决策表(χ 2 -DT),用于供体剪接位点预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号