Approximate Bayesian feature selection on a large meta-dataset offers novel insights on factors that effect siRNA potency

Abstract

Motivation: Short interfering RNA (siRNA)-induced RNA interference is an endogenous pathway for sequence-specific gene silencing. The potency with which different siRNAs inhibit a common target varies greatly, and the features affecting inhibition are of high current interest. The limited success reported so far in predicting siRNA potency may stem from the small number and heterogeneity of available datasets, and from the knowledge-driven, empirical basis on which features thought to affect siRNA potency are often chosen. We attempt to overcome these problems by first constructing a meta-dataset of 6483 publicly available siRNAs (targeting mammalian mRNA), the largest to date, and then applying a Bayesian analysis that accommodates feature set uncertainty. A stochastic logistic regression-based algorithm is designed to explore a vast model space of 497 compositional, structural and thermodynamic features, identifying associations with siRNA potency.

Results: Our algorithm reveals a number of features associated with siRNA potency that are, to the best of our knowledge, either under-reported in the literature, such as the anti-sense 5′-3′ motif ‘UCU’, or not reported at all, such as the anti-sense 5′-3′ motif ‘ACGA’. These findings should aid in improving future siRNA potency predictions and might offer further insights into the working of the RNA-induced silencing complex (RISC).

Supplementary information: Supplementary information is available at Bioinformatics online.
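
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of a stochastic, logistic regression-based search over feature subsets. It assumes a Metropolis-Hastings walk that flips one feature at a time, a uniform prior over subsets, and a BIC approximation to each subset's marginal likelihood. These choices, all function names, the motif "GGG" and the toy data are hypothetical stand-ins, not the authors' method or results.

```python
import math
import random

def motif_features(antisense, motifs):
    """Binary indicators: 1 if the antisense strand (read 5'->3') contains the motif."""
    return [1 if m in antisense else 0 for m in motifs]

def max_log_lik(rows, y, steps=200, lr=0.5):
    """Approximately maximised logistic log-likelihood via plain gradient ascent."""
    w = [0.0] * len(rows[0])
    for _ in range(steps):
        grad = [0.0] * len(w)
        for x, t in zip(rows, y):
            z = sum(wi * xi for wi, xi in zip(w, x))
            # Numerically stable sigmoid.
            p = 1.0 / (1.0 + math.exp(-z)) if z >= 0 else math.exp(z) / (1.0 + math.exp(z))
            for j, xj in enumerate(x):
                grad[j] += (t - p) * xj
        w = [wi + lr * g / len(rows) for wi, g in zip(w, grad)]
    ll = 0.0
    for x, t in zip(rows, y):
        z = sum(wi * xi for wi, xi in zip(w, x))
        ll += t * z - (max(z, 0.0) + math.log1p(math.exp(-abs(z))))
    return ll

def inclusion_probabilities(X, y, n_iter=500, seed=0):
    """Metropolis-Hastings over feature subsets; returns per-feature inclusion frequencies."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    cache = {}

    def score(gamma):
        # BIC approximation to the log marginal likelihood (assumption of this sketch).
        key = tuple(gamma)
        if key not in cache:
            rows = [[1.0] + [x[j] for j in range(p) if gamma[j]] for x in X]
            cache[key] = max_log_lik(rows, y) - 0.5 * (1 + sum(gamma)) * math.log(n)
        return cache[key]

    gamma = [0] * p                    # start from the intercept-only model
    current = score(gamma)
    counts = [0] * p
    for _ in range(n_iter):
        proposal = gamma[:]
        j = rng.randrange(p)
        proposal[j] = 1 - proposal[j]  # symmetric proposal: flip one feature
        new = score(proposal)
        if rng.random() < math.exp(min(0.0, new - current)):
            gamma, current = proposal, new
        for k in range(p):
            counts[k] += gamma[k]
    return [c / n_iter for c in counts]

# Toy data only: random 19-mers with an arbitrary labelling rule, not real siRNA measurements.
rng = random.Random(1)
motifs = ["UCU", "ACGA", "GGG"]
seqs = ["".join(rng.choice("ACGU") for _ in range(19)) for _ in range(60)]
y = [1 if "UCU" in s else 0 for s in seqs]
X = [motif_features(s, motifs) for s in seqs]
probs = inclusion_probabilities(X, y)
for m, pr in zip(motifs, probs):
    print(m, round(pr, 2))
```

The inclusion frequencies play the role of approximate posterior inclusion probabilities: a feature visited in most sampled subsets is one the data support under these assumptions. With 497 features the subset space cannot be enumerated, which is why a stochastic search of this general kind is used instead of scoring every model.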