Conference: 19th International Symposium on High Performance Distributed Computing, 2010

Exploring the RNA Folding Energy Landscape Using Scalable Distributed Cyberinfrastructure

Abstract

The increasing significance of RNAs in transcriptional or post-transcriptional gene regulation processes has generated considerable interest in predicting RNA folding and its sensitivity to environmental factors. We use Boltzmann-weighted sampling to generate RNA secondary structures, which are used to characterize the energy landscape via the distributions of energies and base-pair distances. Depending on the length of an RNA, the number of sequences investigated, and the sample size of generated structures, generating and analyzing sufficient samples can be computationally challenging. We introduce and develop a lightweight and extensible runtime environment that is effective across a range of RNA sizes and other parameters, as well as over a range of infrastructure, from traditional HPC grids to clouds, without requiring any changes at the application or user level. The Adaptive Distributed Application Management System (ADAMS) is built upon an extensible and interoperable pilot-job and supports the concurrent execution of a broad range of task sizes across a range of infrastructure. We use ADAMS to investigate the folding energy landscape for two RNA systems of different sizes: a set of S-adenosyl methionine (SAM) binding RNA sequences known as SAM-I riboswitches, and the 4092-nucleotide S gene of the Bovine Corona Virus (BCoV) RNA genome. The energy and base-pair distance distributions suggest different energy landscapes, implying different folding dynamics. With these results, we demonstrate the possibility of using this protocol to explore the microscopic origins of the reported sequence-dependent variation of binding affinity and gene expression in the two RNA systems.
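
The two landscape descriptors named in the abstract, Boltzmann weights over structure energies and base-pair distances between structures, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the abstract does not name the sampling tool, so the dot-bracket input format, the function names, the default temperature of 310.15 K, and the toy structures and energies below are all assumptions made for illustration.

```python
import math
from collections import Counter

GAS_CONSTANT_KCAL = 0.0019872  # kcal/(mol*K)


def base_pairs(structure):
    """Return the set of (i, j) base pairs encoded by a dot-bracket string."""
    stack, pairs = [], set()
    for pos, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(pos)
        elif symbol == ")":
            if not stack:
                raise ValueError("unbalanced structure: " + structure)
            pairs.add((stack.pop(), pos))
    if stack:
        raise ValueError("unbalanced structure: " + structure)
    return pairs


def base_pair_distance(a, b):
    """Count the base pairs present in exactly one of the two structures."""
    if len(a) != len(b):
        raise ValueError("structures must have the same length")
    return len(base_pairs(a) ^ base_pairs(b))


def boltzmann_weights(energies_kcal, temperature_k=310.15):
    """Normalized Boltzmann probabilities exp(-E/RT) / Z for the given energies."""
    rt = GAS_CONSTANT_KCAL * temperature_k
    e_min = min(energies_kcal)  # shift by the minimum for numerical stability
    factors = [math.exp(-(e - e_min) / rt) for e in energies_kcal]
    partition = sum(factors)
    return [f / partition for f in factors]


def distance_histogram(structures, reference):
    """Histogram of base-pair distances from each structure to a reference."""
    return Counter(base_pair_distance(s, reference) for s in structures)


# Hypothetical toy data, not taken from the paper.
toy_structures = ["((..))", "(....)", "......"]
toy_energies = [-1.5, -0.4, 0.0]  # kcal/mol, made up for illustration

weights = boltzmann_weights(toy_energies)
histogram = distance_histogram(toy_structures, reference="((..))")
```

In a real study the structures would come from a stochastic sampler that already draws them with Boltzmann probability, so the histogram of distances over the sample approximates the distribution directly; the explicit weights are shown here only to make the weighting concrete.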