Data-Farming: Systemic Tree-like Infrastructure Development of the Data- and Knowledge Bases on functional DNA and RNA sites

Abstract

At the molecular level, life processes are realized through interactions between DNA, RNA, and proteins. The interacting fragments are called "functional sites"; clusters of sites are called "regulatory regions"; and protein-coding DNA(RNA) regions are called "genes". For DNA(RNA) sequences, genome annotation is the field of life science that predicts functional sites, regulatory regions, and genes; these predictions should then be verified experimentally. Applets are therefore needed that predict the positions of these structural units by similarity to already known patterns. The experimental evidence on DNA(RNA) sequences, sites, regulatory regions, and genes is distributed across more than 400 databases. To apply these data to genome annotation, complementary portions of the data must be retrieved from the databases and processed by "Knowledge Discovery" computations in order to develop applets suitable for sequence analysis. Because many different "Knowledge Discovery" computations share common steps, we store the intermediate results of these steps in an intermediate database, called the "data-filial" database. This yields a data-filial tree, called "data-farming", that grows from the databases to the applets for sequence analysis. Our data-farming infrastructure for (ⅰ) functional site activity prediction, (ⅱ) gene regulatory region recognition, and (ⅲ) "High/Low" gene expression estimation is described and discussed here, URL=.
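
The idea of storing shared intermediate results can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `SOURCE_DB`, `FilialStore`, `base_counts`, `gc_content`, and `most_common_base` are hypothetical, the sequences are toy data, and an in-memory dictionary stands in for the "data-filial" database. Two small analyses reuse one intermediate result (per-site base counts), so the shared step is computed only once per site.

```python
from collections import Counter
from typing import Callable, Dict, List

# Hypothetical stand-in for a source database: site name -> example sequences.
SOURCE_DB: Dict[str, List[str]] = {
    "site_A": ["TATAAA", "TATATA"],
    "site_B": ["GGGCGG", "GGGCGG"],
}


class FilialStore:
    """Toy 'data-filial' store: caches intermediate results for reuse."""

    def __init__(self) -> None:
        self._cache: Dict[str, Counter] = {}
        self.computations = 0  # how many times a shared step actually ran

    def get(self, key: str, compute: Callable[[], Counter]) -> Counter:
        # Compute the intermediate result only if it is not stored yet.
        if key not in self._cache:
            self._cache[key] = compute()
            self.computations += 1
        return self._cache[key]


def base_counts(site: str) -> Counter:
    """Shared intermediate step: count bases over all sequences of a site."""
    return Counter("".join(SOURCE_DB[site]))


def gc_content(store: FilialStore, site: str) -> float:
    """First downstream analysis: fraction of G and C bases."""
    counts = store.get(f"counts:{site}", lambda: base_counts(site))
    return (counts["G"] + counts["C"]) / sum(counts.values())


def most_common_base(store: FilialStore, site: str) -> str:
    """Second downstream analysis: reuses the same stored counts."""
    counts = store.get(f"counts:{site}", lambda: base_counts(site))
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    store = FilialStore()
    print(gc_content(store, "site_A"), most_common_base(store, "site_A"))
    print("shared steps computed:", store.computations)
```

In this sketch each analysis plays the role of a leaf of the tree, and the cached counts play the role of an intermediate node between the source data and the analyses.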