首页> 外文期刊>Intelligent Data Analysis >TRRD: Technology for extraction, storage, and use of knowledge about the structural-functional organization of the transcriptional regulatory regions in the eukaryotic genes
【24h】

TRRD: Technology for extraction, storage, and use of knowledge about the structural-functional organization of the transcriptional regulatory regions in the eukaryotic genes

机译:TRRD:提取,储存和使用有关真核基因转录调控区结构功能组织的知识的技术

获取原文
获取原文并翻译 | 示例
           

摘要

Abstract. We have developed the Transcription Regulatory Regions Database (TRRD, http://www.bionet.nsc.ru/trrd/) designednfor storage of data on the structural-functional organization of transcriptional regulatory regions of the eukaryotic genes and theirnexpression patterns. TRRDcontains experimentally supported data only. Knowledge extraction from scientific literature, storagenand further application of the results are all stepwise, conforming to the Data Mining technology: i) knowledge extraction fromnscientific publications; ii) preprocessing (data cleaning, syntactic and semantic analysis; iii) data transformation; iv) applicationnfor prediction; v) interpretation of the obtained knowledge to resolve the timely issues in bioinformatics. TRRD contains ancompilation of data on 2 344 genes, their 14 407 expression patterns, 3 490 regulatory units, and 10 135 transcription factornbinding sites. TRRD is filled in by manual annotation of scientific publications. The information incorporated into TRRD is thenresult of annotations of 7 609 scientific papers. Sequence Retrieval System (SRS) is the main tool for search and navigation innTRRD. A large number of indexed fields in its SRS version allow the user to generate queries both within and between libraries.nTRRD has thesauruses and search systems that provide additional options for data access. TRRD is currently linked to 20nworldwide information resources, including EMBL/GeneBank, Ensembl, EPD, SWISS-PROT, TRANSFAC, GDB, GeneCards,nMGD, RGD, GO. The links serve as a framework for integration in a distributed network environment.
机译:抽象。我们已经开发了转录调控区数据库(TRRD,http://www.bionet.nsc.ru/trrd/),用于存储有关真核基因转录调控区的结构功能组织及其表达模式的数据。 TRRD仅包含实验支持的数据。从科学文献中提取知识,存储和进一步使用结果均符合数据挖掘技术:i)从科学出版物中提取知识; ii)预处理(数据清理,句法和语义分析; iii)数据转换; iv)预测应用; v)对获得的知识进行解释,以解决生物信息学中的及时问题。 TRRD包含有关2344个基因,它们的14407个表达模式,3490个调节单位和10135个转录因子结合位点的数据的汇编。 TRRD通过对科学出版物进行人工注释来填充。然后,TRRD中包含的信息就是7 609篇科学论文的注释结果。序列检索系统(SRS)是用于搜索和导航inTRRD的主要工具。 SRS版本中的大量索引字段使用户可以在库内部和库之间生成查询。nTRRD具有同义词库和搜索系统,可为数据访问提供其他选项。 TRRD目前已链接到20n个全球信息资源,包括EMBL / GeneBank,Ensembl,EPD,SWISS-PROT,TRANSFAC,GDB,GeneCards,nMGD,RGD,GO。这些链接用作在分布式网络环境中集成的框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号