Journal: Intelligent Data Analysis

TRRD: Technology for Extraction, Storage, and Use of Knowledge About the Structural-Functional Organization of the Transcriptional Regulatory Regions in the Eukaryotic Genes


Abstract

We have developed the Transcription Regulatory Regions Database (TRRD, http://www.bionet.nsc.ru/trrd/), designed to store data on the structural-functional organization of the transcriptional regulatory regions of eukaryotic genes and on their expression patterns. TRRD contains experimentally supported data only. Knowledge extraction from the scientific literature, storage, and further application of the results proceed stepwise, following a data mining workflow: i) knowledge extraction from scientific publications; ii) preprocessing (data cleaning, syntactic and semantic analysis); iii) data transformation; iv) application for prediction; v) interpretation of the obtained knowledge to address current problems in bioinformatics. TRRD contains a compilation of data on 2 344 genes, their 14 407 expression patterns, 3 490 regulatory units, and 10 135 transcription factor binding sites. TRRD is populated by manual annotation of scientific publications; the information it contains results from the annotation of 7 609 scientific papers. The Sequence Retrieval System (SRS) is the main tool for search and navigation in TRRD. The large number of indexed fields in the SRS version allows the user to generate queries both within and between libraries. TRRD also has thesauri and search systems that provide additional options for data access. TRRD is currently linked to 20 worldwide information resources, including EMBL/GenBank, Ensembl, EPD, SWISS-PROT, TRANSFAC, GDB, GeneCards, MGD, RGD, and GO. The links serve as a framework for integration in a distributed network environment.
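The idea of querying "within and between libraries" through indexed fields can be illustrated with a small in-memory sketch. This is not SRS or the TRRD schema: the record types `Gene` and `BindingSite`, the `Library` class, the function `genes_bound_by`, and all field names and identifiers below are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    gene_id: str   # hypothetical identifier, not a real TRRD accession
    name: str
    species: str


@dataclass(frozen=True)
class BindingSite:
    site_id: str   # hypothetical identifier
    gene_id: str   # link field pointing into the gene library
    factor: str    # transcription factor bound at this site


class Library:
    """A collection of records with one inverted index per chosen field."""

    def __init__(self, records, indexed_fields):
        self.records = list(records)
        self.index = {f: defaultdict(list) for f in indexed_fields}
        for rec in self.records:
            for f in indexed_fields:
                self.index[f][getattr(rec, f)].append(rec)

    def query(self, field, value):
        """Query within one library: records whose indexed field equals value."""
        return list(self.index[field].get(value, []))


def genes_bound_by(factor, genes, sites):
    """Query between libraries: follow gene_id links from sites to genes."""
    found, seen = [], set()
    for site in sites.query("factor", factor):
        for gene in genes.query("gene_id", site.gene_id):
            if gene.gene_id not in seen:
                seen.add(gene.gene_id)
                found.append(gene)
    return found


# Toy data; gene and factor names are placeholders.
genes = Library(
    [Gene("G1", "geneA", "human"), Gene("G2", "geneB", "mouse")],
    indexed_fields=["gene_id", "species"],
)
sites = Library(
    [
        BindingSite("S1", "G1", "TF-X"),
        BindingSite("S2", "G2", "TF-X"),
        BindingSite("S3", "G1", "TF-Y"),
    ],
    indexed_fields=["factor", "gene_id"],
)
```

Calling `genes.query("species", "mouse")` searches a single library, while `genes_bound_by("TF-X", genes, sites)` joins the two libraries through the shared `gene_id` field, which is the role the abstract assigns to indexed link fields.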
