首页> 外文期刊>Genomics & Informatics >Standard-based Integration of Heterogeneous Large-scale DNA Microarray Data for Improving Reusability.
【24h】

Standard-based Integration of Heterogeneous Large-scale DNA Microarray Data for Improving Reusability.

机译:基于标准的异构大规模DNA微阵列数据集成,可提高可重用性。

获取原文
           

摘要

Gene Expression Omnibus (GEO) has kept the largest amount of gene-expression microarray data that have grown exponentially. Microarray data in GEO have been generated in many different formats and often lack standardized annotation and documentation. It is hard to know if preprocessing has been applied to a dataset or not and in what way. Standard-based integration of heterogeneous data formats and metadata is necessary for comprehensive data query, analysis and mining. We attempted to integrate the heterogeneous microarray data in GEO based on Minimum Information About a Microarray Experiment (MIAME) standard. We unified the data fields of GEO Data table and mapped the attributes of GEO metadata into MIAME elements. We also discriminated non-preprocessed raw datasets from others and processed ones by using a two-step classification method. Most of the procedures were developed as semi-automated algorithms with some degree of text mining techniques. We localized 2,967 Platforms, 4,867 Series and 103,590 Samples with covering 279 organisms, integrated them into a standard-based relational schema and developed a comprehensive query interface to extract. Our tool, GEOQuest is available at http://www.snubi.org/software/GEOQuest/.
机译:基因表达综合(GEO)保留了大量呈指数增长的基因表达微阵列数据。 GEO中的微阵列数据已经以多种不同的格式生成,并且通常缺乏标准化的注释和文档。很难知道是否对数据集进行了预处理以及以何种方式进行了预处理。基于标准的异构数据格式和元数据的集成对于全面的数据查询,分析和挖掘是必需的。我们试图根据关于微阵列实验的最低限度信息(MIAME)标准将异构微阵列数据整合到GEO中。我们统一了GEO数据表的数据字段,并将GEO元数据的属性映射到MIAME元素中。我们还使用两步分类方法将非预处理的原始数据集与其他数据集和已处理的原始数据集区分开。大多数程序都是使用某种程度的文本挖掘技术开发为半自动算法。我们对涵盖279种生物的2967个平台,4867个系列和103590个样本进行了本地化,将它们集成到基于标准的关系模式中,并开发了一个全面的查询界面以进行提取。我们的工具GEOQuest可从http://www.snubi.org/software/GEOQuest/获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号