...
首页> 外文期刊>Bioinformatics >Gene clustering by Latent Semantic Indexing of MEDLINE abstracts
【24h】

Gene clustering by Latent Semantic Indexing of MEDLINE abstracts

机译:基于MEDLINE摘要的潜在语义索引的基因聚类。

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: A major challenge in the interpretation of high-throughput genomic data is understanding the functional associations between genes. Previously, several approaches have been described to extract gene relationships from various biological databases using term-matching methods. However, more flexible automated methods are needed to identify functional relationships (both explicit and implicit) between genes from the biomedical literature. In this study, we explored the utility of Latent Semantic Indexing (LSI), a vector space model for information retrieval, to automatically identify conceptual gene relationships from titles and abstracts in MEDLINE citations.Results: We found that LSI identified gene-to-gene and keyword-to-gene relationships with high average precision. In addition, LSI identified implicit gene relationships based on word usage patterns in the gene abstract documents. Finally, we demonstrate here that pairwise distances derived from the vector angles of gene abstract documents can be effectively used to functionally group genes by hierarchical clustering. Our results provide proof-of-principle that LSI is a robust automated method to elucidate both known (explicit) and unknown (implicit) gene relationships from the biomedical literature. These features make LSI particularly useful for the analysis of novel associations discovered in genomic experiments.
机译:动机:高通量基因组数据的解释中的主要挑战是了解基因之间的功能关联。以前,已经描述了几种使用术语匹配方法从各种生物学数据库中提取基因关系的方法。但是,需要更灵活的自动化方法来识别生物医学文献中基因之间的功能关系(显式和隐式)。在这项研究中,我们探索了潜在语义索引(LSI)这一用于信息检索的向量空间模型的用途,该工具可从MEDLINE引用中的标题和摘要中自动识别概念上的基因关系。结果:我们发现LSI可以识别基因对基因以及关键字与基因之间的关系,平均精度很高。此外,LSI根据基因摘要文档中的单词使用模式识别了隐性基因关系。最后,我们在这里证明从基因抽象文档的向量角度得出的成对距离可以有效地用于通过层次聚类对基因进行功能分组。我们的结果提供了原理证明,即LSI是一种强大的自动化方法,可以从生物医学文献中阐明已知(显性)和未知(隐性)基因之间的关系。这些功能使LSI特别适用于分析基因组实验中发现的新型关联。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号