首页> 美国卫生研究院文献>other >Generic Information Can Retrieve Known Biological Associations: Implications for Biomedical Knowledge Discovery
【2h】

Generic Information Can Retrieve Known Biological Associations: Implications for Biomedical Knowledge Discovery

机译:通用信息可以检索已知的生物学关联:对生物医学知识发现的启示

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

MotivationWeighted semantic networks built from text-mined literature can be used to retrieve known protein-protein or gene-disease associations, and have been shown to anticipate associations years before they are explicitly stated in the literature. Our text-mining system recognizes over 640,000 biomedical concepts: some are specific (i.e., names of genes or proteins) others generic (e.g., ‘Homo sapiens’). Generic concepts may play important roles in automated information retrieval, extraction, and inference but may also result in concept overload and confound retrieval and reasoning with low-relevance or even spurious links. Here, we attempted to optimize the retrieval performance for protein-protein interactions (PPI) by filtering generic concepts (node filtering) or links to generic concepts (edge filtering) from a weighted semantic network. First, we defined metrics based on network properties that quantify the specificity of concepts. Then using these metrics, we systematically filtered generic information from the network while monitoring retrieval performance of known protein-protein interactions. We also systematically filtered specific information from the network (inverse filtering), and assessed the retrieval performance of networks composed of generic information alone.
机译:动机基于文本挖掘的文献构建的加权语义网络可用于检索已知的蛋白质-蛋白质或基因-疾病关联,并且已被证明可以在文献中明确表述之前对其进行预测。我们的文本挖掘系统可识别超过640,000种生物医学概念:有些是特定的(即基因或蛋白质的名称),而另一些则是通用的(例如,“智人”)。通用概念可能在自动信息检索,提取和推理中起重要作用,但也可能导致概念超载,混淆性低,甚至是虚假链接的检索和推理。在这里,我们尝试通过从加权语义网络过滤通用概念(节点过滤)或到通用概念的链接(边缘过滤)来优化蛋白质-蛋白质相互作用(PPI)的检索性能。首先,我们基于网络属性定义了度量,这些度量量化了概念的特殊性。然后,使用这些指标,我们从网络中系统地过滤了通用信息,同时监视了已知蛋白质-蛋白质相互作用的检索性能。我们还系统地过滤了来自网络的特定信息(逆过滤),并评估了仅由通用信息组成的网络的检索性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号