A graph-search framework for associating gene identifiers with documents

William W Cohen; Einat Minkov

首页> 外文期刊>BMC Bioinformatics >A graph-search framework for associating gene identifiers with documents

【24h】

A graph-search framework for associating gene identifiers with documents

机译：一个图形搜索框架，用于将基因标识符与文档相关联

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background One step in the model organism database curation process is to find, for each article, the identifier of every gene discussed in the article. We consider a relaxation of this problem suitable for semi-automated systems, in which each article is associated with a ranked list of possible gene identifiers, and experimentally compare methods for solving this geneId ranking problem. In addition to baseline approaches based on combining named entity recognition (NER) systems with a "soft dictionary" of gene synonyms, we evaluate a graph-based method which combines the outputs of multiple NER systems, as well as other sources of information, and a learning method for reranking the output of the graph-based method. Results We show that named entity recognition (NER) systems with similar F-measure performance can have significantly different performance when used with a soft dictionary for geneId-ranking. The graph-based approach can outperform any of its component NER systems, even without learning, and learning can further improve the performance of the graph-based ranking approach. Conclusion The utility of a named entity recognition (NER) system for geneId-finding may not be accurately predicted by its entity-level F1 performance, the most common performance measure. GeneId-ranking systems are best implemented by combining several NER systems. With appropriate combination methods, usefully accurate geneId-ranking systems can be constructed based on easily-available resources, without resorting to problem-specific, engineered components.

机译：背景技术模型生物体数据库策策过程中的一步是为每篇文章找到文章中讨论的每个基因的标识符。我们考虑放松该问题，适用于半自动系统，其中每种物品与可能的基因标识符的排名列表相关，并通过实验比较求解该基因排名问题的方法。除了基于基因组合的基线方法，除了基因同义词的“软词典”的命名实体识别（NER）系统，我们评估了基于图形的方法，该方法组合了多个NER系统的输出，以及其他信息来源，以及一种rerank基于图形方法的输出的学习方法。结果我们表明，当与基因分词一起使用时，具有类似F测量性能的命名实体识别（NER）系统可以具有显着不同的性能。基于图形的方法可以胜过其组件内部系统，即使没有学习，也可以进一步提高基于图形的排名方法的性能。结论目的识别（NER）系统的效用可能无法通过其实体级F1性能准确预测，最常见的性能测量。通过组合几个NER系统，最好地实现基因排名系统。通过适当的组合方法，可以基于易于使用的资源来构建有用的准确基因排名系统，而无需借助特定于问题的工程组件。

著录项

来源
《BMC Bioinformatics 》 |2006年第1期| 共页
作者
William W Cohen; Einat Minkov;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Depot- and obesity-related differences in adipogenesisAdipocyte hypertrophy and hyperplasia are known to facilitate lipid storage in adipose tissues by increasing adipocyte cell size and number, respectively. Adipogenesis is the process resulting in adipose tissue hyperplasia. Although depot-specific differences and obesity-related modulation of adipocyte size are well documented, available data on adipogenesis and adipose tissue hyperplasia are less conclusive. Most studies support a reduction of adipogenesis in the obese state. Preadipocytes of the subcutaneous fat depot appear to be more responsive to adipogenic stimulation compared with those from visceral fat compartments in most studies. A number of studies support the notion that adipose tissue expansion through hyperplasia reduces ectopic lipid excess and obesity-related complications. Several genetic variants have been identified in the genes coding for adipogenesis-regulating proteins. While some of these variants have been clearly associated with the phenotypes of obesity and obesity-related alterations, available data highlight the importance of considering gene–gene and gene–diet interactions. [J] . Julie. Lessard, André. Tchernof Clinical lipidology. . 2012 ,第5期

机译：脂肪形成与肥胖相关的差异已知脂肪细胞肥大和增生分别通过增加脂肪细胞的大小和数量来促进脂质在脂肪组织中的存储。脂肪形成是导致脂肪组织增生的过程。尽管已经有很多文献记载了贮库特异性差异和肥胖相关的脂肪细胞大小调节，但有关脂肪形成和脂肪组织增生的可用数据尚无定论。大多数研究支持在肥胖状态下减少脂肪形成。在大多数研究中，与来自内脏脂肪区室的脂肪细胞相比，皮下脂肪库的前脂肪细胞似乎对脂肪刺激更为敏感。许多研究支持这样的观点，即通过增生的脂肪组织扩张可以减少异位脂质过多和肥胖相关的并发症。在编码脂肪形成调节蛋白的基因中已经鉴定出几种遗传变异。尽管其中一些变异与肥胖症的表型和与肥胖有关的改变明显相关，但现有数据突出了考虑基因-基因和基因-饮食相互作用的重要性。
2. Surgery in World War II. Thoracic Surgery—Volume I. Edited by Frank B. Berry, M.D.; Associate Editor, Elizabeth M. McFetridge, M.A.; with seven other contributors. Prepared and published under the direction of Lieutenant-General Leonard D. Heaton, the Surgeon General, United States Army. Editor-in-Chief, Colonel John Boyd Coates, Jun., M.C., U.S.A. 10x7? in. Pp. xxiv+394, with 70 figures. Index. 1963. Washington: Superintendent of Documents, U.S. Government Printing Office, Washington, D.C. Price $4.25 [J] . John W. Jackson The Journal of Bone and Joint Surgery. British VolumecBritish Orthopaedic Association , Australian Orthopaedic Association , Canadian Orthopaedic Association . . . [et al] . 1963 ,第4期

机译：第二次世界大战中的外科手术。胸外科手术-第I卷，医学博士Frank B. Berry编辑;副编辑，伊丽莎白·麦克菲特里奇（MA）与其他七个贡献者。在美国陆军外科医生伦纳德·希顿中将的指导下编写和出版。主编约翰·博伊德·科茨（John Boyd Coates）上校，美国马萨诸塞州，六月10x7？ in。Pp。 xxiv + 394，含70个数字。指数。 1963年。华盛顿：文件总监，美国政府印刷局，华盛顿特区，价格$ 4.25
3. A DOCUMENT RANKING APPROACH BASED ON WEIGHTED-GENE/PROTEIN IN LARGE BIOMEDICAL DOCUMENTS USING MAPREDUCE FRAMEWORK [J] . K.S.S. Joseph Sastry, Venkata Daya Sagar Ketaraju International journal of simulation: systems, science and technology . 2018 ,第6aaPagea2期

机译：基于MapReduce框架的大型生物医学文件中加权基因/蛋白的文献排名方法
4. NON-LEXICAL APPROACHES TO IDENTIFYING ASSOCIATIVE RELATIONS IN THE GENE ONTOLOGY [C] . OLIVIER BODENREIDER, MARC AUBRY, ANITA BURGUN Pacific Symposium on Biocomputing(PSB); 20050104-08; Hawaii,HI(US) . 2005

机译：识别基因本体论中关联关系的非严格方法
5. Design and evaluation of an associative classification framework to identify disease cohorts in the electronic health record. [D] . Welch, Susan Rea. 2011

机译：设计和评估关联分类框架，以识别电子健康记录中的疾病队列。
6. A graph-search framework for associating gene identifiers with documents [O] . William W Cohen, Einat Minkov 2006

机译：图搜索框架用于将基因标识符与文档相关联
7. A graph-search framework for associating gene identifiers with documents [O] . Cohen William W, Minkov Einat 2006

机译：图搜索框架，用于将基因标识符与文档相关联

A graph-search framework for associating gene identifiers with documents

摘要

著录项

相似文献

相关主题

期刊订阅