Identifying gene and protein mentions in text using conditional random fields

Ryan McDonald; Fernando Pereira

Abstract

Background: We present a model for tagging gene and protein mentions from text using the probabilistic sequence tagging framework of conditional random fields (CRFs). Conditional random fields model the probability P(t | o) of a tag sequence given an observation sequence directly, and have previously been employed successfully for other tagging tasks. The mechanics of CRFs and their relationship to maximum entropy are discussed in detail.

Results: We employ a diverse feature set containing standard orthographic features combined with expert features in the form of gene and biological term lexicons to achieve a precision of 86.4% and recall of 78.7%. An analysis of the contribution of the various features of the model is provided.


Bibliographic details

Source: BMC Bioinformatics, 2005, issue 1
Authors: Ryan McDonald; Fernando Pereira
Original format: PDF
Chinese Library Classification: Biological sciences

Similar literature

1. De-identifying Swedish clinical text - refinement of a gold standard and experiments with Conditional random fields [J]. Hercules Dalianis, Sumithra Velupillai. Journal of Biomedical Semantics, 2010, issue 1.
2. Enhanced Identifying Gene Names from Biomedical Literature with Conditional Random Fields [J]. Wei-Zhong Qian, Chong Fu, Hong-Rong Cheng. 中国电子科技：英文版, 2009, issue 3.
3. Conditional random fields for clinical named entity recognition: A comparative study using Korean clinical texts [J]. Lee Wangjin, Kim Kyungmo, Lee Eun Young. Computers in Biology and Medicine, 2018.
4. Flytxt_NTNU at SemEval-2018 Task 8: Identifying and Classifying Malware Text Using Conditional Random Fields and Naive Bayes Classifiers [C]. Utpal Kumar Sikdar, Biswanath Barik, Bjoern Gamback. International workshop on semantic evaluation; Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies, 2018.
5. Conditional Random Fields With Lasso and Its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection [D]. Liu, Xiyuan. 2019.
6. Identifying gene and protein mentions in text using conditional random fields [O]. Ryan McDonald, Fernando Pereira. 2005.
7. Identifying gene and protein mentions in text using conditional random fields [O]. Ryan Mcdonald, O Pereira. 2011.
