AutoBind: automatic extraction of protein-ligand-binding affinity data from biological literature

Chang Darby Tien-Hao; Ke Chao-Hsuan; Lin Jung-Hsin; Chiang Jung-Hsien

首页> 外文期刊>Bioinformatics >AutoBind: automatic extraction of protein-ligand-binding affinity data from biological literature

【24h】

AutoBind: automatic extraction of protein-ligand-binding affinity data from biological literature

机译：AutoBind：从生物学文献中自动提取蛋白质-配体结合亲和力数据

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivation: Determination of the binding affinity of a proteinlig- and complex is important to quantitatively specify whether a particular small molecule will bind to the target protein. Besides, collection of comprehensive datasets for protein-ligand complexes and their corresponding binding affinities is crucial in developing accurate scoring functions for the prediction of the binding affinities of previously unknown protein-ligand complexes. In the past decades, several databases of protein-ligand-binding affinities have been created via visual extraction from literature. However, such approaches are time-consuming and most of these databases are updated only a few times per year. Hence, there is an immediate demand for an automatic extraction method with high precision for binding affinity collection. Result: We have created a new database of protein-ligand-binding affinity data, AutoBind, based on automatic information retrieval. We first compiled a collection of 1586 articles where the binding affinities have been marked manually. Based on this annotated collection, we designed four sentence patterns that are used to scan full-text articles as well as a scoring function to rank the sentences that match our patterns. The proposed sentence patterns can effectively identify the binding affinities in full-text articles. Our assessment shows that AutoBind achieved 84.22% precision and 79.07% recall on the testing corpus. Currently, 13 616 protein-ligand complexes and the corresponding binding affinities have been deposited in AutoBind from 17 221 articles.

机译：动机：确定蛋白质与复合物的结合亲和力对于定量确定特定小分子是否会结合靶蛋白很重要。此外，收集蛋白质-配体复合物及其对应结合亲和力的全面数据集对于开发准确的评分功能，以预测先前未知的蛋白质-配体复合物的结合亲和力至关重要。在过去的几十年中，通过从文献中目视提取，建立了多个蛋白质-配体结合亲和力数据库。但是，这种方法很耗时，而且大多数数据库每年仅更新几次。因此，迫切需要用于结合亲和力收集的高精度的自动提取方法。结果：我们基于自动信息检索，创建了一个新的蛋白质-配体结合亲和力数据数据库AutoBind。我们首先汇编了1586篇文章的集合，其中已手动标记了绑定亲和力。基于此带注释的集合，我们设计了四个句子模式，这些模式用于扫描全文文章，以及一种计分功能以对与我们的模式匹配的句子进行排名。所提出的句子模式可以有效地识别全文文章中的绑定亲和力。我们的评估表明，AutoBind在测试语料库上实现了84.22％的精度和79.07％的召回率。目前，已有13 221篇文章在AutoBind中保存了13 616种蛋白质-配体复合物和相应的结合亲和力。

著录项

来源
《Bioinformatics》 |2012年第16期|共7页
作者
Chang Darby Tien-Hao; Ke Chao-Hsuan; Lin Jung-Hsin; Chiang Jung-Hsien;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物工程学（生物技术）;
关键词

相似文献

外文文献
中文文献
专利

1. AutoBind: automatic extraction of protein-ligand-binding affinity data from biological literature [J] . Chang Darby Tien-Hao, Ke Chao-Hsuan, Lin Jung-Hsin, Bioinformatics . 2012,第16期

机译：AutoBind：从生物学文献中自动提取蛋白质-配体结合亲和力数据
2. Automatic Literature Metadata Extraction from DataCite Services [J] . Kun Ma Recent patents on computer science . 2018,第1期

机译：DataCite服务的自动文献元数据提取
3. A Machine Learning Approach to Zeolite Synthesis Enabled by Automatic Literature Data Extraction [J] . Zach Jensen, Edward Kim, Soonhyoung Kwon, ACS Central Science . 2019,第5期

机译：通过自动文献数据提取实现的沸石合成的机器学习方法
4. Integrating semantic transcriptomic data analysis and knowledge extraction from biological literature [C] . Podpecan Vid, ef Stefan Institute Ljubljana Slovenia, Miljkovic Dragana, IEEE International Conference on Bioinformatics and Biomedicine . 2013

机译：整合语义转录组数据分析和生物学文献中的知识提取
5. Internet data extraction based on automatic regular expression inference. [D] . Lin, Ye. 2007

机译：基于自动正则表达式推断的Internet数据提取。
6. A Machine Learning Approach to Zeolite Synthesis Enabledby Automatic Literature Data Extraction [O] . Zach Jensen, Edward Kim, Soonhyoung Kwon, 2019

机译：启用沸石合成的机器学习方法通过自动文献数据提取
7. AutoBind: automatic extraction of protein–ligand-binding affinity data from biological literature [O] . Darby Tien-Hao Chang, Chao-Hsuan Ke, Jung-Hsin Lin, 2012

机译：Autobind：自动提取生物文学中的蛋白质 - 配体结合亲和力数据
8. Investigation of Procedures for Automatic Resonance Extraction from Noisy Transient Electromagnetics Data. Volume I. Investigation of Resonance Extraction Procedures [R] . Auton, J. R., Van Blaricum, M. L. 1981

机译：噪声瞬态电磁数据自动共振提取程序研究。第一卷。共振提取程序的研究

AutoBind: automatic extraction of protein-ligand-binding affinity data from biological literature

摘要

著录项

相似文献

相关主题

期刊订阅