Extracting Disease-Symptom Relationships by Learning Syntactic Patterns from Dependency Graphs

机译：通过从依赖图学习句法模式来提取疾病-症状关系

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Disease-symptom relationships are of primary importance for biomedical informatics, but databases that catalog them are incomplete in comparison with the state of the art available in the scientific literature. We propose in this paper a novel method for automatically extracting disease-symptom relationships from text, called SPARE (standing for Syntactic PAttern for Relationship Extraction). This method is composed of 3 successive steps: first, we learn patterns from the dependency graphs; second, we select best patterns based on their respective quality and specificity (their ability to identify only disease-symptom relationships); finally, the patterns are used on new texts for extracting disease-symptom relationships. We experimented SPARE on a corpus of 121,796 abstracts of PubMed related to 457 rare diseases. The quality of the extraction has been evaluated depending on the pattern quality and specificity. The best F-measure obtained is 55.65% (for specificity ≥ 0.5 and quality ≥ 0.5). To provide an insight on the novelty of disease-symptom relationship extracted, we compare our results to the content of phenotype databases (OrphaData and OMIM). Our results show the feasibility of automatically extracting disease-symptom relationships, including true relationships that were not already referenced in phenotype databases and may involve complex symptom descriptions.

机译：疾病症状关系对生物医学信息学至关重要，但是与科学文献中现有的技术水平相比，对它们进行分类的数据库并不完整。我们在本文中提出了一种新的自动从文本中提取疾病-症状关系的方法，称为SPARE（代表关系提取的句法模式）。该方法包括3个连续的步骤：首先，我们从依赖图中学习模式；其次，我们根据它们各自的质量和特异性（它们仅能识别疾病-症状关系的能力）选择最佳模式；最后，将这些模式用于新文本以提取疾病-症状关系。我们在与457种罕见病有关的121,796篇PubMed摘要上对SPARE进行了实验。已经根据图案质量和特异性评估了提取质量。获得的最佳F量度为55.65％（对于特异性≥0.5和质量≥0.5）。为了提供对疾病-症状关系提取的新颖性的见解，我们将我们的结果与表型数据库（OrphaData和OMIM）的内容进行了比较。我们的结果表明自动提取疾病-症状关系的可行性，包括表型数据库中尚未引用的真实关系，可能涉及复杂的症状描述。

著录项

来源
《Workshop on biomedical natural language processing 2015》|2015年|71-80|共10页
会议地点 Beijing(CA)
作者
Mohsen Hassan; Olfa Makkaoui; Adrien Coulet; Yannick Toussaint;
展开▼
作者单位

LORIA (CNRS, Inria, Universite de Lorraine), Campus scientifique, Vandoeuvre-les-Nancy, F-54506, France;

LORIA (CNRS, Inria, Universite de Lorraine), Campus scientifique, Vandoeuvre-les-Nancy, F-54506, France;

LORIA (CNRS, Inria, Universite de Lorraine), Campus scientifique, Vandoeuvre-les-Nancy, F-54506, France;

LORIA (CNRS, Inria, Universite de Lorraine), Campus scientifique, Vandoeuvre-les-Nancy, F-54506, France;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
入库时间 2022-08-26 14:23:26

相似文献

外文文献
中文文献
专利

1. Revisiting syntactic development in deaf and hearing children from a dependency approach Comment on "Dependency distance: a new perspective on syntactic patterns in natural languages" by Haitao Liu et al. [J] . Yan Jingqi Physics of life reviews . 2017,第期

机译：从抚养方法评估聋人和听力子女的句法发展评论“依赖距离：海地刘等人的”自然语言句法模式的新观点“。
2. Dependency distance: A new perspective on the syntactic development in second language acquisition Comment on "Dependency distance: A new perspective on syntactic patterns in natural language" by Haitao Liu et al. [J] . Jiang Jingyang, Ouyang Jinghui Physics of life reviews . 2017,第期

机译：依赖距离：海地刘等人对“依赖距离”评论句法发展的新视角。
3. On the relation between dependency distance, crossing dependencies, and parsing Comment on "Dependency distance: a new perspective on syntactic patterns in natural languages" by Haitao Liu et al. [J] . Gomez-Rodriguez Carlos Physics of life reviews . 2017,第期

机译：海涛刘等人的依赖距离，交叉依赖关系与解析评论的关系。
4. Extracting Disease-Symptom Relationships by Learning Syntactic Patterns from Dependency Graphs [C] . Mohsen Hassan, Olfa Makkaoui, Adrien Coulet, Workshop on biomedical natural language processing . 2015

机译：从依赖图中学习句法模式提取疾病 - 症状关系
5. THE RELATIONSHIP OF AUDITORY AND COGNITIVE PROCESSES TO SYNTACTIC PATTERNS OF LEARNING DISABLED AND NORMAL CHILDREN [D] . WREN, CAROL THOMPSON. 1980

机译：听觉障碍和认知过程与学习障碍儿童和正常儿童的句法特征的关系
6. Automatic Authorship Detection Using Textual Patterns Extracted from Integrated Syntactic Graphs [O] . Helena Gómez-Adorno, Grigori Sidorov, David Pinto, 2016

机译：使用从综合句法图中提取的文本模式自动检测作者身份
7. Automatic Authorship Detection Using Textual Patterns Extracted from Integrated Syntactic Graphs [O] . Helena Gómez-Adorno, Grigori Sidorov, David Pinto, 2016

机译：使用从集成句法图中提取的文本模式进行自动作者检测

Extracting Disease-Symptom Relationships by Learning Syntactic Patterns from Dependency Graphs

摘要

著录项

相似文献

相关主题

期刊订阅