Open Language Learning for Information Extraction

Abstract

Open Information Extraction (IE) systems extract relational tuples from text, without requiring a pre-specified vocabulary, by identifying relation phrases and associated arguments in arbitrary sentences. However, state-of-the-art Open IE systems such as ReVerb and WOE share two important weaknesses: they extract only relations that are mediated by verbs, and they ignore context, thus extracting tuples that are not asserted as factual. This paper presents OLLIE, a substantially improved Open IE system that addresses both these limitations. First, OLLIE achieves high yield by extracting relations mediated by nouns, adjectives, and more. Second, a context-analysis step increases precision by including contextual information from the sentence in the extractions. OLLIE obtains 2.7 times the area under the precision-yield curve (AUC) compared to ReVerb and 1.9 times the AUC of WOE^parse.
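
The AUC comparison in the abstract can be illustrated with a minimal sketch. It assumes each system's extractions are ranked by confidence and hand-labelled as correct or incorrect, that yield is the running count of correct extractions, and that the area is integrated with the trapezoidal rule. The function names, the integration choice, and the toy labels are all hypothetical; this is not the authors' evaluation code.

```python
from typing import List, Tuple


def precision_yield_curve(labels: List[bool]) -> List[Tuple[int, float]]:
    """Build (yield, precision) points from correctness labels.

    `labels` must be sorted by descending extraction confidence.
    A point is recorded each time the yield (number of correct
    extractions so far) increases, so yield values are strictly increasing.
    """
    points: List[Tuple[int, float]] = []
    correct = 0
    for rank, is_correct in enumerate(labels, start=1):
        if is_correct:
            correct += 1
            points.append((correct, correct / rank))
    return points


def area_under_curve(points: List[Tuple[int, float]]) -> float:
    """Trapezoidal area under the curve (assumed integration scheme).

    The area starts at the first recorded point, not at yield 0.
    """
    area = 0.0
    for (y0, p0), (y1, p1) in zip(points, points[1:]):
        area += (y1 - y0) * (p0 + p1) / 2.0
    return area


# Toy labels for two hypothetical systems (illustrative, not from the paper).
system_a = [True, True, False, True]
system_b = [True, False, False, True]

auc_a = area_under_curve(precision_yield_curve(system_a))
auc_b = area_under_curve(precision_yield_curve(system_b))
print(f"AUC ratio (A over B): {auc_a / auc_b:.2f}")
```

A ratio such as "2.7 times the AUC" is then the quotient of two such areas computed on the same test sentences.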

Bibliographic record

Source
Conference on Empirical Methods in Natural Language Processing; Conference on Computational Natural Language Learning | 2012 | pp. 523-534 | 12 pages
Authors
Mausam; Michael Schmitz; Robert Bart; Stephen Soderland; Oren Etzioni
Original format: PDF

