首页> 外文会议>Knowledge-Based Systems for Safety Critical Applications >Querying text databases for efficient information extraction

【24h】

Querying text databases for efficient information extraction

机译：查询文本数据库以进行有效的信息提取

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

A wealth of information is hidden within unstructured text. This information is often best exploited in structured or relational form, which is suited for sophisticated query processing, for integration with relational databases, and for data mining. Current information extraction techniques extract relations from a text database by examining every document in the database, or use filters to select promising documents for extraction. The exhaustive scanning approach is not practical or even feasible for large databases, and the current filtering techniques require human involvement to maintain and to adapt to new databases and domains. We develop an automatic query-based technique to retrieve documents useful for the extraction of user-defined relations from large text databases, which can be adapted to new domains, databases, or target relations with minimal human effort. We report a thorough experimental evaluation over a large newspaper archive that shows that we significantly improve the efficiency of the extraction process by focusing only on promising documents.

机译：大量信息隐藏在非结构化文本中。通常最好以结构化或关系形式来利用此信息，该信息适合于复杂的查询处理，与关系数据库的集成以及数据挖掘。当前的信息提取技术通过检查数据库中的每个文档来从文本数据库中提取关系，或者使用过滤器选择有希望的文档以进行提取。对于大型数据库，穷举扫描方法不切实际甚至不可行，并且当前的过滤技术需要人为维护和适应新的数据库和域。我们开发了一种基于自动查询的技术，该技术可检索可用于从大型文本数据库中提取用户定义的关系的文档，该文档可轻松适应新的域，数据库或目标关系。我们对大型报纸档案馆进行了全面的实验评估，结果表明，仅关注有前途的文件，我们就大大提高了提取过程的效率。

著录项

来源
《Knowledge-Based Systems for Safety Critical Applications 》|1994年|p.113-124|共12页
会议地点
作者
Agichtein E.; Gravano L.;
展开▼
作者单位

Columbia Univ., USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术 ;
关键词

相似文献

外文文献
中文文献
专利

1. Natural language querying of databases: an information extraction approach in the conceptual query language [J] . Owei V. International journal of human-computer studies . 2000 ,第4期

机译：数据库的自然语言查询：概念查询语言中的信息提取方法
2. Top-k typicality queries and efficient query answering methods on large databases [J] . Ming Hua, Jian Pei, Ada W. C. Fu, VLDB journal . 2009 ,第3期

机译：大型数据库上的Top-k典型性查询和有效的查询回答方法
3. An efficient query optimization strategy for spatio-temporal queries in video databases [J] . G. Uenel, M.E. Doenderler, Oe. Ulusoy, The Journal of Systems and Software . 2004 ,第1期

机译：视频数据库中时空查询的高效查询优化策略
4. Querying text databases for efficient information extraction [C] . Agichtein, E., Gravano, . 2003

机译：查询文本数据库以进行有效的信息提取
5. INTEGRATION OF SOLID MODELING AND DATABASE MANAGEMENT FOR CAD/CAM (QUERY LANGUAGE, GEOMETRIC DATABASE, FEATURE EXTRACTION) [D] . LEE, YUNG-CHIA. 1984

机译：CAD / CAM的实体建模与数据库管理的集成（查询语言，几何数据库，特征提取）
6. NeuroExtract: Facilitating Neuroscience-oriented Retrieval from Broadly-focused Bioscience Databases Using Text-based Query Mediation [O] . Chiquito J. Crasto, Peter Masiar, Perry L. Miller 2007

机译：NeuroExtract：使用基于文本的查询中介从广泛关注的生物科学数据库中促进面向神经科学的检索
7. Querying text databases for efficient information extraction [O] . Agichtein Eugene, Gravano Luis 2003

机译：查询文本数据库以进行有效的信息提取

Querying text databases for efficient information extraction

摘要

著录项

相似文献

相关主题

期刊订阅