Constructing Efficient Information Extraction Pipelines

Abstract

Information Extraction (IE) pipelines analyze text through several stages. The pipeline's algorithms determine both its effectiveness and its run-time efficiency. In real-world tasks, however, IE pipelines often fail to achieve acceptable run-times because they analyze too much task-irrelevant text. This raises two interesting questions: 1) How much "efficiency potential" depends on the scheduling of a pipeline's algorithms? 2) Is it possible to devise a reliable method to construct efficient IE pipelines? Both questions are addressed in this paper. In particular, we show how to optimize the run-time efficiency of IE pipelines under a given set of algorithms. We evaluate pipelines for three algorithm sets on an industrially relevant task: the extraction of market forecasts from news articles. Using a system-independent measure, we demonstrate that efficiency gains of up to one order of magnitude are possible without compromising a pipeline's original effectiveness.
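The scheduling question raised in the abstract can be illustrated with a small sketch. This is not the method from the paper; it is a toy model under stated assumptions: each stage has a made-up per-sentence cost, each stage acts as a filter that discards sentences it considers irrelevant, and later stages only see what earlier stages kept. The stage names, costs, keyword predicates, and sample sentences are all hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

# A stage is (name, hypothetical cost per sentence, predicate that keeps relevant sentences).
Stage = Tuple[str, float, Callable[[str], bool]]


def run_pipeline(stages: Sequence[Stage], sentences: Sequence[str]) -> Tuple[List[str], float]:
    """Run the stages in order; each stage only analyzes sentences kept so far."""
    remaining = list(sentences)
    total_cost = 0.0
    for _name, cost, keep in stages:
        total_cost += cost * len(remaining)  # pay for every sentence this stage analyzes
        remaining = [s for s in remaining if keep(s)]
    return remaining, total_cost


# Hypothetical sample input.
sentences = [
    "Analysts expect the market to grow by 5% in 2012.",
    "The company was founded in 1990.",
    "Revenue will rise next year, the report forecasts.",
    "The weather was sunny.",
]

# Toy stand-ins for real IE algorithms: a cheap time detector and a costly forecast detector.
has_time: Stage = (
    "time", 1.0,
    lambda s: any(ch.isdigit() for ch in s) or "next year" in s,
)
has_forecast: Stage = (
    "forecast", 5.0,
    lambda s: any(w in s for w in ("expect", "forecast", "will")),
)

out_a, cost_a = run_pipeline([has_time, has_forecast], sentences)
out_b, cost_b = run_pipeline([has_forecast, has_time], sentences)

print(out_a == out_b)   # both schedules keep the same sentences
print(cost_a, cost_b)   # but they analyze different amounts of text
```

In this toy setting both schedules return the same sentences, because the result is the conjunction of the filters, while the schedule that runs the cheap filter first analyzes less text with the expensive stage. Real IE algorithms usually have dependencies on each other's output, which restrict the admissible schedules; the sketch ignores that.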

Bibliographic details

Source: ACM international conference on information and knowledge management, 2011, 4 pages
Authors: Henning Wachsmuth; Benno Stein; Gregor Engels
Original format: PDF
Chinese Library Classification: Information processing
Keywords: information extraction; run-time efficiency
