Bootstrapping for example-based data extraction

机译：自举基于示例的数据提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The effortless generation of wrappers for Web data sources is a crucial task if proper access to the huge amount of semi-structured data on the Web is to be granted. In particular, the development of strategies for wrapper generation based on user-given examples is currently one of the most promising research directions in Web data extraction. In this paper we show how to use a pre-existing data repository to automatically generate examples and allow full automated example-based data extraction. To demonstrate the feasibility of our approach we provide a number of results obtained from experiments we carried out and discuss how our ideas can be used to improve extraction rates and for providing resilience and adaptiveness for example-based generated wrappers.

机译：如果要允许对Web上大量的半结构化数据进行适当访问，那么毫不费力地为Web数据源生成包装器是一项至关重要的任务。特别是，基于用户给出的示例开发包装器生成策略的方法是当前Web数据提取中最有希望的研究方向之一。在本文中，我们展示了如何使用预先存在的数据存储库自动生成示例，并允许基于示例的全自动数据提取。为了证明我们方法的可行性，我们提供了从我们进行的实验中获得的大量结果，并讨论了如何将我们的想法用于提高提取率以及为基于示例的生成包装程序提供弹性和适应性。

著录项

来源
《Proceedings of the Tenth international conference on Information and knowledge management》|2001年|P.371-378|共8页
会议地点
作者
Paulo B. Golgher; Altigran S. da Silva; Alberto H. F. Laender; Berthier Ribeiro-Neto;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息管理系统;
关键词

相似文献

外文文献
中文文献
专利

1. Statistical Uncertainty Analysis for Small-Sample, High Log-Variance Data: Cautions for Bootstrapping and Bayesian Bootstrapping [J] . Mostofian Barmak, Zuckerman Daniel M. Journal of chemical theory and computation: JCTC . 2019,第6期

机译：小型样本，高日志方差数据的统计不确定性分析：引导和贝叶斯举动的注意事项
2. Bootstrapping complex time‐to‐event data without individual patient data, with a view toward time‐dependent exposures [J] . Bluhmki Tobias, Putter Hein, Allignol Arthur, Statistics in medicine . 2019,第20期

机译：在没有各个患者数据的情况下引导复杂的时间 - 事件数据，视图朝着时间依赖的曝光
3. Data exploration using example-based methods [J] . Bdlint Molndr Computing reviews . 2020,第1期

机译：使用基于示例的方法进行数据探索
4. Bootstrapping for Example-Based Data Extraction [C] . Berthier Ribeiro-Neto, Alberto H. F. Laender, Altigran S. da Silva, International conference on information and knowledge management . 2001

机译：引导用于基于示例的数据提取
5. Coping with Data-sparsity in Example-based Machine Translation. [D] . Gangadharaiah, Rashmi. 2011

机译：在基于示例的机器翻译中应对数据稀疏性。
6. Bootstrapping complex time‐to‐event data without individual patient data with a view toward time‐dependent exposures [O] . Tobias Bluhmki, Hein Putter, Arthur Allignol, -1

机译：引导复杂的时间到事件数据而无需单独的患者数据以期获得与时间有关的暴露
7. Domain adaptation of web data extraction based on bootstrapping method [O] . Dong-Lan Liu, Xin Liu, Lei Ma, 2017

机译：基于引导方法的Web数据提取域改编
8. Expanding the Recall of Relation Extraction by Bootstrapping [R] . Tomita, J. , Soderland, S. , Etzioni, O. 2006

机译：通过Bootstrapping扩展关系提取的召回

Bootstrapping for example-based data extraction

摘要

著录项

相似文献

相关主题

期刊订阅