On the Automatic Extraction of Data from the Hidden Web


Abstract

An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are easy and precise) and from a data management perspective (static pages need not be maintained; databases can be accessed directly), automated agents have greater difficulty accessing data behind forms. In this paper we present a method for automatically filling in forms to retrieve the associated dynamically generated pages. Using our approach automated agents can begin to systematically access portions of the "hidden Web."


Bibliographic details

Source
ER 2001 Workshops on HUMACS, DASWIS, ECOMO, and DAMA, Nov 27-30, 2001, Yokohama, Japan | 2001 | pp. 212-226 | 15 pages
Conference location: Yokohama (JP)
Authors
Stephen W. Liddle; Sai Ho Yau; David W. Embley
Author affiliation

Information Systems Group and Computer Science Department, Brigham Young University, Provo, UT 84602, USA

Original format: PDF
Text language: English
Chinese Library Classification: Automation technology; computer technology
Date indexed: 2022-08-26 14:30:18

Similar literature


1. Automatic generation of agents for collecting hidden Web pages for data extraction [J]. Juliano Palmieri Lage, Altigran S. da Silva, Paulo B. Golgher. Data & Knowledge Engineering. 2004, No. 2
2. Hidden data states-based complex terminology extraction from textual web data model [J]. Quantum electronics. 2020, No. 6
3. Automatic labeling of hidden web data using Multi-Heuristics Annotator [J]. Umamageswari Baskaran, Kalpana Ramanujam. Future Computing and Informatics Journal. 2018, No. 2
4. On the Automatic Extraction of Data from the Hidden Web [C]. Stephen W. Liddle, Sai Ho Yau, David W. Embley. Workshops on conceptual modeling. 2002
5. An algebraic foundation for automatic semantic data integration on the hidden Web [D]. Hosain, Md. Shazzad. 2009
6. SteinerNet: a web server for integrating ‘omic’ data to discover hidden components of response pathways [O]. Nurcan Tuncbag, Scott McCallum, Shao-shan Carol Huang. 2012
7. A Novel Technique for Data Extraction from Hidden Web Databases [O]. 2011
