首页> 外文会议>Web information systems and mining >Associating Labels and Elements of Deep Web Query Interface Based on DOM
【24h】

Associating Labels and Elements of Deep Web Query Interface Based on DOM

机译:基于DOM的深度Web查询接口的标签和元素关联

获取原文
获取原文并翻译 | 示例

摘要

Query interface schema extraction is an important issue for Deep Web data acquisition and integration. In order to obtain the query interface schema, it is firstly required to associate elements and labels of Deep Web query interface correctly. Due to the fact that query interface on HTML page can be parsed as well structured DOM, we proposed an effective algorithm for associating elements and labels of Deep Web query interface based on hierarchical DOM. Our algorithm mainly adopted the nearest-neighbor-distance and other two useful heuristic rules to associate the most related label of a given control element. The experimental results on real query interfaces show that our proposed algorithm is highly effective
机译:查询接口架构提取是Deep Web数据获取和集成的重要问题。为了获得查询接口架构,首先需要正确关联Deep Web查询接口的元素和标签。由于可以解析HTML页面上的查询界面以及结构化的DOM,因此,我们提出了一种有效的基于分层DOM的Deep Web查询界面的元素和标签关联算法。我们的算法主要采用最近邻距离和其他两个有用的启发式规则来关联给定控制元素的最相关标签。在真实查询接口上的实验结果表明,该算法是有效的。

著录项

  • 来源
  • 会议地点 Chengdu(CN)
  • 作者单位

    School of Computer Science and Engineering, Guilin University of Electronic Technology, Guilin, 541004, P.R. China,College of Computer and Information Science, Southwest University. Chongqing, 400715, P.R. China;

    School of Computer Science and Engineering, Guilin University of Electronic Technology, Guilin, 541004, P.R. China;

    College of Computer and Information Science, Southwest University. Chongqing, 400715, P.R. China;

    School of Computer Science and Engineering, Guilin University of Electronic Technology, Guilin, 541004, P.R. China;

    School of Computer Science and Engineering, Guilin University of Electronic Technology, Guilin, 541004, P.R. China;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Deep web; element and label association; DOM; query interface;

    机译:深网;元素和标签关联; DOM;查询界面;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号