【24h】

Regression testing for wrapper maintenance

机译:包装维护的回归测试

获取原文
获取原文并翻译 | 示例

摘要

Recent work on Internet information integration assumes a library of wrappers, specialized information extraction procedures. Maintaining wrappers is difficult, because the formatting regularities on which they rely often change. The wrapper verification problem is to determine whether a wrapper is correct. Standard regression testing approaches are inappropriate, because both the formatting regularities and a site's underlying content may change. We introduce RAPTURE, a fully-implemented, domain-independent verification algorithm. RAPTURE uses well-motivated heuristics to compute the similarity between a wrapper's expected and observed output. Experiments with 27 actual Internet sites show a substantial performance improvement over standard regression testing.
机译:Internet信息集成的最新工作假定包装器库是专门的信息提取程序。维护包装器很困难,因为包装器所依赖的格式规则经常会发生变化。包装器验证问题是确定包装器是否正确。标准回归测试方法是不合适的,因为格式设置规则和网站的基础内容都可能更改。我们介绍RAPTURE,这是一种完全实现的,独立于域的验证算法。 RAPTURE使用动机良好的试探法来计算包装程序的预期输出和观察到的输出之间的相似度。在27个实际的Internet站点上进行的实验表明,与标准回归测试相比,其性能有了显着提高。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号