首页> 外文会议>Intelligent Data Engineering and Automated Learning >A case-based transformation from HTML to XML: an extended abstract
【24h】

A case-based transformation from HTML to XML: an extended abstract

机译:从HTML到XML的基于案例转换:扩展摘要

获取原文

摘要

Recently, a huge quantity of HTML documents have been created in Internet, which really constitute a treasury of information. HTML, however, is designed mainly for reading with browsers, and not suitable for machine processing, whereas XML was proposed as a solution for this problem. In this paper, we give a case-based transformation method from HTML documents to XML ones. There are many series of HTML pages in actual Web sites, and each page of a series usually has a quite similar structure with each other. Therefore a case-based transformation must be a promising method in practice for a semi-automatic transformation from HTML to XML. Throughout experimental evaluations, we show this case-based method achieved a highly accurate transformation, i.e., 85% of actual 80 pages can be transformed in a correct way, with this case-based method.
机译:最近,在互联网上创建了大量的HTML文件,这确实构成了信息的财政部。然而,HTML的设计主要用于阅读浏览器,而不适合机器处理,而XML被提出为该问题的解决方案。在本文中,我们向XML文档提供了一种基于案例的转换方法。实际网站中有许多HTML页面,并且系列的每个页面通常都具有相似的结构彼此。因此,基于案例的转换必须是从HTML到XML的半自动转换的实践中的有希望的方法。在整个实验评估中,我们展示了基于案例的方法实现了高精度的转换,即,85%的实际80页可以以正确的方式转换,采用这种基于案例的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号