The Realization of Web Information Extraction Based on XML

机译：基于XML的Web信息提取实现

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper introduces a method of web information extraction based on XML. Firstly, it converts the data from HTML to XHTML with tidy tools, and then locates the anchor which is tied to content by path expression, at last maps extraction result to XML file with XSL. This is a method of converting unstructured data to structured data, which is possible for application program to use data of web. An example is realized about earthquake information extraction. The extraction rules are simple, robust and the codes can be widely adopted.

机译：本文介绍了一种基于XML的Web信息提取方法。首先，它将来自HTML的数据与整洁的工具转换为XHTML，然后将锚点定位为通过路径表达式绑定到内容的锚点，最后映射提取结果与XSL的XML文件。这是将非结构化数据转换为结构化数据的方法，该数据可以使用Web的数据。关于地震信息提取实现了一个例子。提取规则简单，稳健，可以广泛采用代码。

著录项

来源
《International Conference on Management Science and Intelligent Control》|2011年||共3页
会议地点
作者
Shu Qin HUANG;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
eXtensible Markup Language; Web Information Extraction; extensible Stylesheet; Language unstructured; data structured data;

机译：可扩展标记语言;Web信息提取;可扩展样式表;语言非结构化;数据结构数据;

相似文献

外文文献
中文文献
专利

1. Realization Mechanism of Intelligent Comparison-Shopping Systems based on Web Information Extraction [J] . Xun Wang Haiwei Jin Zhenyue Chen International journal of computer science and network security . 2006,第6期

机译：基于Web信息提取的智能比较购物系统的实现机制
2. Plane partition realization of (web of) $W$ $$ mathcal{W} $$ -algebra minimal models [J] . Koichi Harada, Yutaka Matsuo The journal of high energy physics . 2019,第2期

机译：平面分区（Web的网页） $w$$ mathcal {w} $$ -algebra最小型号$
3. Effective Web data extraction with standard XML technologies [J] . Jussi Myllymaki Computer networks . 2002,第5期

机译：使用标准XML技术进行有效的Web数据提取
4. The Realization of Web Information Extraction Based on XML [C] . Shu Qin HUANG International Conference on Management Science and Intelligent Control . 2011

机译：基于XML的Web信息提取实现
5. Realization of resource-efficient embedded Web services using Representational State Transfer (REST) packaging and roll-back streaming XML (RBStreX) parser. [D] . Chee Er, Chang. 2011

机译：使用代表性状态传输（REST）打包和回滚流XML（RBStreX）解析器来实现资源有效的嵌入式Web服务。
6. A full XML-based approach to creating hypermedia learning modules in web-based environments: application to a pathology course [O] . Pascal Staccini, Jean-Charles Dufour, Michel Joubert, 2003

机译：在基于Web的环境中用于创建超媒体学习模块的基于XML的完整方法：应用于病理学课程
7. The extraction of ϕ–N total cross section from d(γ,pK+K−)n [O] . X. Qian, W. Chen, H. Gao, 2009

机译： φ - n 总横截面< mml：math altimg =“si2.gif”overflow =“滚动”xmlns：xocs =“http://www.elsevier.com/xml/xocs/dtd”xmlns：xs =“http://www.w3.org / 2001 / xmlschema“xmlns：xsi =”http://www.w3.org/2001/xmlschema-instance“xmlns =”http://www.elsevier.com/xml/ja/dtd“xmlns：ja =” http://www.elsevier.com/xml/ja/dtd“xmlns：m ml =“http://www.w3.org/1998/math/mathml”xmlns：tb =“http://www.elsevier.com/xml/common/table/dtd”xmlns：sb =“http：/ /www.elsevier.com/xml/common/struct-bib/dtd“xmlns：ce =”http://www.elsevier.com/xml/common/dtd“xmlns：xlink =”http：//www.w3 .org / 1999 / xlink“xmlns：cals =”http://www.elsevier.com/xml/common/cals/dtd“> d （ γ ， p k + k - ） n

The Realization of Web Information Extraction Based on XML

摘要

著录项

相似文献

相关主题

期刊订阅