Information Extraction Using XPath


Abstract

To improve the classification accuracy of documents, it is important to characterize not only words but also the relations among them. Classification from this point of view requires a different approach to document analysis. In this paper, we first show that finding a pattern tree as an embedded sub-tree of an XML data tree can be done simply by applying XPath. This problem corresponds to searching XML documents for characteristic words and their relations. The second problem is determining which words and relations actually occur in the XML documents, that is, finding the most frequent patterns in the documents, which are often called the most frequent sub-trees in the XML domain. This second problem is also solved simply by applying XPath.
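The two problems named in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the pattern representation `(tag, [children])`, the sample documents, and all function names are assumptions made for illustration. The sketch treats a pattern as embedded when every pattern edge maps to an ancestor-descendant relation, ignores sibling order, and does not require distinct pattern nodes to map to distinct elements. Because Python's standard `xml.etree.ElementTree` supports only a subset of XPath, the nested predicate is evaluated recursively with `.//tag` steps, and the equivalent full XPath string is only rendered for reference. The frequency part counts only two-node (ancestor, descendant) patterns by the number of documents that contain them.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample documents; the tags are illustrative only.
DOCS = [
    "<doc><sec><np>XPath</np><vp><np>tree</np></vp></sec></doc>",
    "<doc><sec><vp><np>pattern</np></vp></sec></doc>",
    "<doc><np>word</np></doc>",
]

# Hypothetical pattern tree: doc has a descendant vp, which has a descendant np.
PATTERN = ("doc", [("vp", [("np", [])])])


def to_xpath(pattern):
    """Render a pattern tree as an XPath expression with descendant steps."""
    tag, children = pattern
    return tag + "".join(f"[.//{to_xpath(child)}]" for child in children)


def matches_at(elem, pattern):
    """Return True if the pattern is embedded with its root mapped to elem."""
    tag, children = pattern
    if elem.tag != tag:
        return False
    # Each pattern child must match at some proper descendant of elem.
    return all(
        any(matches_at(desc, child) for desc in elem.findall(f".//{child[0]}"))
        for child in children
    )


def find_embedded(root, pattern):
    """Return all elements under root (inclusive) where the pattern is embedded."""
    return [e for e in root.iter(pattern[0]) if matches_at(e, pattern)]


def pair_patterns(root):
    """Collect the distinct (ancestor tag, descendant tag) pairs in one document."""
    pairs = set()
    for anc in root.iter():
        for desc in anc.findall(".//*"):
            pairs.add((anc.tag, desc.tag))
    return pairs


def frequent_pairs(docs, min_support):
    """Return pairs that occur in at least min_support documents."""
    counts = Counter()
    for xml_text in docs:
        counts.update(pair_patterns(ET.fromstring(xml_text)))
    return {pair: n for pair, n in counts.items() if n >= min_support}


if __name__ == "__main__":
    print("//" + to_xpath(PATTERN))
    for xml_text in DOCS:
        hits = find_embedded(ET.fromstring(xml_text), PATTERN)
        print(len(hits), "match(es) in", xml_text)
    print(frequent_pairs(DOCS, min_support=2))
```

In the first two sample documents `vp` is a grandchild of `doc`, not a child, so the pattern matches only because embedded sub-trees allow ancestor-descendant edges. A matcher restricted to parent-child edges would reject both.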


Bibliographic details

Source: International Conference on Knowledge-Based and Intelligent Information and Engineering Systems (KES 2010), 2010, pp. 104-112 (9 pages)
Authors: Masashi Okada; Naohiro Ishii; Ippei Torii
Original format: PDF
Chinese Library Classification: Artificial intelligence theory
Date indexed: 2022-08-26 15:03:36

Similar literature

1. OXPath: A language for scalable data extraction, automation, and crawling on the deep web [J]. Tim Furche, Georg Gottlob, Giovanni Grasso. The VLDB Journal. 2013, No. 1
2. Predicate enrichment of aligned XPaths for wrapper induction [J]. Nielandt Joachim, Bronselaer Antoon, de Tre Guy. Expert Systems with Applications. 2016, June
3. Sample-based XPath Ranking for Web Information Extraction [C]. Oliver Jundt, Maurice van Keulen. Conference of the European Society for Fuzzy Logic and Technology. 2013
4. Parallel XML and XPath Parsing [D]. Zhang, Ying. 2018
5. Querying archetype-based EHRs by search ontology-based XPath engineering [O]. Stefan Kropf, Alexandr Uciteli, Katrin Schierle. 2018
6. Sample-based XPath Ranking for Web Information Extraction [O]. Jundt, Oliver; van Keulen, Maurice. 2013
7. Using the XPATHS computer code [R]. Cable, G. D. 1989
