XML Document Classification Using Closed Frequent Subtree

机译：使用封闭的频繁子树进行XML文档分类

获取原文

获取原文并翻译 | 示例

AI期刊论文写作 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

An efficient classification approach for XML documents is introduced in this paper, which lies in combining the content with the structure of XML documents to compute the similarity between the categories and documents. It is based on the Support Vector Machine (SVM) algorithm and the Structured Link Vector Model (SLVM) which used closed frequent subtrees as the structural units. The document tree pruning strategy was applied to improve the classification system while the link information between the documents was considered to get better classification results. We did experiments on the INEX XML mining data sets combining these techniques, and the results showed that our approach performs better than any other competitor's approach on XML classification.

机译：本文介绍了一种有效的XML文档分类方法，该方法是将内容与XML文档的结构相结合，以计算类别与文档之间的相似度。它基于支持向量机（SVM）算法和结构化链接向量模型（SLVM），后者使用封闭的频繁子树作为结构单元。应用文档树修剪策略来改进分类系统，同时考虑文档之间的链接信息以获得更好的分类结果。我们结合这些技术对INEX XML挖掘数据集进行了实验，结果表明，我们的方法在XML分类方面比任何其他竞争对手的方法都要好。

著录项

来源
《Web-age information management》|2012年|350-359|共10页
会议地点 Harbin(CN)
作者
Songlin Wang; Yihong Hong; Jianwu Yang;
展开▼
作者单位

Institute of Computer Sci. Tech., Peking University, Beijing 100871, China;

Institute of Computer Sci. Tech., Peking University, Beijing 100871, China;

Institute of Computer Sci. Tech., Peking University, Beijing 100871, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
XML Document; Text Classification; Frequent Subtree;

机译：XML文件；文字分类；频繁的子树;

相似文献

外文文献
中文文献
专利

1. Mining of closed frequent subtrees from frequently updated databases [J] . Viet Anh Nguyen, Akihiro Yamamoto Intelligent data analysis . 2012,第6期

机译：从频繁更新的数据库中挖掘封闭的频繁子树
2. XML subtree reconstruction from relational storage of XML documents [J] . Artem Chebotko, Mustafa Atay, Shiyong Lu, Data & Knowledge Engineering . 2007,第2期

机译：从XML文档的关系存储重构XML子树
3. Discovering Frequent Subtrees from XML Data Using Neural Networks [J] . SUN Wei, LIU Da-xin, WANG Tong Wuhan University Journal of Natural Sciences . 2006,第1期

机译：使用神经网络从XML数据中发现频繁的子树
4. XML Document Classification Using Closed Frequent Subtree [C] . Songlin Wang, Yihong Hong, Jianwu Yang International Workshops on Web-Age Information Management . 2012

机译：XML文档分类使用封闭频繁的子树
5. A bottom-up approach for XML document classification. [D] . Wu, Junwei. 2009

机译：XML文档分类的自底向上方法。

XML Document Classification Using Closed Frequent Subtree

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅