An Ontology Based Approach for Chinese Web Texts Classification

G.Y. Wei; G.X. Wu; Y.Y. Gu; Y. Ling

首页> 外文期刊>Information Technology Journal >An Ontology Based Approach for Chinese Web Texts Classification

【24h】

An Ontology Based Approach for Chinese Web Texts Classification

机译：基于本体的中文网页文本分类方法

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The world wide web is a vast resource of information and services that continues to grow rapidly. Developing an automatic classifier, which has ability of classifying documents into appropriate categories predefined in the topic structure based on document contents is a crucial task. Traditional methods of documents classification need characteristic abstraction and classifier training. The work of collecting trainable text terms is laborious and time-consuming. In order to solve the problem, this study proposes an ontology based approach to improve the efficiency and effectiveness of Chinese web documents classification and retrieval. First, the approach establishes an ontology model based on knowledge base. Second, it creates ontology for each subclass of the classification system. It uses RDFS to convert knowledge into ontology and to define the relations among ontology. Finally, web documents classification is performed automatically using the ontology relevance calculating algorithm. Present experiments show that the accuracy of ontology based approach is very close to most classical methods includes Support Vector Machines, K-Nearest Neighbor and Latent Semantic Analysis. Additionally, ontology based algorithm is more stable and robust and can obtain better recalling rate than other three methods.

机译：万维网是不断增长的大量信息和服务资源。开发自动分类器是一项至关重要的任务，它具有将文档分类为基于文档内容在主题结构中预定义的适当类别的能力。传统的文档分类方法需要特征抽象和分类器训练。收集可训练的文本术语的工作既费力又费时。为了解决该问题，本研究提出了一种基于本体的方法，以提高中文网络文档分类和检索的效率。首先，该方法建立了一个基于知识库的本体模型。其次，它为分类系统的每个子类创建本体。它使用RDFS将知识转换为本体并定义本体之间的关系。最后，使用本体相关性计算算法自动执行Web文档分类。当前的实验表明，基于本体的方法的准确性非常接近于大多数经典方法，包括支持向量机，K最近邻和潜在语义分析。另外，基于本体的算法比其他三种方法更稳定，更健壮，并且召回率更高。

著录项

来源
《Information Technology Journal》 |2008年第5期|共6页
作者
G.Y. Wei; G.X. Wu; Y.Y. Gu; Y. Ling;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. An Ontology Based Approach for Chinese Web Texts Classification [J] . G. Y. Wei, G. X. Wu, Y. Y. Gu, Information Technology Journal . 2008,第5期

机译：基于本体的中文网页文本分类方法
2. A Novel Approach for Ontology- Based Feature Vector Generation for Web Text Document Classification [J] . Mohamed K. Elhadad, Khaled M. Badran, Gouda I. Salama International journal of software innovation . 2018,第1期

机译：基于本体的特征向量的Web文本文档分类新方法
3. A novel approach for ontology-based dimensionality reduction for web text document classification [J] . Elhadad Mohamed K., Badran Khaled Shafee S., Salama Gouda I. International journal of software innovation . 2017,第4期

机译：基于本体的Web文本文档分类降维的新方法
4. A novel approach for ontology-based dimensionality reduction for web text document classification [C] . Mohamed K. Elhadad, KhaledM. Badran, Gouda I. Salama IEEE/ACIS International Conference on Computer and Information Science . 2017

机译：基于本体的Web文本文档分类降维的新方法
5. An ontology-driven concept-based information retrieveal approach for Web documents. [D] . Li, Zhan. 2010

机译：基于本体的基于概念的Web文档信息检索方法。
6. Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation [O] . Kimberly Van Auken, Joshua Jaffery, Juancarlos Chan, 2009

机译：蛋白质亚细胞定位的半自动化管理：基于文本挖掘的基因本体（GO）细胞成分管理方法
7. An Ontology Based Approach for Chinese Web Texts Classification [O] . G.Y. Wei, G.X. Wu, Y.Y. Gu, 2008

机译：基于本体的中国网络文本分类方法
8. Case-Based Classification Alternatives to Ontologies for Automated Web Service Discovery and Integration. [R] . Ladner, R., Warner, E., Petry, F., 2006

机译：基于案例的分类替代自动化Web服务发现和集成的本体。

An Ontology Based Approach for Chinese Web Texts Classification

摘要

著录项

相似文献

相关主题

期刊订阅