首页> 外文期刊>Information Technology Journal >An Ontology Based Approach for Chinese Web Texts Classification
【24h】

An Ontology Based Approach for Chinese Web Texts Classification

机译:基于本体的中文网页文本分类方法

获取原文
       

摘要

The world wide web is a vast resource of information and services that continues to grow rapidly. Developing an automatic classifier, which has ability of classifying documents into appropriate categories predefined in the topic structure based on document contents is a crucial task. Traditional methods of documents classification need characteristic abstraction and classifier training. The work of collecting trainable text terms is laborious and time-consuming. In order to solve the problem, this study proposes an ontology based approach to improve the efficiency and effectiveness of Chinese web documents classification and retrieval. First, the approach establishes an ontology model based on knowledge base. Second, it creates ontology for each subclass of the classification system. It uses RDFS to convert knowledge into ontology and to define the relations among ontology. Finally, web documents classification is performed automatically using the ontology relevance calculating algorithm. Present experiments show that the accuracy of ontology based approach is very close to most classical methods includes Support Vector Machines, K-Nearest Neighbor and Latent Semantic Analysis. Additionally, ontology based algorithm is more stable and robust and can obtain better recalling rate than other three methods.
机译:万维网是不断增长的大量信息和服务资源。开发自动分类器是一项至关重要的任务,它具有将文档分类为基于文档内容在主题结构中预定义的适当类别的能力。传统的文档分类方法需要特征抽象和分类器训练。收集可训练的文本术语的工作既费力又费时。为了解决该问题,本研究提出了一种基于本体的方法,以提高中文网络文档分类和检索的效率。首先,该方法建立了一个基于知识库的本体模型。其次,它为分类系统的每个子类创建本体。它使用RDFS将知识转换为本体并定义本体之间的关系。最后,使用本体相关性计算算法自动执行Web文档分类。当前的实验表明,基于本体的方法的准确性非常接近于大多数经典方法,包括支持向量机,K最近邻和潜在语义分析。另外,基于本体的算法比其他三种方法更稳定,更健壮,并且召回率更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号