首页> 外文会议>International Conference on Pattern Recognition and Machine Intelligence >An Automatic Approach to Classify Web Documents Using a Domain Ontology
【24h】

An Automatic Approach to Classify Web Documents Using a Domain Ontology

机译:使用域本体对Web文档进行分类的自动方法

获取原文

摘要

This paper suggests an automated method for document classification using an ontology, which expresses terminology information and vocabulary contained in Web documents by way of a hierarchical structure. Ontology-based document classification involves determining document features that represent the Web documents most accurately, and classifying them into the most appropriate categories after analyzing their contents by using at least two pre-defined categories per given document features. In this paper, Web documents are classified in real time not with experimental data or a learning process, but by similar calculations between the terminology information extracted from Web texts and ontology categories. This results in a more accurate document classification since the meanings and relationships unique to each document are determined.
机译:本文建议使用本体进行文档分类的自动化方法,其表示通过分层结构表示Web文档中包含的术语信息和词汇。基于本体的文档分类涉及确定最准确地表示Web文档的文档功能,并在通过使用每个给定文档功能的至少两个预定义的类别分析其内容后将它们分类为最合适的类别。在本文中,Web文档实时分类而不是实验数据或学习过程,而是通过从Web文本和本体类别中提取的术语信息之间的类似计算。这导致更准确的文档分类,因为确定了每个文档唯一的含义和关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号