首页> 外文会议>Integrated Design amp; Process Technology vol.1(IDPT-Vol.1, 2005) >WEB DOCUMENT CLASSIFICATION BY CONSIDERINGWORD-CONCEPT AND CONCEPT-ONTOLOGY SUPPORT
【24h】

WEB DOCUMENT CLASSIFICATION BY CONSIDERINGWORD-CONCEPT AND CONCEPT-ONTOLOGY SUPPORT

机译:考虑单词概念和本体概念支持的Web文档分类

获取原文
获取原文并翻译 | 示例

摘要

With the rapid growth of Semantic Web, retrievalorientedrntask is among the best-developed in semanticrnweb applications. One of the most important techniquesrnused for this task is document classification. Integratingrnthe background knowledge with the classification algorithmrnto improve the classification efficiency and accuracyrnis a very interesting topic. The paper proposes arnnovel classification algorithm based on Naive Bayesrnapproach. Given a set of ontologies,our task is to classifyrndocuments with the ontologies by calculating the supportrnof the documents to every ontology. Firstly we will obtainrnthe support Pco as the semantically-based supportrnfor a concept to its affiliated ontology through enrichingrnthe concept relevant words in ontology. Secondly, wernwill calculate the support Pwc for each word in the webrndocument, which reflects the support of the word in thernweb document to every ontology concept. Further, wernwill integrate the two kinds of support Pco and Pwc torncalculate the support of the web document to each ontology.rnSo far, we can classify the document to the ontologyrnhaving largest support value or to multi ontologiesrnhaving support values larger than a threshold. The experimentalrnresults show our algorithm is effective forrnontology based classification.
机译:随着语义Web的快速发展,面向检索的任务已成为语义Web应用程序中最先进的任务之一。用于此任务的最重要技术之一是文档分类。将背景知识与分类算法集成在一起以提高分类效率和准确性是一个非常有趣的话题。提出了一种基于朴素贝叶斯方法的arnnovel分类算法。给定一组本体,我们的任务是通过计算文档对每种本体的支持度来对文档和本体进行分类。首先,通过丰富本体中与概念相关的词,我们将获得支持Pco作为其相关本体的概念的基于语义的支持。其次,我们将计算Web文档中每个单词的支持Pwc,这将Web文档中的单词对每个本体概念的支持反映出来。此外,我们将整合两种支持Pco和Pwc,以计算Web文档对每个本体的支持。到目前为止,我们可以将文档分类为具有最大支持值的本体,或分类为具有大于阈值的支持值的多个本体。实验结果表明我们的算法是基于有效的分类学分类的。

著录项

  • 来源
  • 会议地点 Beijing(CN)
  • 作者单位

    Department of Computer Science and TechnologyrnUniversity of Science and Technology of ChinarnHefei, Anhui, 230027, P.R. China;

    Department of Computer Science and TechnologyrnUniversity of Science and Technology of ChinarnHefei, Anhui, 230027, P.R. China;

    Department of Computer Science and TechnologyrnUniversity of Science and Technology of ChinarnHefei, Anhui, 230027, P.R. China;

    Department of EECSrnUniversity of CaliforniarnIrvine, CA 92697;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-26 14:23:39

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号