首页> 外文学位 >Effect of ontology hierarchy on a concept vector machine's ability to classify web documents.
【24h】

Effect of ontology hierarchy on a concept vector machine's ability to classify web documents.

机译:本体层次结构对概念向量机对Web文档进行分类的能力的影响。

获取原文
获取原文并翻译 | 示例

摘要

As the quantity of text documents created on the web grows the ability of experts to manually classify them has decreased. Because people need to find and organize this information, interest has grown in developing automatic means of categorizing these documents. In this effort, ontologies have been developed that capture domain specific knowledge in the form of a hierarchy of concepts.;Support Vector Machines are machine learning methods that are widely used for automated document categorization. Recent studies suggest that the classification accuracy of a Support Vector Machine may be improved by using concepts defined by a domain ontology instead of using the words that appear in the document. However, such studies have not taken into account the hierarchy inherent in the relationship between concepts. The goal of this dissertation was to investigate whether the hierarchical relationships among concepts in ontologies can be exploited to improve the classification accuracy of web documents by a Support Vector Machine.;Concept vectors that capture the hierarchy of domain ontologies were created and used to train a Support Vector Machine. Tests conducted using the benchmark Reuters-21578 data set indicate that the Support Vector Machines achieve higher classification accuracy when they make use of the hierarchical relationships among concepts in ontologies.
机译:随着在网络上创建的文本文档数量的增长,专家对其进行手动分类的能力下降了。由于人们需要查找和整理这些信息,因此人们对开发自动分类这些文档的方法的兴趣日益浓厚。在这种努力下,已经开发出以概念层次结构的形式捕获领域特定知识的本体。支持向量机是广泛用于自动化文档分类的机器学习方法。最近的研究表明,通过使用领域本体定义的概念而不是使用文档中出现的单词,可以提高支持向量机的分类精度。但是,此类研究并未考虑概念之间关系固有的层次结构。本文的目的是研究支持向量机是否可以利用本体中概念之间的层次关系来提高Web文档的分类准确性。;创建了捕获域本体层次结构的概念向量,并将其用于训练支持向量机。使用基准的Reuters-21578数据集进行的测试表明,当支持向量机利用本体中概念之间的层次关系时,它们可以实现更高的分类精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号