International Conference on Advanced Science and Technology

Using an Integrated Ontology Database to Categorize Web Pages

Abstract

Current classification methods are mostly based on the Vector Space Model (VSM), which accounts only for term frequency in documents and ignores important semantic relationships between key terms. We propose a system that uses integrated ontologies and Natural Language Processing techniques to index texts. The traditional word matrix is replaced by a concept-based matrix. For this purpose, we developed fully automated methods for mapping keywords to their corresponding ontology concepts. A Support Vector Machine, a successful machine learning technique, is used for classification. Experimental results show that our proposed method does improve text classification performance significantly.
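
The idea of replacing a word matrix with a concept-based matrix can be illustrated with a minimal sketch. This is not the authors' system: the abstract does not describe the ontology database or the mapping method, so the `TOY_ONTOLOGY` dictionary and the functions `to_concepts` and `concept_matrix` below are hypothetical stand-ins, and a simple dictionary lookup takes the place of the automated mapping.

```python
from collections import Counter

# Hypothetical toy ontology (keyword -> concept). The integrated ontology
# database used in the paper is not described in the abstract.
TOY_ONTOLOGY = {
    "car": "vehicle",
    "automobile": "vehicle",
    "truck": "vehicle",
    "physician": "medical_professional",
    "doctor": "medical_professional",
    "nurse": "medical_professional",
}


def to_concepts(text):
    """Map each known keyword in the text to its concept; skip other tokens."""
    tokens = text.lower().split()
    return [TOY_ONTOLOGY[t] for t in tokens if t in TOY_ONTOLOGY]


def concept_matrix(docs):
    """Return (sorted concept list, one row of concept counts per document)."""
    counts = [Counter(to_concepts(d)) for d in docs]
    concepts = sorted({c for cnt in counts for c in cnt})
    rows = [[cnt[c] for c in concepts] for cnt in counts]
    return concepts, rows


docs = [
    "the car passed a truck",
    "an automobile hit the truck",
    "the doctor called a nurse",
]
concepts, rows = concept_matrix(docs)
print(concepts)  # ['medical_professional', 'vehicle']
print(rows)      # [[0, 2], [0, 2], [2, 0]]
```

The first two documents share only the word "truck" in a word-level matrix, yet their concept rows are identical because "car" and "automobile" map to the same concept. Rows of this kind could then be passed as feature vectors to a classifier such as a Support Vector Machine.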
