首页> 外文会议>Annual Workshop on Semantic Web and Ontology(SWON2006) >Ontology-based similarity measure for text clustering
【24h】

Ontology-based similarity measure for text clustering

机译:基于本体的文本聚类的相似性度量

获取原文

摘要

A method that combines category-based and keyword-based concepts for a better information retrieval system is introduced. To improve document clustering, a document similarity measure based on cosine vector and keywords frequency in documents is proposed, but also with an input ontology. The ontology is domain specific and includes a list of keywords organized by degree of importance to the categories of the ontology, and by means of semantic knowledge, the ontology can improve the effects of document similarity measure and feedback of information retrieval systems. Two approaches to evaluating the performance of this similarity measure and the comparison with standard cosine vector similarity measure are also described.
机译:介绍了一种组合基于类别的基于关键字的概念的方法,用于更好的信息检索系统。 为了提高文档聚类,提出了一种基于余弦矢量和文档中的关键字频率的文档相似度测量,但也具有输入本体。 本体是特定的域,包括由对本体类别类别的重要性组织的关键字列表,并且通过语义知识,本体可以提高文档相似度测量的影响和信息检索系统的反馈。 还描述了评估该相似度测量性能的两种方法以及与标准余弦矢量相似度测量的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号