Ontology-based similarity measure for text clustering

Yan Duanwu; Li Xiaopeng; Wang Lei

Abstract

A method that combines category-based and keyword-based concepts to improve information retrieval is introduced. To improve document clustering, a document similarity measure is proposed that is based on the cosine vector measure and keyword frequency in documents and that also takes an ontology as input. The ontology is domain specific and includes a list of keywords organized by degree of importance to the categories of the ontology. By supplying semantic knowledge, the ontology can improve both the document similarity measure and the feedback of information retrieval systems. Two approaches to evaluating the performance of this similarity measure, and its comparison with the standard cosine vector similarity measure, are also described.
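
The abstract does not give the proposed formula, so the following is only a minimal sketch of the general idea, not the authors' measure. It compares the standard cosine similarity over raw term frequencies with a variant in which each term frequency is first scaled by an importance weight taken from an ontology. The weighting scheme, the `ONTOLOGY_WEIGHTS` table, the function names, and the sample documents are all assumptions made for illustration.

```python
import math
from collections import Counter


def term_frequencies(text):
    """Count lowercase, whitespace-separated terms in a document."""
    return Counter(text.lower().split())


def cosine(u, v):
    """Standard cosine similarity between two sparse term-weight mappings."""
    dot = sum(u[t] * v[t] for t in u.keys() & v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def ontology_weighted(tf, weights, default=1.0):
    """Scale each term frequency by its (assumed) ontology importance weight.

    Terms absent from the ontology keep the default weight.
    """
    return {term: count * weights.get(term, default) for term, count in tf.items()}


# Hypothetical domain ontology: keyword -> importance weight (illustrative values).
ONTOLOGY_WEIGHTS = {"ontology": 3.0, "clustering": 2.0, "retrieval": 2.0}

doc_a = "ontology clustering of text documents"
doc_b = "ontology based retrieval of web pages"

tf_a = term_frequencies(doc_a)
tf_b = term_frequencies(doc_b)

plain = cosine(tf_a, tf_b)
weighted = cosine(
    ontology_weighted(tf_a, ONTOLOGY_WEIGHTS),
    ontology_weighted(tf_b, ONTOLOGY_WEIGHTS),
)

print(f"plain cosine:      {plain:.3f}")
print(f"ontology-weighted: {weighted:.3f}")
```

In this toy example the two documents share the highly weighted keyword "ontology", so the weighted score comes out higher than the plain cosine score. Whether that effect helps clustering in practice depends on the quality of the ontology and on the evaluation the paper describes.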

Bibliographic record

Source: Journal of Southeast University, 2006, No. 3, pp. 389-393 (5 pages)
Authors: Yan Duanwu; Li Xiaopeng; Wang Lei
Affiliation: Department of Information Management, Nanjing University of Science and Technology, Nanjing 210094, China
Original format: PDF
Language: English
Chinese Library Classification: General industrial technology
Keywords: similarity measure; text clustering; ontology; information retrieval system
Date added to database: 2022-08-17 23:57:05

Similar literature

1. Comparison of Ontology-Based Semantic-Similarity Measures in the Biomedical Text [J]. Ahmad Fayez S. Althobaiti. Journal of Computer and Communications, 2017, No. 2.
2. Clustering rule bases using ontology-based similarity measures [J]. Saeed Hassanpour, Martin J. O'Connor, Amar K. Das. Journal of Web Semantics, 2014, issue MARa.
3. Medical Document Clustering Using Ontology-Based Term Similarity Measures [J]. Zhang Xiaodan, Jing Liping, Hu Xiaohua. International Journal of Data Warehousing and Mining, 2008, No. 1.
4. Self-adaptive GA, Quantitative Semantic Similarity Measures and Ontology-based Text Clustering [C]. Chengzhi ZHANG, Wei SONG, Chenghua LI. Proceedings of the 2008 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE 2008), 2008.
5. Ontology-based similarity for clustering in text space [D]. Assem, Nasser. 2002.
6. GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness [O]. Meng Liu, Paul D. Thomas. 2019.
7. Self-adaptive GA, quantitative semantic similarity measures and ontology-based text clustering [O]. Zhang Chengzhi, Song Wei, Li Chenghua. 2008.
