...
首页> 外文期刊>International Journal of Computer Trends and Technology >An Efficient Concept-Based Mining Model for Enhancing Text Clustering
【24h】

An Efficient Concept-Based Mining Model for Enhancing Text Clustering

机译:基于概念的高效挖掘模型,用于增强文本聚类

获取原文

摘要

The common techniques in text mining are based on the statistical analysis of a term, either word or phrase.Text is represented by the words it mentions, and thematic similarity is based on the proportion of words that texts have in common. The complex is constructed using groups of cooccurring words (term associations) identified using traditional data mining methods. Disjoint subsections of the complex (connect components) represent general concepts within the documents’ concept space. A new conceptbased mining model composed of four components, is proposed to improve the text clustering quality. By exploiting the semantic structure of the sentences in documents, a better text clustering result is achieved.
机译:文本挖掘中的常用技术是基于对单词(无论是单词还是短语)的统计分析,文本由其提及的单词表示,主题相似性基于文本共有的单词比例。使用通过使用传统数据挖掘方法识别的同现单词(术语关联)组构造复合体。复合体(连接组件)的不相交小节代表文档概念空间中的一般概念。为了提高文本聚类质量,提出了一种新的基于概念的,由四个部分组成的挖掘模型。通过利用文档中句子的语义结构,可以获得更好的文本聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号