首页> 外文期刊>International Journal of Computer Trends and Technology >Discriminative Features Selection in Text Mining Using TF-IDF Scheme
【24h】

Discriminative Features Selection in Text Mining Using TF-IDF Scheme

机译:使用TF-IDF方案的文本挖掘中的歧视性特征选择

获取原文
           

摘要

——This paper describes technique for discriminative features selection in Text mining. 'Text mining’ is the discovery of new, previously unknown information, by computer. Discriminative features are the most important keywords or terms inside document collection which describe the informative news included in the document collection. Generated keyword set are used to discover Association Rules amongst keywords labeling the document. For feature extraction Information Retrieval Scheme i.e. TFIDF is used. This system uses previous work, which contains Text Preprocessing Phases (filtration and stemming). This work serves as basis for Association Rule Mining Phase. Association rule mining represents a Text Mining technique and its goal is to find interesting association or correlation relationships among a large set of data items. With massive amounts of data continuously being collected and stored in databases, many companies are becoming interested in mining association rules from their databases to increase their profits Knowledge discovery in databases (KDD) is the process of finding useful information and pattern in data.
机译:-本文介绍了文本挖掘中的判别特征选择技术。 “文本挖掘”是通过计算机发现以前未知的新信息。区别特征是文档集合中最重要的关键字或术语,它们描述文档集合中包含的信息性新闻。生成的关键字集用于在标记文档的关键字中发现关联规则。对于特征提取,使用信息检索方案,即TFIDF。该系统使用以前的工作,其中包含文本预处理阶段(过滤和词干提取)。这项工作是关联规则挖掘阶段的基础。关联规则挖掘代表一种文本挖掘技术,其目标是在大量数据项之间找到有趣的关联或相关关系。随着大量数据的不断收集和存储在数据库中,许多公司开始对从其数据库中挖掘关联规则以增加其利润感兴趣,而数据库中的知识发现(KDD)是寻找有用信息和数据模式的过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号