...
首页> 外文期刊>International Journal of Performability Engineering >Four-Layer Feature Selection Method for Scientific Literature based on Optimized K-Medoids and Apriori Algorithms
【24h】

Four-Layer Feature Selection Method for Scientific Literature based on Optimized K-Medoids and Apriori Algorithms

机译:基于优化K-yemoids和Apriori算法的科学文献四层特征选择方法

获取原文
获取原文并翻译 | 示例
           

摘要

With the increase in scientific literature, classifying scientific literature has become an important focus. Effectively selecting representative features from scientific literature has become a key step in scientific literature classification and influences the performance of scientific literature classification. According to the structural characteristics of scientific literature, we combine an optimized K-medoids algorithm, which firstly adopts information entropy to empower clustering objects to correct the distance function and then employs the corrected distance function to select the optimal initial clustering centres, with the Apriori algorithm to propose a four-layer feature selection method. The proposed feature selection method firstly divides scientific literature into four layers according to their structural characteristics, selects features layer by layer from the former three layers by means of the optimized K-medoids algorithm, subsequently mines the maximum frequent item sets from the fourth layer by the Apriori algorithm to act as the features of the fourth layer, and finally merges selected features of every layer and eliminates duplicate features to obtain the final feature set. Experimental results show that the proposed four-layer feature selection method achieves higher performance in scientific literature classification.
机译:随着科学文献的增加,课程归类于科学文学已成为一个重要的重点。有效地选择科学文学的代表特征已成为科学文献分类的关键步骤,影响科学文献分类的性能。根据科学文献的结构特征,结合了优化的K-METOIDS算法,首先采用信息熵来赋予集群对象来校正距离功能,然后采用校正的距离功能来选择最佳初始聚类中心,并使用APRiori选择最佳初始聚类中心。算法提出四层特征选择方法。所提出的特征选择方法首先将科学文献分为四层,根据它们的结构特征,通过优化的k-myoids算法从前三层选择特征层,随后挖掘从第四层的最大频繁项目集合APRIORI算法用作第四层的特征,最后合并每个层的所选特征,并消除重复的功能以获得最终功能集。实验结果表明,所提出的四层特征选择方法在科学文献分类中实现了更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号