首页> 外文期刊>International Journal of Computer Science & Information Technology (IJCSIT) >On Feature Selection Algorithms and Feature Selection Stability Measures : A Comparative Analysis
【24h】

On Feature Selection Algorithms and Feature Selection Stability Measures : A Comparative Analysis

机译:特征选择算法与特征选择稳定性测度的比较分析

获取原文
           

摘要

Data mining is indispensable for business organizations for extracting useful information from the hugevolume of stored data which can be used in managerial decision making to survive in the competition. Dueto the day-to-day advancements in information and communication technology, these data collected from ecommerceand e-governance are mostly high dimensional. Data mining prefers small datasets than highdimensional datasets. Feature selection is an important dimensionality reduction technique. The subsetsselected in subsequent iterations by feature selection should be same or similar even in case of smallperturbations of the dataset and is called as selection stability. It is recently becomes important topic ofresearch community. The selection stability has been measured by various measures. This paper analysesthe selection of the suitable search method and stability measure for the feature selection algorithms andalso the influence of the characteristics of the dataset as the choice of the best approach is highly problemdependent.
机译:数据挖掘对于企业组织从大量存储的数据中提取有用的信息是必不可少的,这些信息可用于管理决策中以在竞争中生存。由于信息和通信技术的日新月异,从电子商务和电子政务收集的这些数据大多是高维的。数据挖掘比高维数据集更喜欢小型数据集。特征选择是一种重要的降维技术。即使在数据集扰动较小的情况下,通过特征选择在后续迭代中选择的子集也应相同或相似,这称为选择稳定性。它已成为近来研究界的重要课题。选择稳定性已通过各种方法测量。本文分析了针对特征选择算法的合适搜索方法和稳定性测度的选择,以及由于最佳方法的选择高度依赖问题,对数据集特征的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号