...
首页> 外文期刊>International journal of computers, communications and control >Data Dimensionality Reduction for Data Mining: A Combined Filter-Wrapper Framework
【24h】

Data Dimensionality Reduction for Data Mining: A Combined Filter-Wrapper Framework

机译:减少数据挖掘的数据维数:组合的Filter-Wrapper框架

获取原文
           

摘要

Knowledge Discovery in Databases aims to extract new, interesting and potential useful patterns from large amounts of data. It is a complex process whose central point is data mining, which effectively builds models from data. Data type, quality and dimensionality are some factors which affect performance of data mining task. Since the high dimensionality of data can cause some troubles, as data overload, a possible solution could be its reduction. Sampling and filtering reduce the number of cases in a dataset, whereas features reduction can be achieved by feature selection. This paper aims to present a combined method for feature selection, where a filter based on correlation is applied on whole features set to find the relevant ones, and then, on these features a wrapper is applied in order to find the best features subset for a specified predictor. It is also presented a case study for a data set provided by TERAPERS a personalized speech therapy system.
机译:数据库中的知识发现旨在从大量数据中提取新的,有趣的和潜在的有用模式。这是一个复杂的过程,其中心点是数据挖掘,可以从数据有效地构建模型。数据类型,质量和维数是影响数据挖掘任务性能的一些因素。由于数据的高维度可能会引起一些麻烦,因为数据过载,因此可能的解决方案是减少它。采样和过滤减少了数据集中的案例数量,而特征减少可以通过特征选择来实现。本文旨在提出一种组合的特征选择方法,其中将基于相关性的过滤器应用于整个特征集以找到相关特征,然后在这些特征上应用包装器以找到最佳特征子集。指定的预测变量。还提供了TERAPERS个性化语音治疗系统提供的数据集的案例研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号