首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Feature subset selection and feature ranking for multivariate time series
【24h】

Feature subset selection and feature ranking for multivariate time series

机译:多元时间序列的特征子集选择和特征排名

获取原文
获取原文并翻译 | 示例
           

摘要

Feature subset selection (FSS) is a known technique to preprocess the data before performing any data mining tasks, e.g., classification and clustering. FSS provides both cost-effective predictors and a better understanding of the underlying process that generated the data. We propose a family of novel unsupervised methods for feature subset selection from multivariate time series (MTS) based on common principal component analysis, termed CLeVer. Traditional FSS techniques, such as recursive feature elimination (RFE) and Fisher criterion (FC), have been applied to MTS data sets, e.g., brain computer interface (BCI) data sets. However, these techniques may lose the correlation information among features, while our proposed techniques utilize the properties of the principal component analysis to retain that information. In order to evaluate the effectiveness of our selected subset of features, we employ classification as the target data mining task. Our exhaustive experiments show that CLeVer outperforms RFE, FC, and random selection by up to a factor of two in terms of the classification accuracy, while taking up to 2 orders of magnitude less processing time than RFE and FC.
机译:特征子集选择(FSS)是在执行任何数据挖掘任务(例如分类和聚类)之前预处理数据的已知技术。 FSS既提供了具有成本效益的预测器,又提供了对生成数据的基本过程的更好理解。我们提出了一系列基于通用主成分分析的从多元时间序列(MTS)进行特征子集选择的新颖无监督方法,称为CLeVer。诸如递归特征消除(RFE)和Fisher准则(FC)之类的传统FSS技术已应用于MTS数据集,例如脑计算机接口(BCI)数据集。但是,这些技术可能会丢失特征之间的相关性信息,而我们提出的技术会利用主成分分析的属性来保留该信息。为了评估所选特征子集的有效性,我们将分类用作目标数据挖掘任务。我们的详尽实验表明,就分类精度而言,CLeVer的性能比RFE,FC和随机选择高出2倍,而处理时间比RFE和FC少2个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号