首页> 外文期刊>Computational statistics >Supervised classification of curves via a combined use of functional data analysis and tree-based methods
【24h】

Supervised classification of curves via a combined use of functional data analysis and tree-based methods

机译:通过结合使用功能数据分析和基于树的方法对曲线进行监督分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Technological advancement led to the development of tools to collect vast amounts of data usually recorded at temporal stamps or arriving over time, e.g. data from sensors. Common ways of analysing this kind of data also involve supervised classification techniques; however, despite constant improvements in the literature, learning from high-dimensional data is always a challenging task due to many issues such as, for example, dealing with the curse of dimensionality and looking for a trade-off between complexity and accuracy. Nowadays, research in functional data analysis (FDA) and statistical learning is very lively to address these drawbacks adequately. This study offers a supervised classification strategy that combines FDA and tree-based procedures. Specifically, we introduce functional classification trees, functional bagging, and functional random forest exploiting the functional principal components decomposition as a tool to extract new features and build functional classifiers. In addition, we introduce new tools to support the understanding of the classification rules, such as the functional empirical separation prototype, functional predicted separation prototype, and the leaves' functional deviance. Furthermore, we suggest some possible solutions for choosing the number of functional principal components and functional classification trees to be implemented in the supervised classification procedure. This research aims to provide an approach to improve the accuracy of the functional classifier, serve the interpretation of the functional classification rules, and overcome the classical drawbacks due to the high-dimensionality of the data. An application on a real dataset regarding daily electrical power demand shows the functioning of the supervised classification proposal. A simulation study with nine scenarios highlights the performance of this approach and compares it with other functional classification methods. The results demonstrate that this line of research is exciting and promising; indeed, in addition to the benefits of the suggested interpretative tools, we exceed the previously established accuracy records on a dataset available online.
机译:技术进步导致了工具的发展,以收集通常记录在时间邮票上或随时间推移到达的大量数据,例如来自传感器的数据。分析此类数据的常用方法还涉及监督分类技术;然而,尽管文献不断改进,但由于许多问题,例如处理维度的诅咒以及寻找复杂性和准确性之间的权衡,从高维数据中学习始终是一项具有挑战性的任务。如今,功能数据分析(FDA)和统计学习的研究非常活跃,可以充分解决这些缺点。本研究提供了一种结合了 FDA 和基于树的程序的监督分类策略。具体而言,我们引入了功能分类树、功能袋和功能随机森林,利用功能主成分分解作为提取新特征和构建功能分类器的工具。此外,我们还引入了新的工具来支持对分类规则的理解,例如功能经验分离原型、功能预测分离原型和叶子的功能偏差。此外,我们提出了一些可能的解决方案,用于选择要在监督分类过程中实现的功能主成分和功能分类树的数量。本研究旨在为提高函数分类器的准确率、服务于函数分类规则的解释以及克服数据高维性导致的经典缺陷提供一种方法。关于日常电力需求的真实数据集上的应用程序显示了监督分类提案的功能。通过9种情景的仿真研究,重点介绍了该方法的性能,并将其与其他功能分类方法进行了比较。结果表明,这一系列研究令人兴奋和有希望;事实上,除了建议的解释工具的好处外,我们还超越了以前在网上可用的数据集上建立的准确性记录。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号