首页> 外文会议>SoutheastCon >An empirical analysis of feature engineering for predictive modeling
【24h】

An empirical analysis of feature engineering for predictive modeling

机译:预测建模特征工程的实证分析

获取原文

摘要

Machine learning models, such as neural networks, decision trees, random forests and gradient boosting machines accept a feature vector and provide a prediction. These models learn in a supervised fashion where a set of feature vectors with expected output is provided. It is very common practice to engineer new features from the provided feature set. Such engineered features will either augment, or replace portions of the existing feature vector. These engineered features are essentially calculated fields, based on the values of the other features. Engineering such features is primarily a manual, time-consuming task. Additionally, each type of model will respond differently to different types of engineered features. This paper reports on empirical research to demonstrate what types of engineered features are best suited to which machine learning model type. This is accomplished by generating several datasets that are designed to benefit from a particular type of engineered feature. The experiment demonstrates to what degree the machine learning model is capable of synthesizing the needed feature on its own. If a model is capable of synthesizing an engineered feature, it is not necessary to provide that feature. The research demonstrated that the studied models do indeed perform differently with various types of engineered features.
机译:机器学习模型,如神经网络,决策树,随机森林和梯度升压机接受特征向量并提供预测。这些模型以监督方式学习,其中提供了一组具有预期输出的特征向量。从提供的功能集中创造新功能是非常常见的做法。这些工程特征将增加或替换现有特征向量的部分。这些工程特征基本上是基于其他功能的值计算的字段。工程这些功能主要是手动,耗时的任务。此外,每种类型的模型都会对不同类型的工程特征进行不同的反应。本文报告了实证研究,以展示什么类型的工程特征最适合其学习模型类型。这是通过生成多个数据集来实现的,该数据集旨在受益于特定类型的工程特征。实验表明,机器学习模型能够自己合成所需功能的程度。如果模型能够合成工程特征,则不必提供该功能。该研究表明,研究的模型确实与各种类型的工程特征不同。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号