首页> 外文会议>SoutheastCon >An empirical analysis of feature engineering for predictive modeling

【24h】

An empirical analysis of feature engineering for predictive modeling

机译：预测建模特征工程的实证分析

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Machine learning models, such as neural networks, decision trees, random forests and gradient boosting machines accept a feature vector and provide a prediction. These models learn in a supervised fashion where a set of feature vectors with expected output is provided. It is very common practice to engineer new features from the provided feature set. Such engineered features will either augment, or replace portions of the existing feature vector. These engineered features are essentially calculated fields, based on the values of the other features. Engineering such features is primarily a manual, time-consuming task. Additionally, each type of model will respond differently to different types of engineered features. This paper reports on empirical research to demonstrate what types of engineered features are best suited to which machine learning model type. This is accomplished by generating several datasets that are designed to benefit from a particular type of engineered feature. The experiment demonstrates to what degree the machine learning model is capable of synthesizing the needed feature on its own. If a model is capable of synthesizing an engineered feature, it is not necessary to provide that feature. The research demonstrated that the studied models do indeed perform differently with various types of engineered features.

机译：机器学习模型，如神经网络，决策树，随机森林和梯度升压机接受特征向量并提供预测。这些模型以监督方式学习，其中提供了一组具有预期输出的特征向量。从提供的功能集中创造新功能是非常常见的做法。这些工程特征将增加或替换现有特征向量的部分。这些工程特征基本上是基于其他功能的值计算的字段。工程这些功能主要是手动，耗时的任务。此外，每种类型的模型都会对不同类型的工程特征进行不同的反应。本文报告了实证研究，以展示什么类型的工程特征最适合其学习模型类型。这是通过生成多个数据集来实现的，该数据集旨在受益于特定类型的工程特征。实验表明，机器学习模型能够自己合成所需功能的程度。如果模型能够合成工程特征，则不必提供该功能。该研究表明，研究的模型确实与各种类型的工程特征不同。

著录项

来源
《SoutheastCon》|2016年|1 v.|共6页
会议地点
作者
Jeff Heaton;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Feature Engineering and Selection: A Practical Approach for Predictive Models [J] . Butcher Brandon, Smith Brian J. The American statistician . 2020,第3期

机译：功能工程和选择：预测模型的实用方法
2. Bayesian analysis of empirical software engineering cost models [J] . Chulani S., Boehm B., Steece B. IEEE Transactions on Software Engineering . 1999,第4期

机译：贝叶斯经验软件工程成本模型分析
3. Bayesian analysis of empirical software engineering cost models [J] . Chulani S., Boehm B. IEEE Transactions on Software Engineering . 1999,第4期

机译：贝叶斯经验软件工程成本模型分析
4. An empirical analysis of feature engineering for predictive modeling [C] . Jeff Heaton SoutheastCon . 2016

机译：用于预测建模的特征工程的经验分析
5. Computer modeling of protein tertiary structure and DNA binding energetics. I. Empirical free energy analysis of the engrailed Q50K variant-DNA complex and its mutants. II. The predicted structure of the adenovirus E4 orf6 protein by threading and comparative protein modeling. [D] . Brown, Lawrence Milton, III. 2001

机译：蛋白质三级结构和DNA结合能学的计算机建模。 I.陷入困境的Q50K变异体-DNA复合体及其突变体的经验自由能分析。二。通过穿线和比较蛋白建模预测腺病毒E4 orf6蛋白的结构。
6. Advanced Wastewater Treatment Engineering—Investigating Membrane Fouling in both Rotational and Static Membrane Bioreactor Systems Using Empirical Modelling [O] . Parneet Paul, Franck Anderson Jones 2016

机译：先进的废水处理工程-使用经验模型研究旋转和静态膜生物反应器系统中的膜污染
7. An Empirical Analysis of Feature Engineering for Predictive Modeling [O] . Heaton, Jeff 2017

机译：预测模型特征工程的实证分析
8. An Empirical Density Model for Predicting Low-Altitude Satellite Ephemerides. Part 1. Data Analysis and Model Formulation. [R] . forbes, j. m. gillette, d. f. 1974

机译：一种预测低空卫星星历的经验密度模型。第1部分。数据分析和模型制定。

An empirical analysis of feature engineering for predictive modeling

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅