Imputation Analysis of Central Tendencies for Classification

Abstract

In real-world datasets, missing values are very common. Most machine learning algorithms cannot work with missing values, so these must be handled before training the model. It is common practice to impute missing values with a measure of central tendency (mean, median, or mode), but choosing among them is not easy. This paper analyzes the impact of using each measure for different data distributions. Skewness and the presence of outliers are considered when selecting the data for analysis. Certain presumptions are made before the examination, and performance metrics such as accuracy, AUC-ROC, precision, recall, and F1 score are analyzed to prove or disprove those assumptions.
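As a rough illustration of why the choice matters, the sketch below imputes missing entries in a single numeric column with each central tendency. It is not the paper's experimental setup: the `impute` helper and the `feature` values are hypothetical, chosen only to show how one outlier pulls the mean far from the median and mode.

```python
from statistics import mean, median, mode

def impute(values, strategy):
    """Replace None entries with the chosen central tendency of the observed values."""
    observed = [v for v in values if v is not None]
    # Map the strategy name to the matching stdlib function.
    fill = {"mean": mean, "median": median, "mode": mode}[strategy](observed)
    return [fill if v is None else v for v in values]

# Hypothetical right-skewed feature: one outlier (500) and two missing entries.
feature = [10, 12, 12, 13, None, 15, 500, None]

for strategy in ("mean", "median", "mode"):
    print(strategy, impute(feature, strategy))
```

With these made-up values the mean fill lands well above every typical observation, while the median and mode stay close to the bulk of the data. This is the kind of behaviour on skewed or outlier-heavy data that the abstract says the paper examines.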

Bibliographic details

Source
IEEE International IOT, Electronics and Mechatronics Conference | 2021 | pp. 1-7 | 7 pages
Authors
Ramprakash Pavithrakannan; Nikitta Baker Fenn; Sriram Raman; Varadharajan Kalyanaraman; Vignesh Kumar Murugananthan; Jeevanandham Janarthanan
Original format
PDF
Keywords
Training; Measurement; Mechatronics; Machine learning algorithms; Conferences

