SMOTE Approach to Imbalanced Dataset in Logistic Regression Analysis

机译：在逻辑回归分析中阐明了不平衡数据集的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Logistic regression is a classification model that is commonly used in bankruptcy studies. The classifier works well when data is balanced. However, imbalanced data set is found in almost all bankruptcy studies. The most common approach to deal with imbalanced data set is by selecting and matching the samples from both bankrupt and non-bankrupt samples. The problem of imbalanced data and the approach taken to deal with it can affect a good predictive model. The objective of the study is to improve the classification accuracy of a logit model when data is heavily loaded to one side. The approach taken is by using SMOTE sampling. The study used SMEs categorized under the accommodation and food service activities, and the hotel sector. There are 14 explanatory variables involved. The result from this study confirmed that the AUC and sensitivity values from SMOTE Logistic Regression (SLR) model is higher than the AUC and sensitivity values of a logit model.

机译：Logistic回归是一个常用于破产研究的分类模型。当数据平衡时，分类器运行良好。但是，在几乎所有破产研究中都发现了不平衡的数据集。处理不平衡数据集的最常见方法是通过选择和匹配来自破产和非破产样本的样本。数据不平衡的问题和对处理它的方法可能会影响一个良好的预测模型。该研究的目的是当数据大量加载到一侧时，提高Logit模型的分类准确性。采取的方法是通过使用Smote采样。研究使用中小企业分类为住宿和食品服务活动，以及酒店部门。有14个解释性变量涉及。本研究的结果证实，来自Smote Logistic回归（SLR）模型的AUC和敏感值高于Logit模型的AUC和灵敏度值。

著录项

来源
《International Conference on Computing, Mathematics and Statistics》|2019年|595p|共5页
会议地点
作者
Amirah Hazwani Abdul Rahim; Nurazlina Abdul Rashid; Asmahani Nayan; Abd-Razak Ahmad;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 O1-53;
关键词
Imbalanced data; SMOTE sampling; Logistic regression;

机译：不平衡的数据;笑容抽样;Logistic回归;

相似文献

外文文献
中文文献
专利

1. SMOTE bagging algorithm for imbalanced dataset in logistic regression analysis (case: credit of bank X) [J] . Fithria Siti Hanifah, Hari Wijayanto, Anang Kurnia Applied mathematical sciences . 2015,第138期

机译：Logistic回归分析中不平衡数据集的SMOTE套袋算法（案例：银行X的贷方）
2. Analysis of SMOTE: Modified for Diverse Imbalanced Datasets Under the IoT Environment [J] . Ankita Bansal, Makul Saini, Rakshit Singh, International journal of information retrieval research . 2021,第2期

机译：SMOTE分析：在物联网环境下为各种不平衡数据集进行修改
3. OVERSAMPLING METHOD TO HANDLING IMBALANCED DATASETS PROBLEM IN BINARY LOGISTIC REGRESSION ALGORITHM [J] . Windyaning Ustyannie, S Suprapto Indonesian Journal of Computing and Cybernetics Systems . 2020,第1期

机译：二元Logistic回归算法中处理不平衡数据集问题的抽样方法。
4. SMOTE Approach to Imbalanced Dataset in Logistic Regression Analysis [C] . Amirah Hazwani Abdul Rahim, Nurazlina Abdul Rashid, Asmahani Nayan, International Conference on Computing, Mathematics and Statistics . 2019

机译：在逻辑回归分析中阐明了不平衡数据集的方法
5. Worst Case Datasets for Solving Binary Logistic Regression Via Deterministic First-order Methods [D] . ?Squires, Trevor 2020

机译：最坏情况数据集解决二元 Logistic回归通过确定性一阶方法
6. Machine-Learning Approach to Optimize SMOTE Ratio in Class Imbalance Dataset for Intrusion Detection [O] . Jae-Hyun Seo, Yong-Hyuk Kim 2018

机译：机器学习方法用于在类别不平衡数据集中优化SMOTE比率以进行入侵检测
7. Affinity Propagation SMOTE approach for Imbalanced dataset used in Predicting Student at Risk of Low Performance [O] . B. Laureano Lanie 2020

机译：亲和力传播击打了用于预测学生的不平衡数据集的方法，以低表现

SMOTE Approach to Imbalanced Dataset in Logistic Regression Analysis

摘要

著录项

相似文献

相关主题

期刊订阅