首页> 外文会议>The 14th International Conference on QiR >The impact of different fold for cross validation of missing values imputation method on hepatitis dataset

【24h】

The impact of different fold for cross validation of missing values imputation method on hepatitis dataset

机译：不同倍数对遗漏值插补方法的交叉验证对肝炎数据集的影响

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Hepatitis is a liver disease caused by hepatitis viruses. Nowadays, hepatitis is a global health problem, including in Indonesia. Chronic hepatitis can lead to cirrhosis and liver cancer, therefore early diagnosis is needed. Several research works on development of computer aided systems have been conducted to improve the diagnosis process of hepatitis disease. California Irvine (UCI) machine-learning repository provides hepatitis disease dataset which can be publicly accessed; however, the dataset contains many missing values. The existing of missing values in the dataset may affect the quality of the results analysis. Therefore, it needs to be conducted for handling the missing values. This paper analyses the performance of applying varied number of fold for cross validation of missing values imputation methods. The imputation method is combined with the feature selection method and machine-learning algorithm on the hepatitis dataset. The results that varied fold in k-fold cross validation which applied in the imputation method does not reveal significant advantages.

机译：肝炎是由肝炎病毒引起的肝脏疾病。如今，肝炎已成为全球性健康问题，包括印度尼西亚在内。慢性肝炎可导致肝硬化和肝癌，因此需要早期诊断。已经进行了一些关于计算机辅助系统开发的研究工作，以改善肝炎疾病的诊断过程。加州尔湾（UCI）机器学习存储库提供了可以公开访问的肝炎疾病数据集；但是，数据集包含许多缺失值。数据集中缺失值的存在可能会影响结果分析的质量。因此，需要进行处理缺失值。本文分析了应用不同倍数进行缺失值插补方法的交叉验证的性能。在肝炎数据集上，将归因方法与特征选择方法和机器学习算法相结合。在插补方法中应用的k倍交叉验证中变化倍数的结果并没有显示出明显的优势。

著录项

来源
《The 14th International Conference on QiR》|2015年|51-55|共5页
会议地点 Lombok(ID)
作者
Tri Astuti; Hanung Adi Nugroho; Teguh Bharata Adji;
展开▼
作者单位

Dept. of Inf. Eng., STMIK Amikom Purwokerto, Purwokerto, Indonesia;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
hepatitis; imputation method; k-fold cross validation; missing values; varied fold;

机译：肝炎;输入法; k倍交叉验证;缺失值;变异倍数;
入库时间 2022-08-26 14:26:49

相似文献

外文文献
中文文献
专利

1. Comparison of Interpolation, Statistical, and Data-Driven Methods for Imputation of Missing Values in a Distributed Soil Moisture Dataset [J] . Kurt Kornelsen, Paulin Coulibaly Journal of hydrologic engineering . 2014,第1期

机译：插值，统计和数据驱动方法在分布式土壤水分数据集中估算缺失值的比较
2. PLS/OPLS models in metabolomics: the impact of permutation of dataset rows on the K-fold cross-validation quality parameters [J] . Mohamed N. Triba, Laurence Le Moyec, Roland Amathieu, Molecular BioSystems . 2015,第1期

机译：代谢组学中的PLS / OPLS模型：数据集行排列对K折交叉验证质量参数的影响
3. Study on the Impact of Partition-Induced Dataset Shift on $k$-Fold Cross-Validation [J] . Moreno-Torres J. G., Saez J. A., Herrera F. Neural Networks and Learning Systems, IEEE Transactions on . 2012,第8期

机译：分区诱导的数据集移位对$ k $ -fold交叉验证的影响研究
4. The impact of different fold for cross validation of missing values imputation method on hepatitis dataset [C] . Tri Astuti, Hanung Adi Nugroho, Teguh Bharata Adji Asian Symposium on Material Processing . 2015

机译：不同折叠对缺失值归零方法对肝炎数据集的影响的影响
5. Methodological and clinical issues in analysis of data from HIV cardiovascular research: Validity of ultrasound methods, impact of anti-retroviral therapy on atherosclerosis, and imputation of missing values. [D] . Odueyungbo, Adefowope. 2010

机译：HIV心血管研究数据分析中的方法学和临床问题：超声方法的有效性，抗逆转录病毒疗法对动脉粥样硬化的影响以及缺失值的归因。
6. The ability of different imputation methods for missing values in mental measurement questionnaires [O] . Xueying Xu, Leizhen Xia, Qimeng Zhang, 2020

机译：心理测量问卷中不同插补方法对缺失值的处理能力
7. The impact of imputation procedures with machine learning methods on the performance of classifiers: An application to coronary artery disease data including missing values [O] . Jale Bektas, Turgay Ibrikci, Ismail Turkay Ozcan 2018

机译：用机器学习方法对分类器性能的估算方法的影响：冠状动脉疾病数据的应用，包括缺失值

The impact of different fold for cross validation of missing values imputation method on hepatitis dataset

摘要

著录项

相似文献

相关主题

期刊订阅