Application of k- Nearest Neighbour Classification in Medical Data Mining

Hassan Shee Khamis; Kipruto W. Cheruiyot; Stephen Kimani

首页> 外文期刊>International Journal of Information and Communication Technology Research >Application of k- Nearest Neighbour Classification in Medical Data Mining

【24h】

Application of k- Nearest Neighbour Classification in Medical Data Mining

机译：k最近邻分类法在医学数据挖掘中的应用

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Medical data is an ever-growing source of information from hospitals in form of patient records. When mined, the information hidden in these records is a huge resource bank for medical research. This data contains hidden patterns and relationships, which can lead to better diagnosis. Unfortunately, discovery of these patterns and relationships often goes unexploited. Studies have been carried out in medical diagnosis to predict heart diseases, lungs diseases, and various tumors based on the past data collected from patients. However, they are mostly limited to domain-specific systems that predict diseases restricted to their area of operations. In retrospect, the performance of the k-nearest neighborhoods (k-NN) classifier is highly dependent on the distance metric used to identify the k nearest neighbors of the query points. The standard Euclidean distance is commonly used in practice. This study uses vast storage of information so that diagnosis based on historical data can be made. It focuses on computing the probability of occurrence of a particular ailment by using a unique algorithm. This k-NN algorithm increases the accuracy of such diagnosis. The algorithm can be used to enhance the automated diagnoses, which include diagnosis of multiple diseases showing similar symptoms. To validate the experimental results, a hypothesis was tested for the following variables: accidents, age, allergies, blood pressure, smoking habit, total cholesterol, diabetes and hypertension, family history of heart disease, obesity, and lack of physical activity. It was evident that there was a strong relationship between the above variables to the causes of common chronic diseases like: heart ailment, diabetes and cancer.

机译：医疗数据是医院不断增加的以病历形式提供的信息来源。开采时，隐藏在这些记录中的信息是用于医学研究的巨大资源库。此数据包含隐藏的模式和关系，可以导致更好的诊断。不幸的是，这些模式和关系的发现常常得不到利用。已经基于从患者收集的过去数据在医学诊断中进行了研究以预测心脏病，肺部疾病和各种肿瘤。但是，它们大多限于特定领域的系统，这些系统可以预测局限于其手术区域的疾病。回想起来，第k个最近邻（k-NN）分类器的性能高度依赖于用于确定查询点的第k个最近邻居的距离度量。在实践中通常使用标准的欧几里得距离。这项研究使用了大量的信息存储，因此可以基于历史数据进行诊断。它着重于通过使用独特的算法来计算发生特定疾病的概率。这种k-NN算法提高了这种诊断的准确性。该算法可用于增强自动诊断，包括对显示相似症状的多种疾病进行诊断。为了验证实验结果，对以下变量进行了假设检验：事故，年龄，过敏，血压，吸烟习惯，总胆固醇，糖尿病和高血压，心脏病家族史，肥胖症和缺乏体育锻炼。显然，上述变量与诸如心脏病，糖尿病和癌症等常见慢性疾病的原因之间存在很强的关系。

著录项

来源
《International Journal of Information and Communication Technology Research》 |2014年第4期|共页
作者
Hassan Shee Khamis; Kipruto W. Cheruiyot; Stephen Kimani;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A Review of classification in Web Usage Mining using K- Nearest Neighbour [J] . Manisha Kumari, Sarita Soni Advances in computational sciences and technology . 2017,第5a5a期

机译：使用K最近邻的Web用法挖掘中的分类综述
2. Augmentation Of A Nearest Neighbour Clustering Algorithm With A Partial Supervision Strategy For Biomedical Data Classification [J] . Sameh A. Salem, Nancy M. Salem, Asoke K. Nandi Expert Systems . 2009,第1期

机译：基于部分监督策略的生物医学数据分类的最近邻聚类算法增强
3. Predicting porosity, permeability and water saturation applying an optimized nearest-neighbour, machine-learning and data-mining network of well-log data [J] . Journal of Petroleum Science & Engineering . 2020,第期

机译：预测孔隙度，渗透性和水饱和度，应用优化的最近邻，机器 - 学习和良好的井口数据挖掘网络
4. Classification of Domestic Burning Smell using Covariance k- Nearest Neighbour Algorithm for Early Fire Detection Application [C] . Allan M. Andrew, Kamarulzaman Kamarudin, Syed M. Mamduh, International conference on environmental odour monitoring and control . 2014

机译：协方差k-最近邻算法在家庭燃烧气味分类中的应用
5. Comparative classification of prostate cancer data using the Support Vector Machine, Random Forest, DualKS and k-Nearest Neighbours. [D] . Sakouvogui, Kekoura. 2015

机译：使用支持向量机，Random Forest，DualKS和k-Nearest邻居对前列腺癌数据进行比较分类。
6. Love Thy Neighbour: Automatic Animal Behavioural Classification of Acceleration Data Using the K-Nearest Neighbour Algorithm [O] . Owen R. Bidder, Hamish A. Campbell, Agustina Gómez-Laich, 2010

机译：爱你的邻居：使用K最近邻居算法对加速度数据进行自动动物行为分类
7. Love thy neighbour: automatic animal behavioural classification of acceleration data using the K-nearest neighbour algorithm. [O] . Owen R Bidder, Hamish A Campbell, Agustina Gómez-Laich, 2014

机译：爱你的邻居：使用K最近邻算法对加速度数据进行自动动物行为分类。

Application of k- Nearest Neighbour Classification in Medical Data Mining

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅