Medical Health Big Data Classification Based on KNN Classification Algorithm

Xing Wenchao; Bei Yilin


Abstract

The rapid development of information technology is driving medical informatization toward intelligent systems. Medical health big data provides the basic data resource for intelligent medical services and smart healthcare, so classifying this data is of great significance for making medical information intelligent. Because of its simplicity, the KNN (K-Nearest Neighbor) classification algorithm has been widely used in many fields. However, when the sample size is large and the number of feature attributes is high, the classification efficiency of KNN drops sharply. This paper proposes an improved KNN algorithm and compares it with the traditional KNN algorithm. Classification is performed in the query-instance neighborhood of the conventional KNN classifier, and a weight is assigned to each class. The algorithm considers the class distribution around the query instance so that the assigned weights do not adversely affect outliers. To address the shortcomings of the traditional KNN algorithm on large data sets, this paper proposes an improved KNN algorithm based on cluster denoising and density cropping. The algorithm denoises the data by clustering and improves classification efficiency by speeding up the K-nearest-neighbor search, while maintaining the classification accuracy of KNN. The experimental results show that the proposed algorithm effectively improves the classification efficiency of KNN on large data sets while preserving its classification accuracy.
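The abstract does not give the weighting rule, so the following is only a minimal sketch of the general idea of weighted voting inside a query's K-nearest neighborhood. It assumes inverse-distance weights, a common scheme that may differ from the paper's class weights; the function name `knn_predict` and the toy data are hypothetical, and the cluster denoising and density cropping steps are not shown.

```python
import math
from collections import defaultdict


def knn_predict(train, query, k=3, weighted=True):
    """Classify `query` from a list of (features, label) pairs.

    With weighted=True each neighbor votes with weight 1 / (distance + eps).
    This inverse-distance rule is an assumption for illustration, not the
    weighting defined in the paper.
    """
    eps = 1e-9  # avoids division by zero when the query equals a sample
    # Brute-force search: distance from the query to every training sample.
    # This full scan is the step that becomes slow on large data sets.
    neighbors = sorted(
        ((math.dist(x, query), label) for x, label in train),
        key=lambda pair: pair[0],
    )[:k]
    # Accumulate one (optionally weighted) vote per neighbor, grouped by class.
    votes = defaultdict(float)
    for distance, label in neighbors:
        votes[label] += 1.0 / (distance + eps) if weighted else 1.0
    return max(votes, key=votes.get)


# Toy data: one "a" sample very close to the query, two "b" samples farther away.
train = [((0.0, 0.0), "a"), ((3.0, 0.0), "b"), ((0.0, 3.0), "b")]
query = (0.1, 0.1)
plain = knn_predict(train, query, k=3, weighted=False)
weighted = knn_predict(train, query, k=3, weighted=True)
print(plain, weighted)
```

On this toy data the unweighted majority vote and the weighted vote disagree, which illustrates why the choice of weights in the neighborhood matters.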


Bibliographic details

Source
Quality Control, Transactions | 2020, Issue 2020 | pp. 28808-28819 | 12 pages
Authors
Xing Wenchao; Bei Yilin;
Author affiliations

Jining Univ Sch Primary Educ Qufu 273100 Shandong Peoples R China;

Taishan Univ Sch Informat Sci & Technol Tai An 271000 Shandong Peoples R China;

Indexing information
Original format: PDF
Text language: English
Chinese Library Classification
Keywords
Improved KNN classifier; weighted KNN algorithm; cluster denoising; density cropping;


