Application Research of KNN Algorithm Based on Clustering in Big Data Talent Demand Information Classification

Xiao Qingtao; Zhong Xin; Zhong Chenghua

首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >Application Research of KNN Algorithm Based on Clustering in Big Data Talent Demand Information Classification

【24h】

Application Research of KNN Algorithm Based on Clustering in Big Data Talent Demand Information Classification

机译：KNN算法在大数据人才需求信息分类中基于集群的应用研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the growth of massive data in the current mobile Internet, network recruitment is gradually growing into a new recruitment channel. How to effectively mine available information in the massive network recruitment data has become the technical bottleneck of current education and social supply and demand development. The renewal of talent demand information is carried out every day, which produces a large amount of text data. How to manage these talents' demand information reasonably becomes more and more important. Artificial classification is time-consuming and laborious, which is unrealistic naturally. Therefore, using automatic text categorization technology to classify and manage this information becomes particularly important. To break through the bottleneck of this technology, a heuristic KNN text categorization algorithm based on ABC (artificial bee colony) is proposed to adjust the weight of features, and the similarity between test observation and training observation is measured by using the method of fuzzy distance measurement. Firstly, the recruitment information is segmented and feature selection and noise data elimination are carried out by using term frequency-inverse document frequency (TF-IDF) algorithm and AP (affinity propagation) clustering algorithm. Finally, the text information is classified by using KNN algorithm combined with heuristic search and fuzzy distance measurement. The experimental results show that this method effectively solves the problem of poor stability and low classification accuracy of traditional KNN algorithm in text categorization method for talent demand.

机译：随着当前移动互联网中大规模数据的增长，网络招聘逐渐发展到新的招聘渠道中。如何有效地在大规模网络招聘数据中获得可用信息已成为当前教育和社会供需发展的技术瓶颈。每天进行人才需求信息的更新，这产生了大量的文本数据。如何管理这些人才的需求信息合理变得越来越重要。人工分类是耗时和费力的，自然是不切实际的。因此，使用自动文本分类技术来分类和管理此信息变得尤为重要。为了突破该技术的瓶颈，提出了一种基于ABC（人造蜂菌落）的启发式KNN文本分类算法，调整特征的重量，通过使用模糊距离的方法测量试验观察和训练观察之间的相似性测量。首先，通过使用术语频率 - 逆文档频率（TF-IDF）算法和AP（亲和传播）聚类算法来执行招聘信息并进行特征选择和噪声数据消除。最后，通过使用KNN算法与启发式搜索和模糊距离测量相结合进行文本信息。实验结果表明，该方法有效解决了人才需求文本分类方法中传统KNN算法稳定性差和低分类精度的问题。

著录项

来源
《International Journal of Pattern Recognition and Artificial Intelligence》 |2020年第6期|2050015.1-2050015.18|共18页
作者
Xiao Qingtao; Zhong Xin; Zhong Chenghua;
展开▼
作者单位

Army Mil Univ Vocat Educ Ctr Chongqing Peoples R China;

Chongqing Technol & Business Univ Mental Hlth Educ & Counseling Ctr Chongqing Peoples R China;

Chongqing Technol & Business Univ Coll Environm & Resources Chongqing Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Clustering; KNN algorithm; artificial intelligence; information classification;

机译：聚类;knn算法;人工智能;信息分类;

相似文献

外文文献
中文文献
专利

1. Automatic fast double KNN classification algorithm based on ACC and hierarchical clustering for big data [J] . Li Haiyun, Li Haifeng, Wei Kaibin International journal of communication systems . 2018,第16期

机译：基于ACC和层次聚类的大数据自动快速双KNN分类算法。
2. Medical Health Big Data Classification Based on KNN Classification Algorithm [J] . Xing Wenchao, Bei Yilin Quality Control, Transactions . 2020,第期

机译：基于KNN分类算法的医疗健康大数据分类
3. Data security rules/regulations based classification of file data using TsF-kNN algorithm [J] . Zardari Munwar Ali, Jung Low Tang Cluster computing . 2016,第1期

机译：使用TsF-kNN算法基于数据安全规则/法规的文件数据分类
4. Research on the high robustness data classification and the mining algorithm based on hierarchical clustering and KNN [C] . Haohang Li, Shen Wang, Rui Tang International Conference on Communication and Electronics Systems . 2016

机译：基于层次聚类和KNN的高鲁棒性数据分类及挖掘算法研究
5. Clustering algorithms, classification algorithms and their applications in medical databases. [D] . Baddam, Sudheer R. 2005

机译：聚类算法，分类算法及其在医学数据库中的应用。
6. Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data [O] . Saurav Mallik, Zhongming Zhao -1

机译：基于图和规则的学习算法：使用基因组数据全面审查其在癌症类型分类和预后中的应用
7. An Improved KNN Text Classification Algorithm Based on Clustering [O] . Shixiong Xia, Youwen Li, Yong Zhou 2009

机译：基于聚类的改进的KNN文本分类算法
8. Application of Cluster Analysis to Aerometric Data. Volume I. Part 1: Clustering, Validation, and Classification of Data. Part 2: Investigation and Report of Cluster Analysis [R] . Crutcher, H. L. , Nelson, C. , Fairbairn, B. , 1980

机译：聚类分析在航空数据中的应用。第一部分：数据的聚类，验证和分类。第2部分：聚类分析的调查和报告

Application Research of KNN Algorithm Based on Clustering in Big Data Talent Demand Information Classification

摘要

著录项

相似文献

相关主题

期刊订阅