首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >Application Research of KNN Algorithm Based on Clustering in Big Data Talent Demand Information Classification
【24h】

Application Research of KNN Algorithm Based on Clustering in Big Data Talent Demand Information Classification

机译:KNN算法在大数据人才需求信息分类中基于集群的应用研究

获取原文
获取原文并翻译 | 示例
           

摘要

With the growth of massive data in the current mobile Internet, network recruitment is gradually growing into a new recruitment channel. How to effectively mine available information in the massive network recruitment data has become the technical bottleneck of current education and social supply and demand development. The renewal of talent demand information is carried out every day, which produces a large amount of text data. How to manage these talents' demand information reasonably becomes more and more important. Artificial classification is time-consuming and laborious, which is unrealistic naturally. Therefore, using automatic text categorization technology to classify and manage this information becomes particularly important. To break through the bottleneck of this technology, a heuristic KNN text categorization algorithm based on ABC (artificial bee colony) is proposed to adjust the weight of features, and the similarity between test observation and training observation is measured by using the method of fuzzy distance measurement. Firstly, the recruitment information is segmented and feature selection and noise data elimination are carried out by using term frequency-inverse document frequency (TF-IDF) algorithm and AP (affinity propagation) clustering algorithm. Finally, the text information is classified by using KNN algorithm combined with heuristic search and fuzzy distance measurement. The experimental results show that this method effectively solves the problem of poor stability and low classification accuracy of traditional KNN algorithm in text categorization method for talent demand.
机译:随着当前移动互联网中大规模数据的增长,网络招聘逐渐发展到新的招聘渠道中。如何有效地在大规模网络招聘数据中获得可用信息已成为当前教育和社会供需发展的技术瓶颈。每天进行人才需求信息的更新,这产生了大量的文本数据。如何管理这些人才的需求信息合理变得越来越重要。人工分类是耗时和费力的,自然是不切实际的。因此,使用自动文本分类技术来分类和管理此信息变得尤为重要。为了突破该技术的瓶颈,提出了一种基于ABC(人造蜂菌落)的启发式KNN文本分类算法,调整特征的重量,通过使用模糊距离的方法测量试验观察和训练观察之间的相似性测量。首先,通过使用术语频率 - 逆文档频率(TF-IDF)算法和AP(亲和传播)聚类算法来执行招聘信息并进行特征选择和噪声数据消除。最后,通过使用KNN算法与启发式搜索和模糊距离测量相结合进行文本信息。实验结果表明,该方法有效解决了人才需求文本分类方法中传统KNN算法稳定性差和低分类精度的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号