首页> 外文期刊>Procedia Computer Science >A Context-Aware Approach for Extracting Hard and Soft Skills
【24h】

A Context-Aware Approach for Extracting Hard and Soft Skills

机译:提取硬和软技能的背景感知方法

获取原文
           

摘要

The continuous growth in the online recruitment industry has made the candidate screening process costly, labour intensive, and time-consuming. Automating the screening process would expedite candidate selection. In recent times, recruiting is moving towards skill-based recruitment where candidates are ranked according to the number of skills, skill’s competence level and skill’s experience. Therefore it is important to create a system which can accurately and automatically extract hard and soft skills from candidates’ resume and job descriptions. The task is less complex for hard skills which in some cases could be named entities but much more challenging for soft skills which may appear in different linguistic forms depending on the context. In this paper, we propose a context-aware sequence classification and token classification model for extracting both hard and soft skills. We utilized the most recent state-of-the-art word embedding representations as textual features for various machine learning classifiers. The models have been validated by evaluating them on a publicly available job description dataset. Our results indicated that the best performing sequence classification model used BERT embeddings in addition with POS and DEP tags as input for a logistic regression classifier. The best performing token classification model used fine-tuned BERT embeddings with a support vector machine classifier.
机译:在线招聘行业的持续增长使候选人筛选过程昂贵,劳动密集,耗时。自动化筛选过程将加快候选人选择。最近,招聘正在向基于技能的招聘招聘,候选人根据技能人数,技能的能力水平和技能体验排名。因此,重要的是创建一个系统可以准确,并自动从候选者的恢复和职位描述中提取硬和软技能。对于艰难技能而言,该任务不太复杂,在某些情况下,在某些情况下可能被命名为实体,但对软技能更具挑战性,这可能根据上下文出现不同的语言形式。在本文中,我们提出了一种背景感知序列分类和令牌分类模型,用于提取硬和软技能。我们利用最近最先进的单词嵌入表示作为各种机器学习分类器的文本特征。通过在公开的作业描述数据集上评估它们来验证这些模型。我们的结果表明,最佳执行序列分类模型除了POS和DEP标签之外还使用BERT Embeddings作为Logistic回归分类器的输入。最佳性能的令牌分类模型使用具有支持向量机分类器的微调BERT嵌入式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号