首页> 外文会议>International conference on service-oriented computing >RISE: Resolution of Identity Through Similarity Establishment on Unstructured Job Descriptions
【24h】

RISE: Resolution of Identity Through Similarity Establishment on Unstructured Job Descriptions

机译:RISE:通过对非结构化职位描述的相似性建立来解决身份识别问题

获取原文

摘要

Identity resolution of job description involving cross organizational data would go a long way in addressing several high valued business problems. Job data normalization/sanitation, automated creation of better job descriptions with context preference, description reuse and validation across different sources, semantic classification of jobs, routing of candidates to suitable jobs across different organization etc. are some of the business centric functionalities that can be efficiently built by resolving job description identities. Job descriptions are highly unstructured with free flow textual data consisting of lines describing important attributes of job requirements, like education, skills, experience, role, responsibility etc. Much of the problem is due to the highly unstructured nature of job descriptions. Further, the attributes that are representative of the information in a job description are not readily available from the description. Thus, the process of resolution involves deep data cleansing, classification, attributes identification, and building highly scalable similarity detection algorithms. In this paper, we propose RISE - that uses values of attributes in the underlying job description data and similarity observed in the attributes to resolve identities across organizations. It proposes classification followed by similarity establishment processes that eventually provides high quality of resolution. Through extensive experiments performed on corpus of job descriptions from several real world recruitment systems, we demonstrate that RISE can resolve the identities with high precision and recall.
机译:涉及跨组织数据的职位描述的身份解析对于解决一些高价值业务问题将大有帮助。作业数据规范化/卫生化,具有上下文首选项的更好的作业描述的自动创建,跨不同来源的描述重用和验证,作业的语义分类,候选人在不同组织中的路由到合适的作业等,都是可以实现的一些以业务为中心的功能通过解决职位描述身份有效地构建。职位描述是高度非结构化的,自由流动的文本数据由描述职位要求的重要属性的行组成,例如教育,技能,经验,角色,责任等。很多问题归因于职位描述的高度非结构化。此外,从描述中不容易获得代表工作描述中的信息的属性。因此,解决过程涉及深度数据清理,分类,属性识别以及构建高度可扩展的相似性检测算法。在本文中,我们提出了RISE-它使用基础工作描述数据中的属性值和在属性中观察到的相似性来解决跨组织的身份。它建议进行分类,然后进行相似性建立过程,最终提供高质量的分辨率。通过对来自多个实际招聘系统的职位描述语料库进行的广泛实验,我们证明RISE可以高精度地解决身份和召回问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号