首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Measuring semantic similarity of documents with weighted cosine and fuzzy logic
【24h】

Measuring semantic similarity of documents with weighted cosine and fuzzy logic

机译:用加权余弦和模糊逻辑测量文档的语义相似性

获取原文
获取原文并翻译 | 示例
       

摘要

Currently, the semantic analysis is used by different fields, such as information retrieval, the biomedical domain, and natural language processing. The primary focus of this research work is on using semantic methods, the cosine similarity algorithm, and fuzzy logic to improve the matching of documents. The algorithms were applied to plain texts in this case CVs (resumes) and job descriptions. Synsets of WordNet were used to enrich the semantic similarity methods such as the Wu-Palmer Similarity (WUP), Leacock-Chodorow similarity (LCH), and path similarity (hypernym/hyponym). Additionally, keyword extraction was used to create a postings list where keywords were weighted. The task of recruiting new personnel in the companies that publish job descriptions and reciprocally finding a company when workers publish their resumes is discussed in this research work. The creation of a new gold standard was required to achieve a comparison of the proposed methods. A web application was designed to match the documents manually, creating the new gold standard. Thereby the new gold standard confirming benefits of enriching the cosine algorithm semantically. Finally, the results were compared with the new gold standard to check the efficiency of the new methods proposed. The measures used for the analysis were precision, recall, and f-measure, concluding that the cosine similarity weighted semantically can be used to get better similarity scores.
机译:目前,不同的字段使用语义分析,例如信息检索,生物医学域和自然语言处理。本研究工作的主要重点是使用语义方法,余弦相似性算法和模糊逻辑来改善文档的匹配。在这种情况下,将算法应用于纯文本CVS(恢复)和作业描述。 WordNet的统一用于丰富语义相似性方法,例如Wu-Palmer相似性(WUP),LeaCock-Chodorow相似性(LCH)和路径相似度(HyperNym /以下)。此外,关键字提取用于创建重量关键字的邮寄列表。在本研究工作中讨论了当工人发布其恢复时,在发布职位描述和相互寻找公司的公司中招聘新人的任务。需要创建新的黄金标准来实现所提出的方法的比较。 Web应用程序旨在手动匹配文档,从而创建新的黄金标准。因此,新的黄金标准通过语义上证实了丰富余弦算法的益处。最后,将结果与新的金标准进行了比较,以检查提出的新方法的效率。用于分析的措施是精确的,召回和F测量,得出结论,即语义上的余弦相似度可用于获得更好的相似性分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号