首页> 外文会议>International Conference on Web Engineering >A Matter of Words: NLP for Quality Evaluation of Wikipedia Medical Articles
【24h】

A Matter of Words: NLP for Quality Evaluation of Wikipedia Medical Articles

机译:一个单词问题:Wikipedia医疗文章的质量评估NLP

获取原文

摘要

Automatic quality evaluation of Web information is a task with many fields of applications and of great relevance, especially in critical domains, like the medical one. We move from the intuition that the quality of content of medical Web documents is affected by features related with the specific domain. First, the usage of a specific vocabulary (Domain Informativeness); then, the adoption of specific codes (like those used in the infoboxes of Wikipedia articles) and the type of document (e.g., historical and technical ones). In this paper, we propose to leverage specific domain features to improve the results of the evaluation of Wikipedia medical articles, relying on Natural Language Processing (NLP) and dictionaries-based techniques. The results of our experiments confirm that, by considering domain-oriented features, it is possible to improve existing solutions, mainly with those articles that other approaches have less correctly classified.
机译:Web信息的自动质量评估是具有许多应用领域的任务,以及具有很大的相关性,特别是在关键域中,如医疗域名。我们从直觉移动,医疗Web文档的内容质量受与特定域相关的功能的影响。首先,使用特定词汇(域信息性);然后,采用特定代码(如Wikipedia文章的信息框中使用的代码)和文件类型(例如,历史和技术人员)。在本文中,我们建议利用特定的域特征来改善维基百科医学文章评估结果,依赖于自然语言处理(NLP)和基于词典的技术。我们的实验结果证实,通过考虑面向域的特征,可以改善现有解决方案,主要是与其他方法较少正确分类的物品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号