首页> 美国卫生研究院文献>Bioinformatics >MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence
【2h】

MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence

机译:MeSHLabeler:通过整合各种证据来提高大规模MeSH索引的准确性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: Medical Subject Headings (MeSHs) are used by National Library of Medicine (NLM) to index almost all citations in MEDLINE, which greatly facilitates the applications of biomedical information retrieval and text mining. To reduce the time and financial cost of manual annotation, NLM has developed a software package, Medical Text Indexer (MTI), for assisting MeSH annotation, which uses k-nearest neighbors (KNN), pattern matching and indexing rules. Other types of information, such as prediction by MeSH classifiers (trained separately), can also be used for automatic MeSH annotation. However, existing methods cannot effectively integrate multiple evidence for MeSH annotation.>Methods: We propose a novel framework, MeSHLabeler, to integrate multiple evidence for accurate MeSH annotation by using ‘learning to rank’. Evidence includes numerous predictions from MeSH classifiers, KNN, pattern matching, MTI and the correlation between different MeSH terms, etc. Each MeSH classifier is trained independently, and thus prediction scores from different classifiers are incomparable. To address this issue, we have developed an effective score normalization procedure to improve the prediction accuracy.>Results: MeSHLabeler won the first place in Task 2A of 2014 BioASQ challenge, achieving the Micro F-measure of 0.6248 for 9,040 citations provided by the BioASQ challenge. Note that this accuracy is around 9.15% higher than 0.5724, obtained by MTI.>Availability and implementation: The software is available upon request.>Contact:
机译:>动机::国家医学图书馆(NLM)使用医学主题词(MeSH)来索引MEDLINE中的几乎所有引用,这大大促进了生物医学信息检索和文本挖掘的应用。为了减少手动注释的时间和财务成本,NLM开发了用于辅助MeSH注释的软件包Medical Text Indexer(MTI),该软件包使用k最近邻(KNN),模式匹配和索引规则。其他类型的信息(例如,由MeSH分类器进行的预测(单独训练))也可以用于自动MeSH注释。但是,现有方法不能有效地集成多个MeSH注释证据。>方法:我们提出了一种新颖的框架MeSHLabeler,该框架通过使用“学习排名”来集成多个证据以进行准确的MeSH注释。证据包括来自MeSH分类器,KNN,模式匹配,MTI以及不同MeSH术语之间的相关性等的大量预测。每个MeSH分类器都是独立训练的,因此来自不同分类器的预测得分无法比拟。为解决此问题,我们开发了有效的分数归一化程序以提高预测准确性。>结果:MeSHLabeler在2014年BioASQ挑战的任务2A中获得第一名,实现了0.6248的Micro F-measure BioASQ挑战赛提供了9,040条引用。请注意,此准确性比MTI获得的0.5724高9.15%左右。>可用性和实现:该软件可根据要求提供。>联系方式:

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号