首页> 外文期刊>Journal of Biomedical Semantics >Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
【24h】

Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature

机译:预测投机:生物医学文献中对冲检测的简单消歧方法

获取原文
           

摘要

Background This paper presents a novel approach to the problem of hedge detection , which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task 1 of the CoNLL-2010 Shared Task, which focuses on hedging in the biomedical domain. We here propose to view hedge detection as a simple disambiguation problem, restricted to words that have previously been observed as hedge cues. As the feature space for the classifier is still very large, we also perform experiments with dimensionality reduction using the method of random indexing . Results The SVM-based classifiers developed in this paper achieves the best published results so far for sentence-level uncertainty prediction on the CoNLL-2010 Shared Task test data. We also show that the technique of random indexing can be successfully applied for reducing the dimensionality of the original feature space by several orders of magnitude, without sacrificing classifier performance. Conclusions This paper introduces a simplified approach to detecting speculation or uncertainty in text, focusing on the biomedical domain. Evaluated at the sentence-level, our SVM-based classifiers achieve the best published results so far. We also show that the feature space can be aggressively compressed using random indexing while still maintaining comparable classifier performance.
机译:背景技术本文提出了一种对冲检测问题的新颖方法,该方法涉及识别所谓的对冲线索,以将句子标记为确定的或不确定的。这是CoNLL-2010共享任务的任务1的分类问题,其重点是生物医学领域的对冲。我们在这里建议将树篱检测视为一个简单的歧义消除问题,仅限于以前被视为树篱线索的单词。由于分类器的特征空间仍然很大,因此我们还使用随机索引方法进行了降维实验。结果本文开发的基于SVM的分类器在CoNLL-2010共享任务测试数据上的句子级不确定性预测方面取得了迄今为止最好的结果。我们还表明,可以在不牺牲分类器性能的情况下,将随机索引技术成功地用于将原始特征空间的维数降低几个数量级。结论本文介绍了一种简化的方法来检测文本中的推测或不确定性,重点是生物医学领域。在句子级别进行评估,我们基于SVM的分类器迄今为止取得了最佳的发布结果。我们还表明,可以使用随机索引积极地压缩特征空间,同时仍保持可比的分类器性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号